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original architects of the AP® Statistics course. When the course first 
launched in the 1996-1997 school year, there were no textbooks written 
specifically for the high school student audience that were aligned to the 
AP® Statistics topic outline. Along came Dan Yates. His vision for such a 
text became reality with the publication of The Practice of Statistics (TPS) 
in 1998. Over a million students have used one of the first five editions of 
TPS for AP® Statistics! Dan also championed the importance of 
developing high-quality resources for AP® Statistics teachers, which were 
originally provided in a Teachers’ Resource Binder. We stand on the 
shoulders of two giants in statistics education as we carry forward their 
visions in this and future editions. 
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support of our longtime editor and team captain, Ann Heath. Her keen eye 
for design is evident throughout the pages of the student and teacher’s 
editions. More importantly, Ann’s ability to oversee all of the complex 
pieces of this project while maintaining a good sense of humor is 
legendary. Ann has continually challenged everyone involved with TPS to 
innovate in ways that benefit AP® Statistics students and teachers. She is a 
good friend and an inspirational leader. 

Teamwork is the secret sauce of TPS. We have been blessed to 
collaborate with many talented AP® Statistics teachers and introductory 
Statistics college professors over the years we have been working on this 
project. We sincerely appreciate their willingness to give us candid 
feedback about early drafts of the student edition, and to assist with the 
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The purpose of this book is to give you a working knowledge of the big 
ideas of statistics and of the methods used in solving statistical problems. 
Because data always come from a real-world context, doing statistics 
means more than just manipulating data. The Practice of Statistics (TPS), 
Sixth Edition, is full of data. Each set of data has some brief background to 
help you understand where the data come from. We deliberately chose 
contexts and data sets in the examples and exercises to pique your interest. 

TPS 6e is designed to be easy to read and easy to use. This book is 
written by current high school AP® Statistics teachers, for high school 
students. We aimed for clear, concise explanations and a conversational 
approach that would encourage you to read the book. We also tried to 
enhance both the visual appeal and the book’s clear organization in the 
layout of the pages. 

Be sure to take advantage of all that TPS 6e has to offer. You can learn 
a lot by reading the text, but you will develop deeper understanding by 
doing the Activities and Projects and answering the Check Your 
Understanding questions along the way. The walkthrough guide on pages 
xiV—xx gives you an inside look at the important features of the text. 

You learn statistics best by doing statistical problems. This book offers 
many different types of problems for you to tackle. 


e Section Exercises include paired odd- and even-numbered problems 
that test the same skill or concept from that section. There are also some 
multiple-choice questions to help prepare you for the AP® Statistics 
exam. Recycle and Review exercises at the end of each exercise set 
involve material you studied in preceding sections. 


e Chapter Review Exercises consist of free-response questions aligned 
to specific learning targets from the chapter. Go through the list of 
learning targets summarized in the Chapter Review and be sure you can 
say of each item on the list, “I can do that.” Then prove it by solving 
some problems. 


° The AP® Statistics Practice Test at the end of each chapter will help 
you prepare for in-class exams. Each test has about 10 multiple-choice 
questions and 3 free-response problems, very much in the style of the 
AP® Statistics exam. 


¢ Finally, the Cumulative AP® Practice Tests after Chapters 4, 7, 10, 
and 12 provide challenging, cumulative multiple-choice and free- 
response questions like those you might find on a midterm, final, or the 
AP® Statistics exam. 


The main ideas of statistics, like the main ideas of any important 
subject, took a long time to discover and thus take some time to master. 
The basic principle of learning them is to be persistent. Once you put it all 
together, statistics will help you make informed decisions based on data in 
your daily life. 


TPS and AP® Statistics 


The Practice of Statistics (TPS) was the first book written specifically for 
the Advanced Placement (AP®) Statistics course. Like the previous five 
editions, TPS 6e is organized to closely follow the AP® Statistics Course 
Description. Every item on the College Board’s “Topic Outline” is 
covered thoroughly in the text. Visit the book’s website at 
highschool.bfwpub.com/tps6e for a detailed alignment guide. The few 
topics in the book that go beyond the AP® Statistics syllabus are marked 
with an asterisk (*). 

Most importantly, TPS 6e is designed to prepare you for the Ap® 
Statistics exam. The author team has been involved in the AP® Statistics 
program since its early days. We have more than 40 years’ combined 
experience teaching AP® Statistics and grading the AP® exam! Both of us 
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have served as Question Leaders for more than 10 years, helping to write 

scoring rubrics for free-response questions. Including our Content 

Advisory Board and Supplements Team (page vi), we have extensive 

knowledge of how the AP® Statistics exam is developed and scored. 

TPS 6e will help you get ready for the AP® Statistics exam throughout 
the course by: 

e Using terms, notation, formulas, and tables consistent with those 
found on the AP® Statistics exam. Key terms are shown in bold in the 
text, and they are defined in the Glossary. Key terms also are cross- 
referenced in the Index. See page F-1 to find “Formulas for the AP® 
Statistics Exam,” as well as Tables A, B, and C in the back of the book 
for reference. 

¢ Following accepted conventions from AP® Statistics exam rubrics 
when presenting model solutions. Over the years, the scoring 
guidelines for free-response questions have become fairly consistent. 
We kept these guidelines in mind when writing the solutions that appear 
throughout TPS 6e. For example, the four-step State-Plan—Do— 
Conclude process that we use to complete inference problems in 
Chapters 8-12 closely matches the four-point AP® scoring rubrics. 

e Including AP® Exam Tips in the margin where appropriate. We 
place exam tips in the margins as “on-the-spot” reminders of common 
mistakes and how to avoid them. These tips are collected and 
summarized in the About the AP® Exam and AP® Exam Tips appendix. 

¢ Providing over 1600 AP®-style exercises throughout the book. Each 
chapter contains a mix of free-response and multiple-choice questions 
that are similar to those found on the AP® Statistics exam. At the start 
of each Chapter Wrap-Up, you will find a FRAPPY (Free Response 
AP® Problem, Yay!). Each FRAPPY gives you the chance to solve an 
AP®-style free-response problem based on the material in the chapter. 
After you finish, you can view and critique two example solutions from 
the book’s Student Site (highschool.bfwpub.com/tps6e). Then you can 
score your own response using a rubric provided by your teacher. 

Turn the page for a tour of the text. See how to use the book to realize 

success in the course and on the AP® Statistics exam. 
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READ THE TEXT and use the book’s features 
to help you grasp the big ideas. 


Section 3:1 Scattorplots snd Gorrclation Wy 


| SECTION 3.1 | Scatterplots and Correlation 


LEARNING TARGETS (8y thw ond of the section, you should be able to: 


* Distinguish between explanatory and response * Interpret the correlation. 


variables for quantitative data. * Understand the basic properties of correlation, 
+ Make a scatterplot to display the relationship including how the correlation is influenced by 
between two quantitative variables, outliers, 


* Describe the direction, form, and strength of * Distinguish correlation from causation. 
a relationship displayed in 8 scatterplot and 
identity unusual features. 


Mun statistical studies examine data on more than one variable. Fortunately, 
analysis of relationships between two variables builds on the same tools we 
used to analyze one variable, The principles that guide our work also remain 
the same: 


© Plot the data, then look for overall patterns and departures from those patterns. 


© Add numerical stunmaries. 
© When there's a regular overall pattern, use a simplified model to deseribe it, 


Explanatory and Response Variables 


tn the “Candy grab” activity, the mirmber of candies is the response variable. 
Hand spat is the explanatory variable because we anticipate that knowing a sta 
dent's hand span will help us predict the number of candies that student can grab. 


A response variable measures an outcome of a study. An explanatory variable 
may help predict or explain changes in a response variable. 


You will often see explanatory It is easiest to identify explanatery and response varlables when we initially 
varables called sdeperdentt specify the values of one variable to see how it affects another variable. For 
varaibles ard eesponse variables instance, to study the effect of aleohol an body temperature, researchers gave 
Called dependent verbies Becouse — several different amounts of alcohol to mice, Then they measured the change in 
(te words iaciepenateiit and cach mouse’s body temperature 15 minutes liter, In this case, amnonnt of aleohol 
brorenaralenrialsottad is the explanatory variable, and change in body temperature is the response vari- 

; able. When we don't specify the values of either variable before collecting the 
data, there may or may not be a clear explanatory variable. 


HOW TO DESCRIBE A SCATTERPLOT 


‘To describe a seatterplot, make sure to address the following fourcharacteristics 

in the context of the data: 

© Direction: A scatterplot can show a positive association, 1 
association, oF no association 

© Form: A scatterplot can show a linear form of a nonlinear for 
form is linear if the overall pattern follows a straight line, Ol 
form is nonlinear. 

© Strength: A scatterplot can show a weak, modente, or strong) 

An association is strong if the points don’t deviate much from 

identified, An usociation is weuk if the points deviate quite af , 


Kew relationships are linear for all values of the explanatory variable. Don't 
oO make predictions using values of x that are much larger of much smaller than 
those that actually appear in your data, 


LEARN STATISTICS BY DOING 
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STATISTICS 


TIVITY Bertie 


Inv this activity, you will investigate if students with a linger hand span can grab 
tnore candy than students with a smaller hand span! 


1. Measure the span of your dominant land te the nearest half 
centimeter (em). Hand span is the distance from the tip of the thitab 
to the tip of the pinkie finger on your fully streteled-out han 


ww 


. One student at a time, go to the front of the class and use your 
dominant hand to grab as many candies as possible from the 
container, You must grab the candies with your finger pointing 
down (ne scooping!) and hold the cundies for 2 seconds before 
counting them, After counting, put the candy back into the 
container, 


3. On the board, record your hund span and number of candies in 
a table with the following headings; 


y [Hand span (em) Nurober of candies 


| 4, While other students record their valties on the board, copy 
the table onto a piece of paper and make a griph, Begin by 
constructing a set of coordinate axes. Label the horizontal axis 


the vertical axis “Number of candies.” 
“ 
Gues the correlation ra 


scale for each axis and plot cach poiut 
eas accurately as you can on the graph, 

In this activity, we will have a class competition to see who can best guess the 

correlation 


Hl you about the relationship between 
of candies? Summarize your observa- 


1, Load the Guess the Correlation applet at wow rossmanchance.com/ 
applets 


2. The teacher will press the “New Sample” 

button to see a “random” seatterplot, As 

a class, try to. guess the correlation, Type 

Daa the guess in the “Correlation guess” box 
and press “Cheek Guess” to see how the 


Correlation Guessing Game 


Number of points: BE] yom 


© Edit’ Paste Dare 


Correlat 


Chapter 3 Project Investigating Relationships 
in Baseball 


What ts 2 better predictor ol the mutniber of wine for a baseball Become the number of wins a tenn has is dependent on 


team, the mimber of runs scored by the tearm of the mumnber 
of nany they allow the other team to sence? What variables eurt 
we use ty predict the tamber of nes a tearm scotes? Ty predict 
the number af nuns it allows the other teany to score? In this 
project, you will ase tetmotagy to help unewer these questions 
by exploring a lange set of duty from Major League Baseball 


both how many ruts they seore and ow many cans they 
allow, we can use a combination of both vartibles to pre- 
diet the rember of wins, Add 2 colunin in your data table 
foe a new Varluble, ran differential, Pill tn the values using 
the formula WG = RAKG 


Create 4 atterplat fo snvestigate the relationship between 


rin differential and wins. Then calonvtate the equation af 

Part | the Ieaveequates tegtession Tite, the standan! deviation af 

the reshihuats, anid ¢* 

1. Download the “MLB Team Datu 2012-2016" Excel file 9_ fs run diffcrential » better predictor then the variable you 
frown the hacks website, along with the “Glowary for MLB 
‘Team Data file.” which captains cach of the variables 
iieluiled int the date set. Lenport the deat inte 
cal paltware package you prefer 


chose fr Question 4? [xpliin your teasoning, 


i) 


Create a seutterplot to investigate the relationship} 

runs scored per game (RAG) and were (W) ‘Thd 

lute the eqention af the leaslaquiares regression 

stamdurd deviation of the reuduals, and r*, Note: 

the seotiot: bor hitting statistics atl Wo in the 
idelsieng slatiabion. 


CHECK YOUR UNDERSTANDING 


In Exercises 3 and 7, we asked you to make and describe a scatterplot for the 
hiker data shown in the table: 


Body weeight (Ib) 120 187 109 103 131 165 158 116 
Backpack worght (Ib) 2% 30 26 24 29 35 31 2B 


L. Galoulate the equation of the least-seniares regression line. 
2 Make a residual plot for the Hinewr model in Question 1 


3. What does the residual plot indicate about the appropriateness of the linear model? 
Eqplain your anywer. 


Ii abies tes back of te beni 


EXAMPLES: Model statistical problems and 
how to solve them 
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Read through each EXAMPLE, and then try out the concept yourself 

by working the FOR PRACTICE, TRY exercise in the Section Exercises. 
Old Faithful and fertility 
Describing a scatterplot 


Need extra help? Examples and exercises 
marked with the PLAY ICON \e) are supported 
by short video clips prepared by experienced 
AP® Statistics teachers, The video guides you 
through each step in the example and solution 
and provides additional explanation when you 
need it. 


PROBLEM: Describe the relationship in each of the 

following, contexts 

(a) The scatterplot on the left shows the relationship between 
the duration (in minutes) of an eruption and the interval 
of time until the next eruption (in mimstes) of Old Waithfal 
during a particular month 


(b) The soatterplot on the right shaws the relationship between the average income (gros domestic product 
per person, in dollars) and fertility rite (1mmbes of childten per worn) in 187 countries.* 


Example: Old Faithful and fertility 


Dever Re the relation 1” eseh of the Nukumeng camtents 


ah Ve hatte nt Mtn ems tha 


inn 


Average jacemwe ($+ 


nuit 


(a) There ie a strong, positive linear relationship between the duration 
of an eruption and the interval of time until the next eruption, There 
are two main clusters of points: one cluster has durations around 


Caffeine and pulse rates 
How random assignment works 


PROBLEM: A total of 20 students have agreed to participate 
in un experiment comparing the effects of caffeinated cola 
wid caffeine-free cola on pulse rates, Describe how you would 
randornily assign 10 students to cach of the two treatments: 


(a) Using 20 identical dips of paper 


THE VOICE OF THE TEACHER. 
Study the worked examples 


(b) Using technology 
(¢) Using Table D 


SOLUTION: and pay special attention to 
(a) On 10 slips of 5 write the letter "A"; on the remaining When descriding @ method of random | carefully placed ‘eacher 
r 10 ipa, write theleteer°B- Chute the sips of pepar and salgimaes Gorn aie rtng | me iin 


Talk” comment boxes that 
guide you step by step through 
the solution. These comments 
offer lots of good advice—as if 
your teacher is working directly 
with you to solve a problem. 


hand out one slip of paper to each volunteer, Students who get —_ groups. Maks sure t entity which group 
nA" slip receive the cola with caffeine and students who get —_ gets which treatment, 
3°" slip receive the cola without caffeine, 

(b) Label cach otudent with a different integer from 1 to 20. Then —_ When using a mindom number generator ora 
randomly generate 10 different integers from 1 to 20. The sable of random digits tp assign troatments, 
Studente with these labels receive the cola with caffeine, The muiko ure t0 account for the possibilty of 
remaining 10 students receive the cola without caffeine, _Mepeated numbers when descrivwng your method 

(c) Label each student with a different integer from 01 to 20. Go to alin of Table D and read two-digit groups 
moving from left to right. The first 10 different labels between 01 and 20 identify the 10 students who 
receive cola with caffeine. The remaining 10 students receive the caffeine-free cola. ignore groups of digits 
from 21 to00. 


08 PRACTICE. TRY EXERCISE 63} 


The blue page number icon next — 

5 63, Layofis and “survivor guilt” Workers who survive a 
to an exercise points you back layolf of other einphoyees at theis location ttuay sulfer 
to the page on which the model 2 froin “waneivor wuilt” A study of stteciwor allt and its 


cHeets uved as subjects 120 studetita whe were offered 
example appears. at Opportunity to carn estes coune ctedit by doing 
prooheading. Each sibject worked ix the sume cub 
cle as another student, whe wis an accomplice of the 
experimenters, Ata browk midway throvel: the work, 


EXERCISES: Practice makes perfect! 
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Start by reading the SECTION SUMMARY to be sure that 
you understand the big ideas and key concepts. 


Section 3.1 


Summary 


© Ascatterplot displays the relationship betwoen two quantitative variables 


measured on the same individuals. Mark vahiex of one variable on the hori- Practice! Work the EXERCISES 

zontal axis (x avis) ane values of the other variable on the vertical axis (y axis) 

Plot cach individual's data as 4 point on the graph assigned by your teacher. Compare 
©  Ifwe think thata variable x may help predict, explain, or even cause changes your answers to those in the 

in another vuriable y, we call x an explanatory variable und yu response vari- 

able, Always plot the explanatory variable on the x axis of a scatterplot. Plot Solutions appendix at the back of the 


the response variable on the axis book. Short solutions to the exercises 
© When describing a scutterplot, look for an overall pattern (direction, form, numbered in red are found in the 


strengths) ai i 
appendix. 


arewer itl ¢ 


*  Direet 


Section 3.1 | Exercises 


|. Coral reefs and cell phones Identify the explanatory Track and Field team."’ Deseribe the relationship 
98EB variable and the response variable for the following between height and weight for these athletes 


Most of the exercises are paired, meaning that 2 relationships, if possible. Explain your reasoning 
i 2 ‘ (a). ‘The weight gait of comals its aquariums where the 
odd- and even-numbered exercises test the et terperal ila como al ia dtSscent lec 
same skill or concept. If you answer an assigned (b). ‘The nirmbet of tear mesiages sent and the number af 
exercise incorrectly, try to figure out your mistake phone cally made in a sample of 100 students 
A f 


Then see if you can solve the paired exercise. - “Tootmyer und corm yield Identify the exphinatory 


variable and the respome variable tne the following 


nw 


tolationships, Hf posible. Explain your reasoning 


{a} ‘The height and acm span ofa semple of SO teenager 


(b). The yield of corm in bushels por acre and the amount Caen SOE NNN DE 

Look for ICONS that appear af min in the growing season Heigha (in) 
next to selected EXERCISES 3. Heavy backpacks Ninthagade students at the Webb 4. Starbucks The satterplat shows the relationship 

“ 999 Schools go on a hackpacking trp each tall. Studenits are between the amount of fat (in grane) and number 
They will guide you to e divided inte hiking groups of size 8 by — narnes of calories in products sold at Starlucks"! Deseribe 

from a hat. Before leaving, students and their backpacks the roluthonship between fat and calories for these 
¢ the Example that models fate weighed. The data here are from one hikiuig group. ui 
the exercise. Make a scatterplot by hand that slows how backpack 


weight relates to body weight 


: os that provide step-by- Body weight (Id) 120 187 100 108 131 185 158 116 no 
step instructions for solving Backpack weight (ij 20 MH) 26 24 29 35 HN OB Ae 


4. Putting success How well do profesional xolfers putt i 
from various distances to thre hole? The data show a 


Multiple Choice: Select the hext anewer for Exercises 71-28 
TL. Which of the fallowing is aot a characteristic of the 
Exercise: Heavy Backpacks leanhsquutes regresoiun bine? 

(a 


The slope of the lers-squures nearesdon Tinie te always 


Mahe scattered by Hea oat (Pata Dane See Dynes rghit te beinosen = and 


Sony met 


th) The least-squares regression line always goes throug! 
the point Ce, 9) 


(e} The leatequares regression fine minimizes the sum of 


yquared residual 


(d) The slope of the leavbsyutes regression line 
always have the same sign as the correlation 


Recycle and Review 
74, Fuel coonomy (2.2) In ity recent Fuel Eeunomy Gunde 
the Envi ptal Protection Ageney (PA) gives 
data on H1S2 vehichs There are a number of outliers, 
mainly vehicles with very poor gas milcasre or hybrids 
with wery good gus mileage. If we ignore the outliers, 
however, the combined city and highway gas tnibeage 
of the other 1120 orse vehicles is approximately Now- 


Various types of problems in the Section Exercises let you ral with mean 18.7 miles per gallon (mp) and stan 
% =. dard deviation +3 tpa 

practice solving Shs different types g eangisioni mening The Chevrolet Malibe with # foureylinder engine 

AP®-style multiple-choice and free-response. The Recycle has a combined gis mileage of 25 mpg, What percent 


and Review exercises refer back to concepts and skills learned warts! vehicles have wone yas mileage than the 
in an earlier section, noted in purple after the problem title. 


(eo) The leist-squiares regression Tine ts not resiituert 


REVIEW and PRACTICE for quizzes and 
tests and the AP® STATISTICS EXAM 
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Chapter 3 Wrap-Up at 


Chapter 3 Review 


Section 3.1: Scaftarplots and Corretation seulterplol ts (he appropriate type of graph to investigate rela 
In this secon, vou leamed how to explote the relationship — thoayhips between two quantitative variables. Tb desenbe 
between two quantitative variables, Ax with distributions of a soatterplot, be sure to discuss four characteristics: diree- 
a single vuriable, the first step in ulwayato make agaph A ten, form, strength, und uhusual featares. The direction of 


What Did You Learn? 


Learning Target 


Distioguish between explanatory and response variables for 

nti ct. 31 1t fa4 
Male & scattenplot to dieplay the rulationship between two 
quantitative varialves. 


Describe the ditection, form, and steng it of a relutionship dkaptiyadt 
in a Scatterplot and identity unusual fates, 


(ntaepret the Comotation. 


Understand Ife hasic properties of corredation, inclueling haw te 
correlation ia Inffuenced try outtiens. 


Distinguish cotretation from causation. 


Make predictions using regression lines keaping in mind the 
dangers of extrapolation 


Significance test tor p 


Name One-sarmple ztest tor (Qne-saenpie test for px 
reared) (1 Prone Teen 
Formula ~ i 
pe i: ee 
[nat 0 & 
‘i Va 
P-value toms standard Normal distripution Pome trom rdisrinuten wath df =a —1 


Chapter 3 Review Exercises 


the poputstion of Interest, the population of interest, 
10% When sampling without replacement, 10% When sampling without rephicernent, 
‘These evercises ure designed to help you seview the important —(e) Point Bist n<0.10N, 2<0.10N. 
ides and mothods of the chapter. point have d © Large Counts Gott ret ated oft — gh) areat last 1(). © Nonmal/Large Sample: Ihe cxgutation fas a Norra 
; That is, the eopected eumtier of successes and the istribution or the sample size is lange (9 = IO) W the 


R31 Born to be old? ts there a relationship between the 
eo gestational period (time from conception to bith) of 


expected number of thilures in the sample are both at populstion distribution has unknown supe and re < 30. 


a seatterplot of the gestational period and average life e 
span for 43 spectes of animals” meee 
For all bat the shallowest dives, there is an association 
between x= depth (in meters) and y = dive duration 


oA 8 (in minutes) that is different for cach penguin pig 
* study wives 4 scatterplot for one penguin titled 
Relation of Dive Duration (y) to Depth (x). The P P 
° terplot shows.an awociation that is positive, ti Review Exercise: Late bloomers? 
—m 6 ° and strong. = 
= 2° (a) Explain the meaning of the tenn positive associati ddr of a iocramchashnctaran oo 


Interpret the slope and y intercept of the line in this 


this context. 
1 “vVepr linear usnciatic setting. 


The correlation isr = —0.B5. = 
The equation of the LSRL is 9 = 33.12 — 4.69x, 

where 9 represents the predicted number of days and 
x represents the average Morch temperature. 
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Conditions © Random: Tly: dita come fom a random sample hom = Randome The cat Come from # randoen sample trom 


few 10. Use A grag ol the sample data to aesemy the Normality of 
an animal and its average life span? The figure shows ABZ Penguins the pupuiation. Do nut uae ¢ procedunes # tho graph shows 


stg sterner tis 


Chapter 3 AP® Statistics Practice Test 


Section |; Multiple Choice Select the best answer for each question, 


Ts. 


4 A school guidance counselor examines how many 
extracurricular activities students participate in and 
their grade point avenge. The guidance counselor 
ways, “The evidence indicates that the correlation 
between the number of extracurricular activities a 
student participates in und his or her grade point aver- 
age iy close to 0." Which of the following is the must 


Question TF.3-T3,5 refer to the following setting 
Scientisty esumined the activity level of 7 fish at different 
ternperatires. Fish activity wan tated ot a scale of 0 (no 
activity) to [00 (maxitnal activity), The temporature 

was measured in degrees Celsius. A computer 

regression pemtout and a cesidual plot are provided. 
Notice that the hertzontal wus on the residual plot is 


Each chapter concludes with an 
AP® STATISTICS PRACTICE TEST. 


This test includes about 10 AP®- 
style multiple-choice questions and 


3 free-response questions. 


appropriate conclusian? 


labeled “Fitted value,” which means the same thing as 


(a) Students inwolved in neimy exteicurricuhie activities “predicted value.” 
tend to be students with poor grades m 
(bh) Students with good grades tend to be students whe ate al 
pot involved in many extracurricular activities oa 
fe} Students involved in ouny estmourtioular activities are as 2) 
just a likely to wet good erides at bad grades | oot wiles 
(d) Studenty with good gnides tend to be students who are = . 


lavolved in many extmouricular activities, 


(ec) No conclusion sould be made based on the corelution 
without looking ata seatterplot of the dtu 


Cumulative AP® Practice Test 1 


Section |: Multiple Choice Choose the het answer for Questions API I-AP)14. 


APS.2 A child ts 4) inches tall, which places her at the Oth 
pereentile of all children of similar age. The heights 
tor children of ths age form an approximately Nor- 
tual distritration with a mean of 38 inches Based on 
this information, what is the standard deviation of 
the heiwhts of all childeen of this age? 

fa) O.20ineh 

th) O31 inch 


fc} 0.65 inch 


AP1.1 You look at real estate uds for houses in Sarasota, 
Florida. Many houses huwe prices from $200,000 to. 
$400,000. The few houses on the water, however, 
have prices up to $15 millian, Which of the fol- 
lowing statements best deseribes the distribution of 
homie prices in Sunisotat 

fa) “The distribution fy most likely skewed to the left, and 

the roean is greater than the redian 

(b) ‘The distribution is moyt likely skewed to the feft, and 

the mewn iy less than the median 

(oc) The distribution is roughly symmetric with » few high out 

liers, and the nncan is approximately equal to the median, 

(d) ‘The distribution is roost likely skewed to the right, and 

the meat) is preuter than the median. 

(e) “The distribution is mont likely skewed to the cght. and 

the mean is lose than the nvedian. 


Four CUMULATIVE AP® PRACTICE 
TESTS simulate the real exam. 
They are placed after Chapters 4, 
7, 10, and 12. The tests expand in 


(d) 1.21 inches 
(ec) 1.56 inches 


length and content coverage as you 
work through the book. The last test 
models a full AP® Statistics exam. 


FREE RESPONSE AP 


PR 


The following problem is modeled after actuat AP” Statistics 


stions. Your task i to ganerite a 00 


a) Briefly describe the association shown in the 
walterplot 


exam tree fr en goo 


plete, concise response in 15 minulas b) The equation of the least-squares regression line for 
these data is $= 180.8 + 15. Sy. Interpret the slape 
of the line in the context of the stusdly 

(ce) Caleulate and imterpret the cesidual for the flower 
that had 2 tablespoons of suger and looked fresh for 
DH hears 

Suppose that another group af students conducted 
a Situilar experiinent siting 12 fowen, but included 
different vaticties in addition to carmutions. Would 
you expect the valiie of r for the seodud group's data 


Directions: Show all your work. Indicate clearly the meth 
ods you use, because you will he sored on the vorrectnes 


of your methods as well to on the docuracy und complete 


new of your rweults and explanations 


Learn how to answer free 
response questions successfully 
by working the FRAPPY!—the 


TWo statistics students went to a flower shop and ean. «d) 
dotly selected 12 carnations When they gol home, the 
students prepared 12 identical caves with exactly the sime 
amount of water in each \ase. They put one tablespoon of 


sugar in 9 vases, two tablespoons of sugar in 3 vases, and to be greater than, lew than, or about the same as the 


value off for the first group's data? Explain. 


Free Response AP® Problem, 
Yay!—that begins the Chapter 
Wrap-Up in every chapter. 


three tablespoons af sugar in 3 vases In the cemaining F 
vases, they put no sugar After the vanes were prepared 
the students nindamly assigned | camubon to cach vase 
and observed how many hours each flower continued to 
look fresh. A seatterptot of the data bs shown below 


After you finish, you can view two example solutions o8 the 
book's wabsite (highschool biwpub com/tpsée) Ostermina 
whether you think gach solution is “complete. 
developing,” o¢ “minie 
imp wqaest lo the student who wrote it? 
. Finally, your 1 Tite you with 
your response and sote what, if anytieng. you would do diftereatly 
10 improve your Own score 


substantial 


t the solution is nat complete, what 


vermont 


ying rubric. Score 


Use TECHNOLOGY to discover and analyze 
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3. Technology Corner 


highschool, blwpub.com/tpsée. 


computer software. 

L One-variable statistics on the T1-83/54 
1. Enter the data in list L1. 
2. Find the summary statistics for the shoe data, 
© Press (STAT) (CALC); choose 1-VarStats. 


Arrow down to Calculate and press ENTER} 
Older OS: Press [2nd] [7] (L.1) and (ENTER) 


Descriptive Btatistics: Those 


Although the Technology 
Comers focus on the TI-83/84 
graphing calculator, output 
from multiple programs— 
including Minitab and JMP—is 
used in the book's Examples 
and Exercises to help you 
become familiar with reading 
and interpreting many 
different kinds of statistical 
summaries. 


© Your Pracice of Stataces, Gin Edition Course 
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COMPUTING NUMERICAL SUMMARIES 


Tl-Nspire and other technology instructions are on the book's website at 


Let’s find numerical summaries for the boys’ shoes data from the 
example on page 64. We'll start by showing you how to compute 
summary statistics on the 'T1-33/84 and then look at output from 


OS 2.55 or later: In the dialog box, press [2nd] [7] (Ll) and 
TER) to specify L} as the List Leave Freq! ist blank 


© Press [-¥] to see the rest of the one-variable statistics. 


Med=9 
IL. Output from statistical software We used Minitab statistical software to | Q211.5 
calenlate descriptive statistics for the boys’ shoes data. Minitab allowsyou | ™2*%=38 
to choose which mumerical summaries are included in the output. 


Variable N Mean StDev Minieum Q1 Median Q3 
Shoes 20 11.65 9.42 4.00 6.25 9.00 11.75 38.00 


Note: The T1384 gives the first and third quartiles of the boys’ shoes distribution as QO, = 6.5 and Q; = 11.5. 
Minitab reports that Q; = 6.25 and Q; = 11.75. What happened? Minitab and some other software use 
slightly different rules for locating quartiles. Results from the various rules are usually close to each other. 
ating quartiles as they may affect more than just the JOR. 


Maxioum 


3.2 Technology Comers 

TI-Nspire and other technology instructions are on the book's website at 
highschool. bfwpub.com/tpsse. 

9, Calculating least-squares regression lines Page 184 
10. Making residual plots Page 187 


The = 


Practice. 
Statistics 
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Use technology as a tool for discovery and analysis. 
TECHNOLOGY CORNERS give step-by-step 
instructions for using the Tl-83/84 calculator. 
Instructions for the Tl-Nspire and other calculators are 
on the book's Student Site (highschool.bfwpub.com/ 
tpsGe) and in the e-Book platform. 


Technology Corner videos are also available 
to walk you through the key strokes needed to 
perform each analysis. 


Sees orf 

= & « 
2:Plot2. off 
—<u be 
3:Plot3_off 
ee 


4:Plotaorf 
S:PlotsOn 


Find the Technology Comers easily by 
consulting the summary table at the 
end of each section or the complete 
table at the back of the book. 


Read, practice, access the 
resources, and do homework 
assignments online with the new 
Online Homework and e-Book 
Platform that may be purchased 


‘sx | to enhance your learning 


experience. 


Does listening to music while studying help or hinder learning? If an 
athlete fails a drug test, how sure can we be that she took a banned 
substance? Does having a pet help people live longer? How well do SAT 
scores predict college success? Do most people recycle? Which of two 
diets will help obese children lose more weight and keep it off? Can a new 
drug help people quit smoking? How strong is the evidence for global 
warming? 

These are just a few of the questions that statistics can help answer. 
But what is statistics? And why should you study it? 


istockphoto 


Data are usually numbers, but they are not “just numbers.” Data are 
numbers with a context. The number 10.5, for example, carries no 
information by itself. But if we hear that a family friend’s new baby 
weighed 10.5 pounds at birth, we congratulate her on the healthy size of 
the child. The context engages our knowledge about the world and allows 
us to make judgments. We know that a baby weighing 10.5 pounds is quite 
large, and that a human baby is unlikely to weigh 10.5 ounces or 10.5 
kilograms. The context makes the number meaningful. 

In your lifetime, you will be bombarded with data and statistical 
information. Poll results, television ratings, music sales, gas prices, 
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unemployment rates, medical study outcomes, and standardized test scores 
are discussed daily in the media. Using data effectively is a large and 
growing part of most professions. A solid understanding of statistics will 
enable you to make sound, data-based decisions in your career and 
everyday life. 


It is tempting to base conclusions on your own experiences or the 
experiences of those you know. But our experiences may not be typical. In 
fact, the incidents that stick in our memory are often the unusual ones. 


Do Cell Phones Cause Brain Cancer? 


Italian businessman Innocente Marcolini developed a brain tumor at age 
60. He also talked on a cellular phone up to 6 hours per day for 12 years as 
part of his job. Mr. Marcolini’s physician suggested that the brain tumor 
may have been caused by cell-phone use. So Mr. Marcolini decided to file 
suit in the Italian court system. A court ruled in his favor in October 2012. 


Bloomberg via Getty Images 


Several statistical studies have investigated the link between cell-phone 
use and brain cancer. One of the largest was conducted by the Danish 
Cancer Society. Over 350,000 residents of Denmark were included in the 
study. Researchers compared the brain-cancer rate for the cell-phone users 
with the rate in the general population. The result: no statistical difference 
in brain-cancer rates.+ In fact, most studies have produced similar 
conclusions. In spite of the evidence, many people (like Mr. Marcolini) are 
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still convinced that cell phones can cause brain cancer. 

In the public’s mind, the compelling story wins every time. A 
Statistically literate person knows better. Data are more reliable than 
personal experiences because they systematically describe an overall 
picture, rather than focus on a few incidents. 


Are You Kidding Me? 


The famous advice columnist Ann Landers once asked her readers, “If you 
had it to do over again, would you have children?” A few weeks later, her 
column was headlined “70% OF PARENTS SAY KIDS NOT WORTH 
IT.” Indeed, 70% of the nearly 10,000 parents who wrote in said they 
would not have children if they could make the choice again. Do you 
believe that 70% of all parents regret having children? 


istockphoto 


You shouldn’t. The people who took the trouble to write to Ann 
Landers are not representative of all parents. Their letters showed that 
many of them were angry with their children. All we know from these data 
is that there are some unhappy parents out there. A statistically designed 
poll, unlike Ann Landers’s appeal, targets specific people chosen in a way 
that gives all parents the same chance to be asked. Such a poll showed that 
91% of parents would have children again. 
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Where data come from matters a lot. If you are careless about how you 
get your data, you may announce 70% “No” when the truth is close to 90% 
“Yes.” 


Who Talks More—Women or Men? 


According to Louann Brizendine, author of The Female Brain, women say 
nearly 3 times as many words per day as men. Skeptical researchers 
devised a study to test this claim. They used electronic devices to record 
the talking patterns of 396 university students from Texas, Arizona, and 
Mexico. The device was programmed to record 30 seconds of sound every 
12.5 minutes without the carrier’s knowledge. What were the results? 

According to a published report of the study in Scientific American, 
“Men showed a slightly wider variability in words uttered. . . . But in the 
end, the sexes came out just about even in the daily averages: women at 
16,215 words and men at 15,669.”2 When asked where she got her figures, 
Brizendine admitted that she used unreliable sources.2 

The most important information about any statistical study is how the 
data were produced. Only carefully designed studies produce results that 
can be trusted. 


Yogi Berra, a famous New York Yankees baseball player known for his 
unusual quotes, had this to say: “You can observe a lot just by watching.” 
That’s a motto for learning from data. A carefully chosen graph is often 
more instructive than a bunch of numbers. 


Do People Live Longer in Wealthier Countries? 


The Gapminder website, www.gapminder.org, provides loads of data on 
the health and well-being of the world’s inhabitants. The graph below 
displays some data from Gapminder.4 The individual points represent all 
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the world’s nations for which data are available. Each point shows the 
income per person and life expectancy for one country, along with the 
region (color of point) and population (size of point). 


Hongqi Zhang/Alamy 


We expect people in richer countries to live longer. The overall pattern 
of the graph does show this, but the relationship has an interesting shape. 
Life expectancy rises very quickly as personal income increases and then 
levels off. People in very rich countries like the United States live no 
longer than people in poorer but not extremely poor nations. In some less 
wealthy countries, people live longer than in the United States. Several 
other nations stand out in the graph. What’s special about each of these 
countries? 


7 See 


(@ America 
© Europe & Russia 


Life expectancy (years) 


Asia & Pacific 
B Africa 


T T T T T T T 
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Income per person in 2015 ($) 
Graph of the life expectancy of people in many nations against each nation’s 
income per person in 2015. 


Individuals vary. Repeated measurements on the same individual vary. 
Chance outcomes—like spins of a roulette wheel or tosses of a coin—vary. 
Almost everything varies over time. Statistics provides tools for 
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understanding variation. 


Have Most Students Cheated on a Test? 


Commercial Eye/Getty Images 


Researchers from the Josephson Institute were determined to find out. So 
they surveyed about 23,000 students from 100 randomly selected schools 
(both public and private) nationwide. The question was: “How many times 
have you cheated during a test at school in the past year?” Fifty-one 
percent said they had cheated at least once.2 

If the researchers had asked the same question of all high school 
students, would exactly 51% have answered “Yes”? Probably not. If the 
Josephson Institute had selected a different sample of about 23,000 
students to respond to the survey, they would probably have gotten a 
different estimate. Variation is everywhere! 

Fortunately, statistics provides a description of how the sample results 
will vary in relation to the actual population percent. Based on the 
sampling method that this study used, we can say that the estimate of 51% 
is very likely to be within 1% of the true population value. That is, we can 
be quite confident that between 50% and 52% of all high school students 
would say that they have cheated on a test. 

Because variation is everywhere, conclusions are uncertain. 
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Statistics gives us a language for talking about uncertainty that is 
understood by statistically literate people everywhere. 
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| Data Analysis 


AfriPics.com/Alamy Stock Photo 
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Section 1.1 Analyzing Categorical Data 


Section 1.2 Displaying Quantitative Data with Graphs 
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INTRODUCTION Statistics: The Science and Art of 
Data 


LEARNING TARGETS By the end of the section, you should be able to: 


e Identify the individuals and variables in a set of data. 


e Classify variables as categorical or quantitative. 


We live in a world of data. Every day, the media report poll results, 
outcomes of medical studies, and analyses of data on everything from 
stock prices to standardized test scores to global warming. The data are 
trying to tell us a story. To understand what the data are saying, you need 
to learn more about statistics. 


DEFINITION Statistics 


Statistics is the science and art of collecting, analyzing, and drawing 
conclusions from data. 


A solid understanding of statistics will help you make good decisions 
based on data in your daily life. 


Organizing Data 


Every year, the U.S. Census Bureau collects data from over 3 million 
households as part of the American Community Survey (ACS). The table 
displays some data from the ACS in a recent year. 
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Household 

425 
936459 
50055 
592934 
545854 
809928 
110157 
999347 


Region 
Midwest 
West 
Northeast 
West 
South 
South 
Midwest 
South 


Number of 
people 


fs) 


PrN O BRN BS 


Rudy Sulgan/Corbis 
Documentary/Getty Images 


Time in 
dwelling 
(years) 

2-4 
2-4 
10-19 
2-4 
2-4 
30+ 
5-9 
<1 


mode income 
Internet 52,000 
Mail 40,500 
Internet 481,000 
Phone 230,800 
Phone 33,800 
Internet 59,500 
Internet 80,000 
Mail 8,400 


Response Household Internet 


access? 


Most data tables follow this format—each row describes an individual 


and each column holds the values of a variable. 


DEFINITION Individual, Variable 


An individual is an object described in a set of data. Individuals can 


be people, animals, or things. 


A variable is an attribute that can take different values for different 
individuals. 


Sometimes the individuals in a data set are called cases or observational units. 


For the American Community Survey data set, the individuals are 
households. The variables recorded for each household are region, number 
of people, time in current dwelling, survey response mode, household 
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income, and whether the dwelling has Internet access. Region, time in 
dwelling, response mode, and Internet access status are categorical 
variables. Number of people and household income are quantitative 
variables. 


Note that household is not a variable. The numbers in the household 
column of the data table are just labels for the individuals in this data set. 
Be sure to look for a column of labels—names, numbers, or other 
identifiers—in any data table you encounter. 


DEFINITION Categorical variable, Quantitative variable 


A categorical variable assigns labels that place each individual into a 
particular group, called a category. 


A quantitative variable takes number values that are quantities— 
counts or measurements. 


o@ Not every variable that takes number values is quantitative. 


Zip code is one example. Although zip codes are numbers, they are neither 
counts of anything, nor measurements of anything. They are simply labels 
for a regional location, making zip code a categorical variable. Some 
variables—such as gender, race, and occupation—are categorical by 
nature. Time in dwelling from the ACS data set is also a categorical 
variable because the values are recorded as intervals of time, such as 2—4 
years. If time in dwelling had been recorded to the nearest year for each 
household, this variable would be quantitative. 


To make life simpler, we sometimes refer to categorical data or 
quantitative data instead of identifying the variable as categorical or 
quantitative. 


EXAMPLE | Census At School (>) 


Individuals and Variables 


De 


Garry Black/Alamy 


PROBLEM: Census At School is an international project that collects 
data about primary and secondary school students using surveys. 
Hundreds of thousands of students from Australia, Canada, Ireland, 
Japan, New Zealand, South Africa, South Korea, the United Kingdom, 
and the United States have taken part in the project. Data from the 
surveys are available online. We used the site’s “Random Data 
Selector” to choose 10 Canadian students who completed the survey in a 
recent year. The table displays the data. 


Number 
of Wrist 

languages Height circumference _Preferrec 
Province Gender spoken Handedness (cm) (mm) communicat 
Saskatchewan Male 1 Right 175.0 180 In person 
Ontario Female 1 Right 162.5 160 In person 
Alberta Male 1 Right 178.0 174 Facebook 
Ontario Male 2 Right 169.0 160 Cell phone 
Ontario Female 2 Right 166.0 65 In person 
Nunavut Male 1 Right 168.5 160 Text messag) 
Ontario Female 1 Right 166.0 165 Cell phone 
Ontario Male 4 Left 157.5 147 Text messag) 
Ontario Female 2 Right 150.5 187 Text messag) 
Ontario Female 1 Right 171.0 180 Text messag) 


a. Identify the individuals in this data set. 


b. What are the variables? Classify each as categorical or quantitative. 


SOLUTION: 


a. 10 randomly selected Canadian students who participated in the 


Census At School survey. 
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We’ll see in Chapter 4 why choosing at random, as we did in this 


example, is a good idea. 


b. Categorical: Province, gender, handedness, preferred 


communication method 


Quantitative: Number of languages spoken, height (cm), wrist 


circumference (mm) 


There is at least one suspicious value in the data table. We doubt that 
the girl who is 166 cm tall really has a wrist circumference of 65 mm 


(about 2.6 inches). Always look to be sure the values make sense! 


FOR PRACTICE, TRY EXERCISE 1 


The proper method of data analysis depends on whether a variable is 
categorical or quantitative. For that reason, it is important to distinguish 
these two types of variables. The type of data determines what kinds of 
graphs and which numerical summaries are appropriate. 


AP® EXAM TIP 


If you learn to distinguish categorical from quantitative variables now, 
it will pay big rewards later. You will be expected to analyze 


categorical and quantitative variables correctly on the AP® exam. 


ANALYZING DATA A variable generally takes values that vary (hence 
the name variable!). Categorical variables sometimes have similar counts 
in each category and sometimes don’t. For instance, we might have 
expected similar numbers of males and females in the Census At School 
data set. But we aren’t surprised to see that most students are right-handed. 
Quantitative variables may take values that are very close together or 
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values that are quite spread out. We call the pattern of variation of a 
variable its distribution. 


DEFINITION Distribution 


The distribution of a variable tells us what values the variable takes 
and how often it takes those values. 


Let’s return to the data for the sample of 10 Canadian students from 
the preceding example. Figure 1.1(a) shows the distribution of preferred 
communication method for these students in a bar graph. We can see how 
many students chose each method from the heights of the bars: cell phone 
(2), Facebook (1), in person (3), text messaging (4). Figure 1.1(b) shows 
the distribution of number of languages spoken in a dotplot. We can see 
that 6 students speak one language, 3 students speak two languages, and 1 
student speaks four languages. 
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messaging 
— ae a Number of languages spoken 
(a) Preferred communication method (b) 


FIGURE 1.1 (a) Bar graph showing the distribution of preferred communication 
method for the sample of 10 Canadian students. (b) Dotplot showing the 
distribution of number of languages spoken by these students. 


Section 1.1 begins by looking at how to describe the distribution of a 
single categorical variable and then examines relationships between 
categorical variables. Sections 1.2 and 1.3 and all of Chapter 2 focus on 
describing the distribution of a quantitative variable. Chapter 3 investigates 
relationships between two quantitative variables. In each case, we begin 
with graphical displays, then add numerical summaries for a more 
complete description. 
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HOW TO ANALYZE DATA 


e Begin by examining each variable by itself. Then move on to study 


relationships among the variables. 


e Start with a graph or graphs. Then add numerical summaries. 


CHECK YOUR UNDERSTANDING 


( 


y Jake is a car buff who wants to find out more about the vehicles that 
his classmates drive. He gets permission to go to the student parking lot 
and record some data. Later, he does some Internet research on each 
model of car he found. Finally, Jake makes a spreadsheet that includes 
each car’s license plate, model, year, color, highway gas mileage, 
weight, and whether it has a navigation system. 


1. Identify the individuals in Jake’s study. 


2. What are the variables? Classify each as categorical or quantitative. 


From Data Analysis to Inference 


Sometimes we’re interested in drawing conclusions that go beyond the 
data at hand. That’s the idea of inference. In the “Census At School” 
example, 9 of the 10 randomly selected Canadian students are right- 
handed. That’s 90% of the sample. Can we conclude that exactly 90% of 
the population of Canadian students who participated in Census At School 
are right-handed? No. 

If another random sample of 10 students were selected, the percent 
who are right-handed might not be exactly 90%. Can we at least say that 
the actual population value is “close” to 90%? That depends on what we 
mean by “close.” The following activity gives you an idea of how 
statistical inference works. 
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ACTIVITY Hiring discrimination—it just won’t fly! 


An airline has just finished training 25 pilots—15 male and 10 female— 
to become captains. Unfortunately, only eight captain positions are 
available right now. Airline managers announce that they will use a 
lottery to determine which pilots will fill the available positions. The 
names of all 25 pilots will be written on identical slips of paper. The 
slips will be placed in a hat, mixed thoroughly, and drawn out one at a 
time until all eight captains have been identified. 


Choja/Getty Images 


A day later, managers announce the results of the lottery. Of the 8 
captains chosen, 5 are female and 3 are male. Some of the male pilots 
who weren’t selected suspect that the lottery was not carried out fairly. 
One of these pilots asks your statistics class for advice about whether to 
file a grievance with the pilots’ union. 

The key question in this possible discrimination case seems to be: Is 
it plausible (believable) that these results happened just by chance? To 
find out, you and your classmates will simulate the lottery process that 
airline managers said they used. 


1. Your teacher will give you a bag with 25 beads (15 of one color and 
10 of another) or 25 slips of paper (15 labeled “M” and 10 labeled 
“F”) to represent the 25 pilots. Mix the beads/slips thoroughly. 
Without looking, remove 8 beads/slips from the bag. Count the 
number of female pilots selected. Then return the beads/slips to the 
bag. 

2. Your teacher will draw and label a number line for a class dotplot. On 
the graph, plot the number of females you got in Step 1. 


3. Repeat Steps 1 and 2 if needed to get a total of at least 40 simulated 
lottery results for your class. 
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4. Discuss the results with your classmates. Does it seem plausible that 
airline managers conducted a fair lottery? What advice would you 
give the male pilot who contacted you? 


Our ability to do inference is determined by how the data are produced. 
Chapter 4 discusses the two main methods of data production—sampling 
and experiments—and the types of conclusions that can be drawn from 
each. As the activity illustrates, the logic of inference rests on asking, 
“What are the chances?” Probability, the study of chance behavior, is the 
topic of Chapters 5—7. We’|] introduce the most common inference 
techniques in Chapters 8-12. 


Introduction Summary 


e Statistics is the science and art of collecting, analyzing, and drawing 
conclusions from data. 


e A data set contains information about a number of individuals. 
Individuals may be people, animals, or things. For each individual, the 
data give values for one or more variables. A variable describes some 
characteristic of an individual, such as a person’s height, gender, or 
salary. 


e A categorical variable assigns a label that places each individual in one 
of several groups, such as male or female. A quantitative variable has 
numerical values that count or measure some characteristic of each 
individual, such as number of siblings or height in meters. 


e The distribution of a variable describes what values the variable takes 
and how often it takes them. 


Introduction Exercises 


The solutions to all exercises numbered in red may be found in the 
Solutions Appendix. 
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1. pg 3 |p) Aclass survey Here is a small part of the data set that 


describes the students in an AP® Statistics class. The data come from 
anonymous responses to a questionnaire filled out on the first day of 
class. 


Grade Children in Homework last night Android or 

Gender level GPA family (min) iPhone? 
F a 23 3 0-14 iPhone 

11 3.8 6 15-29 Android 

M 10 3.1 2 15-29 Android 
F 10 4.0 1 45-59 iPhone 
F 10 3.4 4 0-14 iPhone 

F 10 3.0 3 30-44 Android 
M a oh, 2 15-29 iPhone 
M 12 a. 2 0-14 iPhone 


a. Identify the individuals in this data set. 


b. What are the variables? Classify each as categorical or quantitative. 


2. Coaster craze Many people like to ride roller coasters. Amusement 


3. 


parks try to increase attendance by building exciting new coasters. The 
following table displays data on several roller coasters that were 


opened in a recent year.4 


Roller coaster Type Height (ft) Design Speed(mph) Duration (sec) 


Wildfire Wood 187.0 Sit down 70.2 120 
Skyline Steel 131.3 Inverted 50.0 90 
Goliath Wood 165.0 Sit down 72.0 105 
Helix Steel 134.5 Sit down 62.1 130 
Banshee Steel 167.0 Inverted 68.0 160 
Black Hole Steel 2227 Sit down 25.5 75 


a. Identify the individuals in this data set. 


b. What are the variables? Classify each as categorical or quantitative. 
Hit movies According to the Internet Movie Database, Avatar is tops 
based on box-office receipts worldwide as of January 2017. The 
following table displays data on several popular movies. Identify the 
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4. 


individuals and variables in this data set. Classify each variable as 
categorical or quantitative. 


Movie 


Avatar 


Titanic 


Star Wars: The Force 


Awakens 


Jurassic World 


Marvel’s The Avengers 


Furious 7 


The Avengers: Age of Ultron 


Harry Potter and the Deathly 


Hallows: Part 2 
Frozen 


Iron Man 3 


Year Rating 


2009 


1997 


2015 


2015 


2012 


2015 


2015 


2011 


2013 
2013 


Time 


(min) Genre 


Box office ($) 
2,783,918,982 


2,207,615,668 
2,040,375,795 
1,669, 164,161 
1,519,479,547 
1,516,246,709 
1,404,705,868 


1,328,111,219 


Animation 1,254,512,386 


PG- 162 Action 
13 

PG- 194 Drama 
13 

PG- 136 Adventure 
13 

PG- 124 Action 
13 

PG- 142 Action 
13 

PG- 137 Action 
13 

PG- 141 Action 
13 

PG- 130 Fantasy 
13 

PG 108 

PG- 129 Action 
13 


1,172,805,920 


Skyscrapers Here is some information about the tallest buildings in 


the world as of February 2017. Identify the individuals and variables in 
this data set. Classify each variable as categorical or quantitative. 


Building 
Burj Khalifa 


Shanghai Tower 
Makkah Royal Clock 


Tower Hotel 


Ping An Finance Center 
Lotte World Tower 


Country 


United Arab 
Emirates 


China 
Saudi Arabia 


China 
South Korea 


One World Trade Center United States 


Taipei 101 


Taiwan 
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Height 
(m) 
828.0 


632.0 
601.0 


599.0 
904.5 
941.0 
509.0 


Year 
Floors Use completed 

163 Mixed 2010 
121 Mixed 2014 
120 Hotel 2012 
115 Mixed 2016 
123 Mixed 2016 
104 Office 2013 
101 Office 2004 


5. 


6. 


Ze 


8. 


Shanghai World China 492.0 101 Mixed 2008 
Financial Center 


International Commerce China 484.0 118 Mixed 2010 
Center 
Petronas Tower 1 Malaysia 452.0 88 Office 1998 


Protecting wood What measures can be taken, especially when 
restoring historic wooden buildings, to help wood surfaces resist 
weathering? In a study of this question, researchers prepared wooden 
panels and then exposed them to the weather. Some of the variables 
recorded were type of wood (yellow poplar, pine, cedar); type of water 
repellent (solvent-based, water-based); paint thickness (millimeters); 
paint color (white, gray, light blue); weathering time (months). 
Classify each variable as categorical or quantitative. 

Medical study variables Data from a medical study contain values of 
many variables for each subject in the study. Some of the variables 
recorded were gender (female or male); age (years); race (Asian, 
Black, White, or other); smoker (yes or no); systolic blood pressure 
(millimeters of mercury); level of calcium in the blood (micrograms 
per milliliter). Classify each variable as categorical or quantitative. 
Ranking colleges Popular magazines rank colleges and universities on 
their “academic quality” in serving undergraduate students. Describe 
two categorical variables and two quantitative variables that you might 
record for each institution. 

Social media You are preparing to study the social media habits of 
high school students. Describe two categorical variables and two 
quantitative variables that you might record for each student. 


Multiple Choice Select the best answer. 


Exercises 9 and 10 refer to the following setting. At the Census Bureau 
website www.census.gov, you Can view detailed data collected by the 
American Community Survey. The following table includes data for 10 
people chosen at random from the more than 1 million people in 
households contacted by the survey. “School” gives the highest level of 
education completed. 


Weight Age Travel to work Income last year 
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(Ib) (years) (min) School Gender 
187 66 0 Ninth grade 1 
158 66 n/a High school 2 
grad 
176 54 10 Assoc. degree 2 
339 ay 10 Assoc. degree 1 
91 27 10 Some college 2 
155 18 n/a High school 2 
grad 
213 38 15 Master’s 2 
degree 
194 40 0 High school 1 
grad 
221 18 20 High school 1 
grad 
193 11 n/a Fifth grade 1 
9. The individuals in this data set are 
a. households. 
b. people. 
c. adults. 
d. 120 variables. 
e. columns. 
10. This data set contains 


a. 7 variables, 2 of which are categorical. 
b. 7 variables, 1 of which is categorical. 
c. 6 variables, 2 of which are categorical. 
d. 6 variables, 1 of which is categorical. 


e. None of these. 
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(3) 
24,000 
0 


11,900 
6000 
30,000 
0 


125,000 


800 


SECTION 1.1 Analyzing Categorical Data 


LEARNING TARGETS By the end of the section, you should be able to: 


Make and interpret bar graphs for categorical data. 


Identify what makes some graphs of categorical data misleading. 


Calculate marginal and joint relative frequencies from a two-way 
table. 


Calculate conditional relative frequencies from a two-way table. 
Use bar graphs to compare distributions of categorical data. 


Describe the nature of the association between two categorical 
variables. 


Here are the data on preferred communication method for the 10 
randomly selected Canadian students from the example on page 3: 


In person In person Facebook Cell phone In person 
Text Cell Text Text Text 
messaging phone messaging messaging messaging 


We can summarize the distribution of this categorical variable with a 
frequency table or a relative frequency table. 


DEFINITION Frequency table, Relative frequency table 


A frequency table shows the number of individuals having each 
value. 


A relative frequency table shows the proportion or percent of 
individuals having each value. 


Some people use the terms frequency distribution and relative frequency distribution 


instead. 
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To make either kind of table, start by tallying the number of times that the 
variable takes each value. rl Note that the frequencies and relative 
frequencies listed in these tables are not data. The tables summarize the 
data by telling us how many (or what proportion or percent of) students in 
the sample said “Cell phone,” “Facebook,” “In person,” and “Text 


messaging.” 
Preferred Tally Preferred Frequency Preferred Relative 
method method method frequency 
Cell phone || Cell phone 2 Cell phone 2/10 = 0.20 
or 20% 
Facebook | Facebook 1 Facebook 1/10 = 0.10 
or 10% 
In person ||| In person 3 In person 3/10 = 0.30 
or 30% 
Text \||| Text 4 Text 4/10 = 0.40 
messaging messaging messaging or 40% 


The same process can be used to summarize the distribution of a 
quantitative variable. Of course, it would be hard to make a frequency 
table or a relative frequency table for quantitative data that take many 
different values, like the ages of people attending a Major League Baseball 
game. We’|l look at a better option for quantitative variables with many 
possible values in Section 1.2. 


Displaying Categorical Data: Bar Graphs and Pie Charts 


A frequency table or relative frequency table summarizes a variable’s 
distribution with numbers. To display the distribution more clearly, use a 
graph. You can make a bar graph or a pie chart for categorical data. 


DEFINITION Bar graph, Pie chart 


A bar graph shows each category as a bar. The heights of the bars 
show the category frequencies or relative frequencies. 
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A pie chart shows each category as a slice of the “pie.” The areas of 


the slices are proportional to the category frequencies or relative 
frequencies. 


Bar graphs are sometimes called bar charts. Pie charts are sometimes called circle 
graphs. 


Figure 1.2 shows a bar graph and a pie chart of the data on preferred 
communication method for the random sample of Canadian students. Note 
that the percents for each category come from the relative frequency table. 


40 


30) 


Text 
5 messaging 
E 20 40% 
-¥ 

10 
Cell phone Facebook _ In person Text 
messaging 

(a) Preferred communication method (b) Preferred communication method 


FIGURE 1.2 (a) Bar graph and (b) pie chart of the distribution of preferred 
communication method for a random sample of 10 Canadian students. 


Preferred Relative 
method frequency 
Cell phone 2/10 = 0.20 or 

20% 
Facebook 1/10 = 0.10 or 

10% 
In person 3/10 = 0.30 or 

30% 
Text 4/10 = 0.40 or 
messaging 40% 


65 


It is fairly easy to make a bar graph by hand. Here’s how you do it. 


HOW TO MAKE A BAR GRAPH 


Draw and label the axes. Put the name of the categorical variable 
under the horizontal axis. To the left of the vertical axis, indicate 
whether the graph shows the frequency (count) or relative frequency 
(percent or proportion) of individuals in each category. 


“Scale” the axes. Write the names of the categories at equally 


spaced intervals under the horizontal axis. On the vertical axis, start 
at 0 and place tick marks at equal intervals until you exceed the 
largest frequency or relative frequency in any category. 


Draw bars above the category names. Make the bars equal in width 
and leave gaps between them. Be sure that the height of each bar 
corresponds to the frequency or relative frequency of individuals in 
that category. 


Making a graph is not an end in itself. The purpose of a graph is to 
help us understand the data. When looking at a graph, always ask, “What 
do I see?” We can see from both graphs in Figure 1.2 that the most 
preferred communication method for these students is text messaging. 


EXAMPLE |What’s on the radio? (>) 
Making and interpreting bar graphs 


PROBLEM: Arbitron, the rating service for radio audiences, 
categorizes U.S. radio stations in terms of the kinds of programs they 
broadcast. The frequency table summarizes the distribution of station 


formats in a recent year.2 

Format Number of stations 
Adult contemporary 2536 

All sports 1274 
Contemporary hits 1012 
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Country 2893 


News/talk/information 4077 
Oldies 831 
Religious 3884 
Rock 1636 
Spanish language 878 
Variety 1579 
Other formats 4852 
Total 25,452 


a. Identify the individuals in this data set. 
b. Make a frequency bar graph of the data. Describe what you see. 


SOLUTION: 


a. U.S. radio stations 


b 5000 


4500 
4000 
3500 
3000 
2500 
42000 
1500 
1000 

500 


Number of stations 


0 8 Ww oy. 
oe wt oe A wi a “ te aye 
w ® 
.y i eo ol 
Radio station format 


To make the bar graph: 


e Draw and label the axes. 


e “Scale” the axes. The largest frequency is 4852. So we choose a 


vertical scale from 0 to 5000, with tick marks 500 units apart. 


e Draw bars above the category names. 
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On U.S. radio stations, the most frequent formats are Other (4852), 
News/talk/information (4077), and Religious (3884), while the least 
frequent are Oldies (831), Spanish language (878), and 
Contemporary hits (1012). 


FOR PRACTICE, TRY EXERCISE 11 


Radio station 
format 


Spanish lang. 
3.4% 


Oldies 
3.3% 


Here is a pie chart of the radio station format data from the preceding 
example. You can use a pie chart when you want to emphasize each 
category’s relation to the whole. Pie charts are challenging to make by 
hand, but technology will do the job for you. Note that @ a pie chart 


must include all categories that make up a whole, which might mean 
adding an “other” category, as in the radio station example. 


IT DIDN'T HAVE 
ANYTHING USEFUL 
TO SAY SO T MADE 

THIS PIE CHART. 


T PLEDGE 
MY LIFE 
AND MY 
FORTUNE 
TO THE PIE! 


IT MUST 
BE TRUE 
BECAUSE 
TT’S PTE. 


; 
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3-709 ©2009Scott Adams, Inc./Dist. by UFS, Inc. 


DILBERT © Scott Adams. Used by permission of UNIVERSAL UCLICK. All rights reserved. 


CHECK YOUR UNDERSTANDING 
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v The American Statistical Association sponsors a web-based project 
that collects data about primary and secondary school students using 
surveys. We used the site’s “Random Sampler” to choose 40 U.S. high 
school students who completed the survey in a recent year.2 One of the 
questions asked: 


Which would you prefer to be? Select one. 


Rich Happy Famous 
Healthy 


Here are the responses from the 40 randomly selected students: 


Famous Healthy Healthy Famous Happy Famous Happy Happy Famous 
Rich Happy Happy _ Rich Happy Happy Happy Rich Happy 
Famous Healthy Rich Happy Happy Rich Happy Happy Rich 
Healthy Happy Happy _ Rich Happy Happy’ Rich Happy Famous 
Famous Happy Happy Happy 


Make a relative frequency bar graph of the data. Describe what you see. 


Graphs: Good and Bad 


Bar graphs are a bit dull to look at. It is tempting to replace the bars with 
pictures or to use special 3-D effects to make the graphs seem more 
interesting. Don’t do it! Our eyes react to the area of the bars as well as to 
their height. When all bars have the same width, the area (width x height) 
varies in proportion to the height, and our eyes receive the right impression 
about the quantities being compared. 


EXAMPLE | Who buys iMacs? (>) 
Beware the pictograph! 
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Justin Sullivan/Getty Images 


PROBLEM: When Apple, Inc., introduced the iMac, the company 
wanted to know whether this new computer was expanding Apple’s 
market share. Was the iMac mainly being bought by previous 
Macintosh owners, or was it being purchased by first-time computer 
buyers and by previous PC users who were switching over? To find out, 
Apple hired a firm to conduct a survey of 500 iMac customers. Each 
customer was categorized as a new computer purchaser, a previous PC 
Owner, or a previous Macintosh owner. The table summarizes the survey 
results.4 


Previous ownership Count Percent (%) 
None 85 17.0 
PC 60 12.0 
Macintosh 355 71.0 
Total 500 100.0 


a. Below is a clever graph of the data that uses pictures instead of the 
more traditional bars. How is this pictograph misleading? 
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400 4 


300 5 


200 5 


Number of buyers 


Pp Se 


None Windows Macintosh 


Previous computer 


b. Two possible bar graphs of the data are shown below. Which one 
could be considered deceptive? Why? 


= 


Percent 
w aN 
| 

Percent 


eer 
None Macintosh None PC Macintosh 


Previous computer Previous computer 


SOLUTION: 


a. The pictograph makes it look like the percentage of iMac buyers 
who are former Mac owners is at least 10 times larger than either of 


the other two categories, which isn’t true. 


In part (a), the heights of the images are correct. But the areas of the 


images are misleading. The Macintosh image is about 6 times as tall 


as the PC image, but its area is about 36 times as large! 


b. The bar graph on the right is misleading. By starting the vertical 
scale at 10 instead of 0, it looks like the percentage of iMac buyers 
who previously owned a PC is less than half the percentage who are 


71 


first-time computer buyers, which isn’t true. 


FOR PRACTICE, TRY EXERCISE 19 


There are two important lessons to be learned from this example: @ 
(1) beware the pictograph, and (2) watch those scales. 


Analyzing Data on Two Categorical Variables 


You have learned some techniques for analyzing the distribution of a 
single categorical variable. What should you do when a data set involves 
two categorical variables? For example, Yellowstone National Park staff 
surveyed a random sample of 1526 winter visitors to the park. They asked 
each person whether he or she belonged to an environmental club (like the 
Sierra Club). Respondents were also asked whether they owned, rented, or 
had never used a snowmobile. The data set looks something like the 
following: 


Respondent Environmental club? Snowmobile use 
1 No Own 
2 No Rent 
a Yes Never 
4 Yes Rent 
5 


No Never 


franz12/Shutterstock 


The two-way table summarizes the survey responses. 
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Environmental club member? 


Yes 

Never 212 

Snowmobile use Rent 77 
Own 279 16 


DEFINITION Two-way table 


A two-way table is a table of counts that summarizes data on the 
relationship between two categorical variables for some group of 
individuals. 


A two-way table is sometimes called a contingency table. 


It’s easier to grasp the information in a two-way table if row and 
column totals are included, like the one shown here. 


Environmental club 


Never used 
Snowmobile use Snowmobile renter 


Snowmobile owner 


Total 


Now we can quickly answer questions like: 


e What percent of people in the sample are environmental club members? 


, 305 
305, ne ~ 90.0% 
3051526=0.200=20,0%i526 7 9200 = 20.0% 759g ~ 0-200 = 20.07% 


e What proportion of people in the sample never used a snowmobile? 


. 
Le 


657 


6571526=0.431i526 94?" 1526 


These percents or proportions are known as marginal relative 
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frequencies because they are calculated using values in the margins of the 
two-way table. 


DEFINITION Marginal relative frequency 


A marginal relative frequency gives the percent or proportion of 
individuals that have a specific value for one categorical variable. 


We could call this distribution the marginal distribution of environmental club 


membership. 


We can compute marginal relative frequencies for the column totals to 
give the distribution of environmental club membership in the entire 
sample of 1526 park visitors: 


No: 12211526=0.800 or 80.0% Yes: 3051526=0.200 or 20.0% 


1221 305 7 
No: 55 = 0.8000r80.0% Yes: or = 0.200 or 20.0% 


1221 | . _ 305 0; 
1596 ~ 0:800 or 80.0% Yes : 1596 > 0:200 oF 20.0% 


We can compute marginal relative frequencies for the row totals to 


give the distribution of snowmobile use for all the individuals in the 
sample: 


Never: 6571526=0.431 or 43.1%Rent: 5741526=0.376 or 37.6%Own: 2951 


No 


657 
Never: ——- = 0.431 or 43.1% 
657 1526 
Never : 1636 0.431 or 43.1% 574 
574 Rent : —— = 0.376 or 37.6% 
Rent: —— = 0.376 or 37.6% 1526 
1526 995 
Own : — = 0.193 or 19.3% Own: 1526 = 0.193 or 19.3% 
1526 v0 


We could call this distribution the marginal distribution of snowmobile use. 


Note that we could use a bar graph or a pie chart to display either of these 
distributions. 


A marginal relative frequency tells you about only one of the variables 
in a two-way table. It won’t help you answer questions like these, which 


74 


involve values of both variables: 
e What percent of people in the sample are environmental club members 


and own snowmobiles? 
16 


16 «<——= 0.010 = 1.0% 
161526=0.010=1.0%1526 9 = 1-9” 1526 = 


e What proportion of people in the sample are not environmental club 


members and never use snowmobiles? 


44 
445 > _ 0.292 


4451526=0.2921526 °*"? 1526 
These percents or proportions are known as joint relative frequencies. 


DEFINITION Joint relative frequency 


A joint relative frequency gives the percent or proportion of 
individuals that have a specific value for one categorical variable and 
a specific value for another categorical variable. 


EXAMPLE |A Titanic disaster (») 


Calculating marginal and joint relative frequencies 


“an! Ohe New Bork Times. === 


TITANIC SINKS FOUR HOURS AFTER HITTING ICEBERG; 
866 RESCUED BY CARPATHIA, PROBABLY 1250 PERISI 
ISMA Y SAFE, MRS. ASTOR MAYBE, NOTED NAMES MISSI 


& 


Blank Archives/Getty Images 


PROBLEM: In 1912 the luxury liner Titanic, on its first voyage across 
the Atlantic, struck an iceberg and sank. Some passengers got off the 
ship in lifeboats, but many died. The two-way table gives information 
about adult passengers who survived and who died, by class of travel. 
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Class of travel 


Survival status Survived 


Died 


a. What proportion of adult passengers on the Titanic survived? 


b. Find the distribution of class of travel for adult passengers on the 
Titanic using relative frequencies. 


c. What percent of adult Titanic passengers traveled in third class and 
survived? 


SOLUTION: 


Start by finding the marginal totals. 


Class of travel 


Second 
Survival status Survived 


4421207=0.366-44 _ 9 365 442 
a. nk 1207 


= 0.366 


Remember that a distribution lists the possible values of a variable 


and how often those values occur. 
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First: 3191207=0.264=26.4%Second: 2611207=0.216=21.6%Third: 6271207=0. 


319 319 
= 0.264 = 26.4% First : ———-= 0.264 = 26.4% 
ms" * 1907 : 


First : 


1207 
261 
Second : —= 0.216 = 21.6% 261 
1207 Second : ——= 0.216 = 21.6% 
Third : ©?” — 0.519 = 51.9% 1207 
: Third wal 0.519 = 51.9% 
urd: -——=| Uv. = . 
1207 ; 


Note that the three percentages for class of travel in part (b) do not add to 


exactly 100% due to roundoff error. 


151 _ 151 
0.125 OA 


C. 1511207=0.125=12.5%1207 _ 


= 0.125 = 12.5% 


FOR PRACTICE, TRY EXERCISE 23 


CHECK YOUR UNDERSTANDING 


A 


¥ 


y An article in the Journal of the American Medical Association 
reports the results of a study designed to see if the herb St. John’s wort 
is effective in treating moderately severe cases of depression. The study 
involved 338 patients who were being treated for major depression. The 
subjects were randomly assigned to receive one of three treatments: St. 
John’s wort, Zoloft (a prescription drug), or placebo (an inactive 
treatment) for an 8-week period. The two-way table summarizes the 
data from the experiment.2 


Treatment 


Full 
response 


Change in Partial 16 26 13 
depression response 


1. What proportion of subjects in the study were randomly assigned to 
take St. John’s wort? Explain why this value makes sense. 


2. Find the distribution of change in depression for the subjects in this 
study using relative frequencies. 


3. What percent of subjects took Zoloft and showed a full response? 


Relationships Between Two Categorical Variables 


Let’s return to the data from the Yellowstone National Park survey of 1526 
randomly selected winter visitors. Earlier, we calculated marginal and joint 
relative frequencies from the two-way table. These values do not tell us 
much about the relationship between environmental club membership and 
snowmobile use for the people in the sample. 


Environmental club 


Never used 
Snowmobile use Snowmobile renter 


Snowmobile owner 


Total 


We can also use the two-way table to answer questions like: 


e What percent of environmental club members in the sample are 
snowmobile owners? 
16 


16 Bo & Om = 0.052 = 5.2% 
16305=0.052=5.2%305 70°? = >? 305 


e What proportion of snowmobile renters in the sample are not 
environmental club members? 
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= 497 
= = 0.866574 


= 0.866 


497574=0.866 


These percents or proportions are known as conditional relative 
frequencies. 


DEFINITION Conditional relative frequency 


A conditional relative frequency gives the percent or proportion of 
individuals that have a specific value for one categorical variable 
among individuals who share the same value of another categorical 
variable (the condition). 


EXAMPLE |A Titanic disaster (>) 


Conditional relative frequencies 


PROBLEM: In 1912 the luxury liner Titanic, on its first voyage across 
the Atlantic, struck an iceberg and sank. Some passengers made it off 
the ship in lifeboats, but many died. The two-way table gives 
information about adult passengers who survived and who died, by class 
of travel. 


Class of travel 


Survival status Survived 
Died 
Total 


a. What proportion of survivors were third-class passengers? 
b. What percent of first-class passengers survived? 
SOLUTION: 


151 bol _ 9 342 
a. 151442=0.342442 9747442 


79 


197 
197 —— 0.618 = 61.8% 
b. 197319=0.618=61.8%319 ~ 9-818 = 81.8% 379 


Note that a proportion is always a number between 0 and 1, whereas a 


percent is a number between 0 and 100. To get a percent, multiply the 
proportion by 100. 
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We can study the snowmobile use habits of environmental club 
members by looking only at the “Yes” column in the two-way table. 


Environmental club 


No /es \ Total 

Never used 657 

Snowmobile use Snowmobile renter 574 
Snowmobile owner 295 

Total 1221 305 / 1526 


It is easy to calculate the proportions or percents of environmental club 
members who never use, rent, and own snowmobiles: 
Never: 212305=0.695 or 69.5%Rent: 77305=0.252 or 25.2%Own: 16305=( 


212 
Never: 3057 0.695 or 69.5% 


a ——= 0.695 or 69.5% yg 
ws Rent: ——= 0.252 or 25.2% 
305 


rete 0.252 or 25.2% 
305 


16 
Own: = 0.052 or 5.2% Own: 305 0.052 or 5.2% 


We could also refer to this distribution as the conditional distribution of snowmobile 


use among environmental club members. 


This is the distribution of snowmobile use among environmental club 
members. 


We can find the distribution of snowmobile use among the survey 
respondents who are not environmental club members in a similar way. 
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The table summarizes the conditional relative frequencies for both groups. 


Snowmobile 
use Not environmental club members 
Never - 445 
445 | ‘ 246 4% = 0.364 or 36.4% 
4451221=0.364 or 36.4% 1221 ~ 2904 F 96-44 1991 2 
Rent - 497 
497 407 or 40.7% aaa. 0.407 or 40.7% 
4971221=0.407 or 40.7% 1221 0407 F 49-7 1997 ! 
O = 279 
“ 279 = 0.229 or 22.9% 


9791221=0.229 of 22.9%aa ne 


AP® EXAM TIP 


When comparing groups of different sizes, be sure to use relative 
frequencies (percents or proportions) instead of frequencies (counts) 
when analyzing categorical data. Comparing only the frequencies can 
be misleading, as in this setting. There are many more people who 
never use snowmobiles among the non-environmental club members in 
the sample (445) than among the environmental club members (212). 
However, the percentage of environmental club members who never 
use snowmobiles is much higher (69.5% to 36.4%). Finally, make sure 
to avoid statements like “More club members never use snowmobiles” 
when you mean “A greater percentage of club members never use 
snowmobiles.” 


Figure 1.3 compares the distributions of snowmobile use for 
Yellowstone National Park visitors who are environmental club members 
and those who are not environmental club members with (a) a side-by-side 
bar graph and (b) a segmented bar graph. Notice that the segmented bar 
graph can be obtained by stacking the bars in the side-by-side bar graph 
for each of the two environmental club membership categories (no and 


yes). 
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FIGURE 1.3 (a) Side-by-side bar graph and (b) segmented bar graph displaying 
the distribution of snowmobile use among environmental club members and among 
non-environmental club members from the 1526 randomly selected winter visitors 
to Yellowstone National Park. 


DEFINITION Side-by side bar graph, Segmented bar 
graph 


A side-by-side bar graph displays the distribution of a categorical 
variable for each value of another categorical variable. The bars are 
grouped together based on the values of one of the categorical 
variables and placed side by side. 


A segmented bar graph displays the distribution of a categorical 
variable as segments of a rectangle, with the area of each segment 
proportional to the percent of individuals in the corresponding 
category. 


Both graphs in Figure 1.3 show a clear association between 
environmental club membership and snowmobile use in this random 
sample of 1526 winter visitors to Yellowstone National Park. The 
environmental club members were much less likely to rent (25.2% versus 
40.7%) or own (5.2% versus 29.0%) snowmobiles than non-club-members 
and more likely to never use a snowmobile (69.5% versus 36.4%). 
Knowing whether or not a person in the sample is an environmental club 
member helps us predict that individual’s snowmobile use. 
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DEFINITION Association 


There is an association between two variables if knowing the value 
of one variable helps us predict the value of the other. If knowing the 
value of one variable does not help us predict the value of the other, 
then there is no association between the variables. 


What would the graphs in Figure 1.3 look like if there was no 
association between environmental club membership and snowmobile use 
in the sample? The blue segments would be the same height for both the 
“Yes” and “No” groups. So would the green segments and the red 
segments, as shown in the graph at left. In that case, knowing whether a 
survey respondent is an environmental club member would not help us 
predict his or her snowmobile use. 


100 


Snowmobile 


Percent 
vi 
So 


No Yes 
Environmental club membership 


Which distributions should we compare? Our goal all along has been 
to analyze the relationship between environmental club membership and 
snowmobile use for this random sample of 1526 Yellowstone National 
Park visitors. We decided to calculate conditional relative frequencies of 
snowmobile use among environmental club members and among non- 
club-members. Why? Because we wanted to see if environmental club 
membership helped us predict snowmobile use. What if we had wanted to 
determine whether snowmobile use helps us predict whether a person is an 
environmental club member? Then we would have calculated conditional 
relative frequencies of environmental club membership among 
snowmobile owners, renters, and non-users. In general, you should 
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calculate the distribution of the variable that you want to predict for each 
value of the other variable. 

Can we say that there is an association between environmental club 
membership and snowmobile use in the population of all winter visitors to 
Yellowstone National Park? Making this determination requires formal 
inference, which will have to wait until Chapter 11. 


EXAMPLE |A Titanic disaster (») 


Conditional relative frequencies and association 


—— 


Universal History Archive/Getty Images 


PROBLEM: In 1912 the luxury liner Titanic, on its first voyage across 
the Atlantic, struck an iceberg and sank. Some passengers made it off 
the ship in lifeboats, but many died. The two-way table gives 
information about adult passengers who survived and who died, by class 
of travel. 


Class of travel 


Survival status Survived 
Died 
Total 


a. Find the distribution of survival status for each class of travel. Make 
a segmented bar graph to compare these distributions. 


b. Describe what the graph in part (a) reveals about the association 
between class of travel and survival status for adult passengers on the 
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Titanic. 


SOLUTION: 


a. First 
classSurvived: 197319=0.618=61.8%Died: 122319=0.382=38.2%Second clas 


; 197 122 : 
First class Survived: had 0.618 = 61.8% Died: ——= 0.382 = 38.2% 
319 319 
—— 94 . 3 : 167 2s 
Second class Survived: ——= 0.360 = 36.0% Died: ——= 0.640 = 64.0% 
261 261 
: . Ls ae se : 476 Pi gee 
Third class Survived: —-= 0.241 = 24.1% Died: —~= 0.759 = 75.9% 
627 627 
100 
80 
Survival 
, 60 Status 
8 () Died 
5s és C) Survived 


20 


First Second Third 
Class of travel 


To make the segmented bar graph: 


e Draw and label the axes. Put class of travel on the horizontal axis 


and percent on the vertical axis. 


e “Scale” the axes. Use a vertical scale from 0 to 100%, with tick 


marks every 20%. 


e Draw bars. Make each bar have a height of 100%. Be sure the 
bars are equal in width and leave spaces between them. Segment 
each bar based on the conditional relative frequencies you 
calculated. Use different colors or shading patterns to represent 
the two possible statuses—survived and died. Add a key to the 
graph that tells us which color (or shading) represents which 


status. 
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b. Knowing a passenger’s class of travel helps us predict his or her 
survival status. First class had the highest percentage of survivors 


(61.8%), followed by second class (36.0%), and then third class 
(24.1%). 


FOR PRACTICE, TRY EXERCISE 29 


Bar graphs can be used to compare any set of quantities that can be measured in the 
same units. See Exercises 33 and 34. 


Because the variable “Survival status” has only two possible values, 
comparing the three distributions displayed in the segmented bar graph 
amounts to comparing the percent of passengers in each class of travel 
who survived. The bar graph in Figure 1.4 shows this comparison. Note 
that the bar heights do not add to 100%, because each bar represents a 
different group of passengers on the Titanic. 


70 


Percent who survived 


First Second Third 
Class of travel 


FIGURE 1.4 Bar graph comparing the percents of passengers who survived 
among each of the three classes of travel on the Titanic. 


We offer a final caution about studying the relationship between two 
variables: oo association does not imply causation. It may be true that 
being in a higher class of travel on the Titanic increased a passenger’s 
chance of survival. However, there isn’t always a cause-and-effect 
relationship between two variables even if they are clearly associated. For 
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example, a recent study proclaimed that people who are overweight are 
less likely to die within a few years than are people of normal weight. 
Does this mean that gaining weight will cause you to live longer? Not at 
all. The study included smokers, who tend to be thinner and also much 
more likely to die in a given period than non-smokers. Smokers increased 
the death rate for the normal-weight category, making it appear as if being 
overweight is better.2 The moral of the story: beware other variables! 


CHECK YOUR UNDERSTANDING 


( 


y An article in the Journal of the American Medical Association 
reports the results of a study designed to see if the herb St. John’s wort 
is effective in treating moderately severe cases of depression. The study 
involved 338 subjects who were being treated for major depression. The 
subjects were randomly assigned to receive one of three treatments: St. 
John’s wort, Zoloft (a prescription drug), or placebo (an inactive 
treatment) for an 8-week period. The two-way table summarizes the 
data from the experiment. 


Treatment 
St. John’s wort | Zoloft | Placebo 
Full response SE 
Change in depression Partial response 13 
No response 66 


1. What proportion of subjects who showed a full response took St. 
John’s wort? 


2. What percent of subjects who took St. John’s wort showed no 
response? 


3. Find the distribution of change in depression for the subjects 
receiving each of the three treatments. Make a segmented bar graph to 
compare these distributions. 
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4. Describe what the graph in Question 3 reveals about the association 
between treatment and change in depression for these subjects. 


1. Technology Corner | ANALYZING TWO-WAY 
TABLES 


Statistical software will provide marginal relative frequencies, joint 
relative frequencies, and conditional relative frequencies for data 
summarized in a two-way table. Here is output from Minitab for the 
data on snowmobile use and environmental club membership. Use the 
information on cell contents at the bottom of the output to help you 
interpret what each value in the table represents. 


Columns: Environmental club member? 


Section 1.1 Summary 


e The distribution of a categorical variable lists the categories and gives 
the frequency (count) or relative frequency (percent or proportion) of 
individuals that fall in each category. 


e You can use a pie chart or bar graph to display the distribution of a 
categorical variable. When examining any graph, ask yourself, “What 
do I see?” 


e Beware of graphs that mislead the eye. Look at the scales to see if they 
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have been distorted to create a particular impression. Avoid making 
graphs that replace the bars of a bar graph with pictures whose height 
and width both change. 


e A two-way table of counts summarizes data on the relationship between 
two categorical variables for some group of individuals. 


e You can use a two-way table to calculate three types of relative 
frequencies: 


: A marginal relative frequency gives the percent or proportion of 
individuals that have a specific value for one categorical variable. Use 
the appropriate row total or column total in a two-way table when 
calculating a marginal relative frequency. 


A joint relative frequency gives the percent or proportion of 
individuals that have a specific value for one categorical variable and a 
specific value for another categorical variable. Use the value from the 
appropriate cell in the two-way table when calculating a joint relative 
frequency. 


A conditional relative frequency gives the percent or proportion of 
individuals that have a specific value for one categorical variable 
among individuals who share the same value of another categorical 
variable (the condition). Use conditional relative frequencies to 
compare distributions of a categorical variable for two or more groups. 


e Use a side-by-side bar graph or a segmented bar graph to compare 
the distribution of a categorical variable for two or more groups. 


e There is an association between two variables if knowing the value of 
one variable helps predict the value of the other. To see whether there is 
an association between two categorical variables, find the distribution of 
one variable for each value of the other variable by calculating an 
appropriate set of conditional relative frequencies. 


1.1 Technology Corner 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 
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1. Analyzing two-way tables Page 22 


Section 1.1 Exercises 


11. pg 11 |») Birth days The frequency table summarizes data on the 


numbers of babies born on each day of the week in the United States 
in a recent week.Z 


Day Births 
Sunday 7374 
Monday 11,704 
Tuesday 13,169 
Wednesday 13,038 
Thursday 13,013 
Friday 12,664 
Saturday 8459 


a. Identify the individuals in this data set. 


b. Make a frequency bar graph to display the data. Describe what you 
see. 


12. Going up? As of 2015, there were over 75,000 elevators in New 
York City. The frequency table summarizes data on the number of 
elevators of each type.2 


Type Count 
Passenger elevator 66,602 
Freight elevator 4140 
Escalator 2663 
Dumbwaiter 1143 
Sidewalk elevator 943 
Private elevator 252 
Handicap lift 227 
Manlift ve’ 
Public elevator 45 
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a. Identify the individuals in this data set. 


b. Make a frequency bar graph to display the data. Describe what you 
see. 


13. Buying cameras The brands of the last 45 digital single-lens reflex 
(SLR) cameras sold on a popular Internet auction site are listed here. 
Make a relative frequency bar graph for these data. Describe what 


you see. 

Canon Sony Canon Nikon Fujifilm 
Nikon Canon Sony Canon Canon 
Nikon Canon Nikon Canon Canon 
Canon Nikon Fujifilm Canon Nikon 
Nikon Canon Canon Canon Canon 
Olympus Canon Canon Canon Nikon 
Olympus Sony Canon Canon Sony 
Canon Nikon Sony Canon Fujifilm 
Nikon Canon Nikon Canon Sony 


14. Disc dogs Here is a list of the breeds of dogs that won the World 
Canine Disc Championships from 1975 through 2016. Make a 
relative frequency bar graph for these data. Describe what you see. 


Whippet Mixed breed Australian shepherd 
Whippet Australian shepherd Australian shepherd 
Whippet Border collie Australian shepherd 
Mixed breed Australian shepherd Border collie 
Mixed breed Mixed breed Border collie 

Other purebred Mixed breed Australian shepherd 
Labrador retriever Mixed breed Border collie 
Mixed breed Border collie Border collie 
Mixed breed Border collie Other purebred 
Border collie Australian shepherd Border collie 
Mixed breed Border collie Border collie 
Mixed breed Australian shepherd Border collie 
Labrador retriever Border collie Mixed breed 
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Labrador retriever Mixed breed Australian shepherd 


15. Cool car colors The most popular colors for cars and light trucks 
change over time. Silver advanced past green in 2000 to become the 
most popular color worldwide, then gave way to shades of white in 
2007. Here is a relative frequency table that summarizes data on the 


colors of vehicles sold worldwide in a recent year.2 

Color Percent of vehicles 
Black 19 

Blue 6 
Brown/beige fs) 

Gray 12 

Green 

Red 9 

Silver 14 

White 29 
Yellow/gold 3 

Other a 


a. What percent of vehicles would fall in the “Other” category? 
b. Make a bar graph to display the data. Describe what you see. 


c. Would it be appropriate to make a pie chart of these data? Explain. 


16. Spam Email spam is the curse of the Internet. Here is a relative 
frequency table that summarizes data on the most common types of 


spam:12 
Type of spam Percent 
Adult 19 
Financial 20 
Health 
Internet 
Leisure 
Products 25 
Scams S 
Other a 
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a. What percent of spam would fall in the “Other” category? 
b. Make a bar graph to display the data. Describe what you see. 


c. Would it be appropriate to make a pie chart of these data? Explain. 


17. Hispanic origins Here is a pie chart prepared by the Census Bureau 
to show the origin of the more than 50 million Hispanics in the 
United States in 2010.44 About what percent of Hispanics are 
Mexican? Puerto Rican? 


Origin 


Cuban 


Central 
American 


South 

American 

18. Which major? About 3 million first-year students enroll in colleges 
and universities each year. What do they plan to study? The pie chart 
displays data on the percent of first-year students who plan to major 
in several disciplines.t2 About what percent of first-year students 
plan to major in business? In social science? 


Major Arts/ 
Humanities 


Technical Biological 
™~ sciences 
Professional 


Business 


Physical sciences Education 


Engineering 


19. pg 12 |») Going to school Students in a high school statistics class 


were given data about the main method of transportation to school for 
a group of 30 students. They produced the pictograph shown. Explain 
how this graph is misleading. 


93 


20. 


21. 


Key: 
@& = 1 Cyclist 
t@» = 2 Cars 
=, = 7 Bus takers 


& =2 Walkers 


Mode of transport 


Social media The Pew Research Center surveyed a random sample of 
U.S. teens and adults about their use of social media. The following 
pictograph displays some results. Explain how this graph is 
misleading. 


AGE BREAKDOWN (OF SOCIAL MEDIA USERS) 


65+ 


13-18 19-29 30-49 50-64 


Binge-watching Do you “binge-watch” television series by viewing 
multiple episodes of a series at one sitting? A survey of 800 people 
who binge-watch were asked how many episodes is too many to 
watch in one viewing session. The results are displayed in the bar 
graph.42 Explain how this graph is misleading. 


30 + 


3to4 Sto6 7to9 10or There 
more is no 
“too many” 


Number of episodes 


22. Support the court? A news network reported the results of a survey 


about a controversial court decision. The network initially posted on 
its website a bar graph of the data similar to the one that follows. 
Explain how this graph is misleading. (Note: When notified about the 
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misleading nature of its graph, the network posted a corrected 
version. ) 


Percent who agree 
wn 
oo 
! 


Democrats Republicans Independents 


23. pg 15 | ®) Asmash or a hit? Researchers asked 150 subjects to 


recall the details of a car accident they watched on video. Fifty 
subjects were randomly assigned to be asked, “About how fast were 
the cars going when they smashed into each other?” For another 50 
randomly assigned subjects, the words “smashed into” were replaced 
with “hit.” The remaining 50 subjects—the control group—were not 
asked to estimate speed. A week later, all subjects were asked if they 
saw any broken glass at the accident (there wasn’t any). The table 


shows each group’s response to the broken glass question.14 
Treatment 
“Smashed “Hit” Control 
into” 
Response Yes 6 
No 4A 


a. What proportion of subjects were given the control treatment? 


b. Find the distribution of responses about whether there was broken 
glass at the accident for the subjects in this study using relative 
frequencies. 


c. What percent of the subjects were given the “smashed into” 
treatment and said they saw broken glass at the accident? 


24. Superpowers A total of 415 children from the United Kingdom and 
the United States who completed a survey in a recent year were 
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randomly selected. Each student’s country of origin was recorded 
along with which superpower they would most like to have: the 
ability to fly, ability to freeze time, invisibility, superstrength, or 
telepathy (ability to read minds). The data are summarized in the 
following table.12 


Country 
US. 
Fly 45 
Freeze time 44 
Superpower Invisibility 37 
Superstrength 23 
Telepathy 66 


a. What proportion of students in the sample are from the United 
States? 


b. Find the distribution of superpower preference for the students in 
the sample using relative frequencies. 


c. What percent of students in the sample are from the United 
Kingdom and prefer telepathy as their superpower preference? 1 


25. Body image A random sample of 1200 U.S. college students was 
asked, “What is your perception of your own body? Do you feel that 
you are overweight, underweight, or about right?” The two-way table 
summarizes the data on perceived body image by gender.1& 


Gender 
Total 
About right 855 
Body image Overweight 235 
Underweight 110 
Total 1200 


a. What percent of respondents feel that their body weight is about 
right? 
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b. What proportion of the sample is female? 


c. What percent of respondents are males and feel that they are 
overweight or underweight? 


26. Python eggs How is the hatching of water python eggs influenced by 
the temperature of the snake’s nest? Researchers randomly assigned 
newly laid eggs to one of three water temperatures: hot, neutral, or 
cold. Hot duplicates the extra warmth provided by the mother python, 
and cold duplicates the absence of the mother. The two-way table 
summarizes the data on whether or not the eggs hatched.14 


Water temperature 


Total 
Yes 129 
Hatched? No 58 
Total 187 


a. What percent of eggs were randomly assigned to hot water? 
b. What proportion of eggs in the study hatched? 


c. What percent of eggs in the study were randomly assigned to cold 
or neutral water and hatched? 


27. pg 17 |») Asmash or a hit Refer to Exercise 23. 


a. What proportion of subjects who said they saw broken glass at the 
accident received the “hit” treatment? 


b. What percent of subjects who received the “smashed into” 
treatment said they did not see broken glass at the accident? 


28. Superpower Refer to Exercise 24. 


a. What proportion of students in the sample who prefer invisibility 
as their superpower are from the United States? 


b. What percent of students in the sample who are from the United 
Kingdom prefer superstrength as their superpower? 
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29. pg 20 |») Asmash or a hit Refer to Exercise 23. 


a. Find the distribution of responses about whether there was broken 
glass at the accident for each of the three treatment groups. Make a 
segmented bar graph to compare these distributions. 


b. Describe what the graph in part (a) reveals about the association 
between response about broken glass at the accident and treatment 
received for the subjects in the study. 


30. Superpower Refer to Exercise 24. 


a. Find the distribution of superpower preference for the students in 
the sample from each country (i.e., the United States and the 
United Kingdom). Make a segmented bar graph to compare these 
distributions. 


b. Describe what the graph in part (a) reveals about the association 
between country of origin and superpower preference for the 
students in the sample. 


31. Body image Refer to Exercise 25. 


a. Of the respondents who felt that their body weight was about right, 
what proportion were female? 


b. Of the female respondents, what percent felt that their body weight 
was about right? 


c. The segmented bar graph displays the distribution of perceived 
body image by gender. Describe what this graph reveals about the 
association between these two variables for the 1200 college 
students in the sample. 


Body image 
eal Underweight 
4 Overweight 
a About right 


Relative frequency (%) 
uv 
! 


Female Male 
Gender 


98 


32. Python eggs Refer to Exercise 26. 


a. Of the eggs that hatched, what proportion were randomly assigned 
to hot water? 


b. Of the eggs that were randomly assigned to hot water, what percent 
hatched? 


c. The segmented bar graph displays the distribution of hatching 
status by water temperature. Describe what this graph reveals 
about the association between these two variables for the python 
eggs in this experiment. 


100 4 
Hatched? 


B No 
M4 Yes 


Relative frequency (%) 
3 
N 


Cold Neutral Hot 
Water temperature 


33. Far from home A survey asked first-year college students, “How 
many miles is this college from your permanent home?” Students had 
to choose from the following options: 5 or fewer, 6 to 10, 11 to 50, 51 
to 100, 101 to 500, or more than 500. The side-by-side bar graph 
shows the percentage of students at public and private 4-year colleges 
who chose each option./8 Write a few sentences comparing the 
distributions of distance from home for students from private and 
public 4-year colleges who completed the survey. 


Type of university 
Private Public 


Percent 


S 9 ‘ 
oS s o 


FSF SSS EPSPS SS 
%° 
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Distance from home (mi) 
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34. Popular car colors Favorite car colors may differ among countries. 
The side-by-side bar graph displays data on the most popular car 
colors in a recent year for North America and Asia. Write a few 
sentences comparing the distributions.12 


Region 


North America 


Percent 
Ww 
o 


0 - 
a FA $0 Sc SO & AS 

SIE PPS REGS FIERA SE RSS 
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35. Phone navigation The bar graph displays data on the percent of 


smartphone owners in several age groups who say that they use their 


phone for turn-by-turn navigation.22 


Percent who use smartphone 
for navigation 
cS8 8b SS 
ae ee 


18-29 30-49 50-64 65+ 
Age group (years) 


a. Describe what the graph reveals about the relationship between age 
group and use of smartphones for navigation. 


b. Would it be appropriate to make a pie chart of the data? Explain. 


36. Who goes to movies? The bar graph displays data on the percent of 
people in several age groups who attended a movie in the past 12 
months.24 


100 


Percent who attended a movie 
P= 


7 i ] | 

0 

18-24 25-34 35-44 45-54 55-64 65-74 75 
and older 


Age group (years) 


a. Describe what the graph reveals about the relationship between age 
group and movie attendance. 


b. Would it be appropriate to make a pie chart of the data? Explain. 


37. Marginal totals aren’t the whole story Here are the row and column 
totals for a two-way table with two rows and two columns: 


60 40 100 


Find two different sets of counts a, b, c, and d for the body of the 
table that give these same totals. This shows that the relationship 
between two variables cannot be obtained from the two individual 
distributions of the variables. 


38. Women and children first? Here’s another table that summarizes 
data on survival status by gender and class of travel on the Titanic: 


Class of travel 


First class Second class Third class 
Survival status Female | Male | Female | Male | Female | Male 
Survived 
Died 


a. Find the distributions of survival status for males and for females 
within each class of travel. Did women survive the disaster at 
higher rates than men? Explain. 


b. In an earlier example, we noted that survival status is associated 
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with class of travel. First-class passengers had the highest survival 
rate, while third-class passengers had the lowest survival rate. 
Does this same relationship hold for both males and females in all 
three classes of travel? Explain. 


39. Simpson’s paradox Accident victims are sometimes taken by 
helicopter from the accident scene to a hospital. Helicopters save 
time. Do they also save lives? The two-way table summarizes data 
from a sample of patients who were transported to the hospital by 
helicopter or by ambulance.22 

Method of transport 
Helicopter | Ambulance | Total 


Died 324 
Survival status Survived 976 
Total 200 1100 1300 


a. What percent of patients died with each method of transport? 
Here are the same data broken down by severity of accident: 


Serious accidents 


Method of transport 


Survival status 


Less serious accidents 
Method of transport 
Helicopter | Ambulance | Total 


Died 216 
Survival status Survived 884 
Total 100 1000 1100 


b. Calculate the percent of patients who died with each method of 
transport for the serious accidents. Then calculate the percent of 
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40. 


41. 


patients who died with each method of transport for the less 
serious accidents. What do you notice? 


c. See if you can explain how the result in part (a) is possible given 
the result in part (b). 


Note: This is an example of Simpson’s paradox, which states that an 
association between two variables that holds for each value of a third 
variable can be changed or even reversed when the data for all values 
of the third variable are combined. 


Multiple Choice Select the best answer for Exercises 40-43. 


For which of the following would it be inappropriate to display the 
data with a single pie chart? 


a. The distribution of car colors for vehicles purchased in the last 
month 


b. The distribution of unemployment percentages for each of the 50 
states 


c. The distribution of favorite sport for a sample of 30 middle school 
students 


d. The distribution of shoe type worn by shoppers at a local mall 


e. The distribution of presidential candidate preference for voters in a 
state 


The following bar graph shows the distribution of favorite subject for 
a sample of 1000 students. What is the most serious problem with the 
graph? 


Number of students 


Math Science English Social Foreign Fine 
studies language arts 
Favorite subject 
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42. 


d. 


e. 


The subjects are not listed in the correct order. 

This distribution should be displayed with a pie chart. 
The vertical axis should show the percent of students. 
The vertical axis should start at 0 rather than 100. 


The foreign language bar should be broken up by language. 


The Dallas Mavericks won the NBA championship in the 2010-2011 
season. The two-way table displays the relationship between the 
outcome of each game in the regular season and whether the 
Mavericks scored at least 100 points. 


Outcome of game 


Points scored 


Which of the following is the best evidence that there is an 
association between the outcome of a game and whether or not the 
Mavericks scored at least 100 points? 


The Mavericks won 57 games and lost only 25 games. 


The Mavericks scored at least 100 points in 47 games and fewer 
than 100 points in only 35 games. 


The Mavericks won 43 games when scoring at least 100 points and 
only 14 games when scoring fewer than 100 points. 


The Mavericks won a higher proportion of games when scoring at 
least 100 points (43/47) than when they scored fewer than 100 
points (14/35). 


The combination of scoring 100 or more points and winning the 
game occurred more often (43 times) than any other combination 
of outcomes. 
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43. The following partially completed two-way table shows the marginal 
distributions of gender and handedness for a sample of 100 high 
school students. 


Gender 
Total 
Right 90 
Dominant Left 10 
mang Total 100 


If there is no association between gender and handedness for the 
members of the sample, which of the following is the correct value of 
xX? 


a. 20 
b. 30 
c. 36 
d. 45 


e. Impossible to determine without more information. 


Recycle and Review 


44. Hotels (Introduction) A high school lacrosse team is planning to go 
to Buffalo for a three-day tournament. The tournament’s sponsor 
provides a list of available hotels, along with some information about 
each hotel. The following table displays data about hotel options. 
Identify the individuals and variables in this data set. Classify each 
variable as categorical or quantitative. 


Distance Room 

Exercise Internet tosite Room _ rate 
Hotel Pool room? ($/day) Restaurants (mi) service? ($/day) 
Comfort Inn Out ¥ 0.00 1 8.2 Y 149 
Fairfield Inn In Y 0.00 1 8.3 N 119 
& Suites 
Baymont Out ¥ 0.00 1 3.7 ¥ 60 
Inn & Suites 
Chase Suite Out N 15.00 0 1.5 N 139 


Hotel 


Courtyard In Y 0.00 1 0.2 Dinner 114 
Hilton In Y 10.00 Z 0.1 ¥. 156 


Marriott In Y 9.95 2 0.0 Y 145 
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SECTION 1.2 Displaying Quantitative Data with 
Graphs 


LEARNING TARGETS By the end of the section, you should be able to: 


Make and interpret dotplots, stemplots, and histograms of 
quantitative data. 


Identify the shape of a distribution from a graph. 


Describe the overall pattern (shape, center, and variability) of a 
distribution and identify any major departures from the pattern 
(outliers). 


Compare distributions of quantitative data using dotplots, stemplots, 
and histograms. 


To display the distribution of a categorical variable, use a bar graph or a 
pie chart. How can we picture the distribution of a quantitative variable? In 
this section, we present several types of graphs that can be used to display 
quantitative data. 


Dotplots 


One of the simplest graphs to construct and interpret is a dotplot. 


DEFINITION Dotplot 


A dotplot shows each data value as a dot above its location on a 
number line. 


Here are data on the number of goals scored in 20 games played by the 
2016 U.S. women’s soccer team: 


55 11052112332142d12i1°9 3 
Figure 1.5 shows a dotplot of these data. 
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Goals scored 


FIGURE 1.5 Dotplot of goals scored in 20 games by the 2016 U.S. women’s 
soccer team. 


It is fairly easy to make a dotplot by hand for small sets of quantitative 
data. 


HOW TO MAKE A DOTPLOT 


e Draw and label the axis. Draw a horizontal axis and put the name of 
the quantitative variable underneath. Be sure to include units of 
measurement. 


Scale the axis. Look at the smallest and largest values in the data set. 


Start the horizontal axis at a convenient number equal to or less than 
the smallest value and place tick marks at equal intervals until you 
equal or exceed the largest value. 


Plot the values. Mark a dot above the location on the horizontal axis 
corresponding to each data value. Try to make all the dots the same 
size and space them out equally as you stack them. 


Remember what we said in Section 1.1: Making a graph is not an end 
in itself. When you look at a graph, always ask, “What do I see?” From 
Figure 1.5, we see that the 2016 U.S. women’s soccer team scored 4 or 
more goals in 6/20 = 0.30 or 30% of its games. That’s quite an offense! 
Unfortunately, the team lost to Sweden on penalty kicks in the 2016 
Summer Olympics. 


EXAMPLE | Give it some gas! » 
Making and interpreting dotplots 
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Ann Heath 


PROBLEM: The Environmental Protection Agency (EPA) is in charge 
of determining and reporting fuel economy ratings for cars. To estimate 
fuel economy, the EPA performs tests on several vehicles of the same 
make, model, and year. Here are data on the highway fuel economy 
ratings for a sample of 25 model year 2018 Toyota 4Runners tested by 
the EPA: 


Doe eA 2 By 2 een 2 22) 22 a 
2222 422 G22 O22 5 22 Ne 2? A222 22 9 G2 224 


a. Make a dotplot of these data. 


b. Toyota reports the highway gas mileage of its 2018 model year 
4Runners as 22 mpg. Do these data give the EPA sufficient reason to 
investigate that claim? 


SOLUTION: 
a. - 
2 
a2 @ 
2e@6¢@464 64 
2 a 4 4 4 @ a 4 
ans 22.0 22.5 23.0 23.5 


Highway fuel economy rating (mpg) 


To make the dotplot: 


e Draw and label the axis. Note variable name and units in the 
label. 
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e Scale the axis. The smallest value is 21.5 and the largest value is 
23.3. So we choose a scale from 21.5 to 23.5 with tick marks 0.1 
units apart. 


e Plot the values. 


b. No. 23 of the 25 cars tested had an estimated highway fuel economy 
of 22 mpg or greater. 
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Describing Shape 


When you describe the shape of a dotplot or another graph of quantitative 
data, focus on the main features. Look for major peaks, not for minor ups 
and downs in the graph. Look for clusters of values and obvious gaps. 
Decide if the distribution is roughly symmetric or clearly skewed. 


DEFINITION Symmetric and skewed distributions 


A distribution is roughly symmetric if the right side of the graph 
(containing the half of observations with the largest values) is 
approximately a mirror image of the left side. 
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Roughly symmetric 


eeeeeeeee 


a 


Skewed to the right 


en. aaa 


Skewed to the left 


A distribution is skewed to the right if the right side of the graph is 
much longer than the left side. 


A distribution is skewed to the left if the left side of the graph is 
much longer than the right side. 


We could also describe a distribution with a long tail to the left as “skewed toward 


negative values” or “negatively skewed” and a distribution with a long right tail as 


“positively skewed.” 


Coorecsoe \ 
eesece 


@620066 XK . 
ae 


For ease, we sometimes say “left-skewed” instead of “skewed to the 
left” and “right-skewed” instead of “skewed to the right.” oOo The 


direction of skewness is toward the long tail, not the direction where 
most observations are clustered. The drawing is a cute but corny way to 
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help you keep this straight. To avoid danger, Mr. Starnes skis on the 
gentler slope—in the direction of the skewness. 


EXAMPLE | Quiz scores and die rolls >» 
Describing shape 


Malerapaso/Getty Images 


PROBLEM: The dotplots display two different sets of quantitative 
data. Graph (a) shows the scores of 21 statistics students on a 20-point 
quiz. Graph (b) shows the results of 100 rolls of a 6-sided die. Describe 
the shape of each distribution. 


a. : 
° i i te Be 
12 14 16 18 20 
Quiz score (points) 
FF : 
H Hy H H H H 
1 2 3 4 5 6 
Roll 
SOLUTION: 


a. The distribution of statistics quiz scores is skewed to the left, with a 
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single peak at 20 (a perfect score). There are two small gaps at 12 
and 16. 


b. The distribution of die rolls is roughly symmetric. It has no clear 
peak. 


We can describe the shape of the distribution in part (b) as 


“approximately uniform” because the frequencies are about the same for 
all possible rolls. 
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Some people refer to graphs with a single peak as unimodal, to graphs with two 


peaks as bimodal, and to graphs with more than two clear peaks as multimodal. 


Some quantitative variables have distributions with easily described 
shapes. But many distributions have irregular shapes that are neither 
symmetric nor skewed. Some distributions show other patterns, like the 
dotplot in Figure 1.6. This graph shows the durations (in minutes) of 220 
eruptions of the Old Faithful geyser. The dotplot has two distinct clusters 
and two peaks: one at about 2 minutes and one at about 4.5 minutes. When 
you examine a graph of quantitative data, describe any pattern you see as 
clearly as you can. 


Duration (min) 


FIGURE 1.6 Dotplot displaying duration (in minutes) of 220 Old Faithful 
eruptions. This graph has two distinct clusters and two clear peaks. 


Some quantitative variables have distributions with predictable shapes. 
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Many biological measurements on individuals from the same species and 
gender—lengths of bird bills, heights of young women—have roughly 
symmetric distributions. Salaries and home prices, on the other hand, 
usually have right-skewed distributions. There are many moderately priced 
houses, for example, but the few very expensive mansions give the 
distribution of house prices a strong right skew. 


CHECK YOUR UNDERSTANDING 


( 


y Knoebels Amusement Park in Elysburg, Pennsylvania, has earned 
acclaim for being an affordable, family-friendly entertainment venue. 
Knoebels does not charge for general admission or parking, but it does 
charge customers for each ride they take. How much do the rides cost at 
Knoebels? The table shows the cost for each ride in a sample of 22 rides 
in a recent year. 


Name Cost 
Merry Mixer $1.50 
Italian Trapeze $1.50 
Satellite $1.50 
Galleon $1.50 
Whipper $1.25 
Skooters $1.75 
Ribbit $1.25 
Roundup $1.50 
Paradrop $1.25 
The Phoenix $2.50 
Gasoline Alley $1.75 
Looper $1.75 
Flying Turns $3.00 
Flyer $1.50 
The Haunted Mansion $1.75 
StratosFear $2.00 
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Twister $2.50 


Cosmotron $1.75 
Paratrooper $1.50 
Downdraft $1.50 
Rockin’ Tug $1.25 
Sklooosh! $1.75 


1. Make a dotplot of the data. 
2. Describe the shape of the distribution. 


Describing Distributions 


Here is a general strategy for describing a distribution of quantitative data. 


HOW TO DESCRIBE THE DISTRIBUTION OF A 
QUANTITATIVE VARIABLE 


In any graph, look for the overall pattern and for clear departures from 
that pattern. 


e You can describe the overall pattern of a distribution by its shape, 
center, and variability. 


e An important kind of departure is an outlier, an observation that falls 
outside the overall pattern. 


Variability is sometimes referred to as spread. We prefer variability because students 
sometimes think that spread refers only to the distance between the maximum and 
minimum value of a quantitative data set (the range). There are several ways to 


measure the variability (spread) of a distribution, including the range. 


AP® EXAM TIP 


Always be sure to include context when you are asked to describe a 
distribution. This means using the variable name, not just the units the 
variable is measured in. 
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(EES 


We will discuss more formal ways to measure center and variability 
and to identify outliers in Section 1.3. For now, just use the median 
(middle value in the ordered data set) when describing center and the 
minimum and maximum when describing variability. 


Let’s practice with the dotplot of goals scored in 20 games played by 
the 2016 U.S. women’s soccer team. 


N —eecee 
Woieee 

-| 
w-eee 

| 
i+ 
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Goals scored 


When describing a distribution of quantitative data, don’t forget: Statistical 


Opinions Can Vary (Shape, Outliers, Center, Variability). 


Shape: The distribution of goals scored is skewed to the right, with a 
single peak at 1 goal. There is a gap between 5 and 9 goals. 


Outliers: | The games when the team scored 9 and 10 goals appear to be 
outliers. 


Center: The median is 2 goals scored. 


Variability: The data vary from 1 to 10 goals scored. 


EXAMPLE | Give it some gas! » 


Describing a distribution 


= —_—. gaye ey os 


Daren Starnes 


PROBLEM: Here is a dotplot of the highway fuel economy ratings for 
a sample of 25 model year 2018 Toyota 4Runners tested by the EPA. 
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Describe the distribution. 


T | 
21.5 22.0 22.5 
Highway fuel economy rating (mpg) 


SOLUTION: 


Shape: The distribution of highway fuel economy ratings is 
roughly symmetric, with a single peak at 22.4 mpg. There 
are two Clear gaps: between 21.5 and 21.9 mpg and 
between 22.9 and 23.3 mpg. 


Outliers: |The cars with 21.5 mpg and 23.3 mpg ratings are possible 
outliers. 


Center: The median rating is 22.4 mpg. 


Variability: The ratings vary from 21.5 to 23.3 mpg. 


Be sure to include context by discussing the variable of interest, highway 


fuel economy ratings. And give the units of measurement: miles per 
gallon (mpg). 
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Comparing Distributions 


Some of the most interesting statistics questions involve comparing two or 
more groups. Which of two popular diets leads to greater long-term weight 
loss? Who texts more—males or females? As the following example 
suggests, you should always discuss shape, outliers, center, and variability 
whenever you compare distributions of a quantitative variable. 


EXAMPLE | Household size: U.K. versus South Africa 
Ne 


Comparing distributions 
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FrankvandenBergh/Getty Images 


PROBLEM: How do the numbers of people living in households in the 
United Kingdom (U.K.) and South Africa compare? To help answer this 
question, we used Census At School’s “Random Data Selector” to 
choose 50 students from each country. Here are dotplots of the 
household sizes reported by the survey respondents. Compare the 
distributions of household size for these two countries. 


: 
a 
5 
{o} 
a eeee8e 
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ee 
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eeeene 
eeeee 
eeeene 
rs a a Sr a rs 
0 5 10 15 20 25 30 
Household size 
AP® EXAM TIP 
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When comparing distributions of quantitative data, it’s not enough 
just to list values for the center and variability of each distribution. 
You have to explicitly compare these values, using words like 
“greater than,” “less than,” or “about the same as.” 


SOLUTION: 


Shape: The distribution of household size for the U.K. sample is 
roughly symmetric, with a single peak at 4 people. The 
distribution of household size for the South Africa sample 
is skewed to the right, with a single peak at 4 people and a 
clear gap between 15 and 26. 


Outliers: | There don’t appear to be any outliers in the U.K. 
distribution. The South African distribution seems to have 
two outliers: the households with 15 and 26 people. 


Center: Household sizes for the South African students tend to be 
larger (median = 6 people) than for the U.K. students 
(median = 4 people). 


Variability: The household sizes for the South African students vary 
more (from 3 to 26 people) than for the U.K. students 
(from 2 to 6 people). 


Don’t forget to include context! It isn’t enough to refer to the U.K. 


distribution or the South Africa distribution. You need to mention the 
variable of interest, household size. 
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Notice that in the preceding example, we discussed the distributions of 
household size only for the two samples of 50 students. We might be 
interested in whether the sample data give us convincing evidence of a 
difference in the population distributions of household size for South 
Africa and the United Kingdom. We’|I have to wait a few chapters to 
decide whether we can reach such a conclusion, but our ability to make 
such an inference later will be helped by the fact that the students in our 
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samples were chosen at random. 


CHECK YOUR UNDERSTANDING 


A 


\ 


y For a statistics class project, Jonathan and Crystal hosted an ice- 
cream-eating contest. Each student in the contest was given a small cup 
of ice cream and instructed to eat it as fast as possible. Jonathan and 
Crystal then recorded each contestant’s gender and time (in seconds), as 
shown in the dotplots. Compare the distributions of eating times for 
males and females. 


Female 


Gender 


. 
15 30 45 60 i) 90 105 
Time to eat ice cream (sec) 


Stemplots 


Another simple type of graph for displaying quantitative data is a 
stemplot. 


DEFINITION Stemplot 


A stemplot shows each data value separated into two parts: a stem, 
which consists of all but the final digit, and a leaf, the final digit. The 
stems are ordered from lowest to highest and arranged in a vertical 
column. The leaves are arranged in increasing order out from the 
appropriate stems. 


A stemplot is also known as a stem-and-leaf plot. 


120 


Here are data on the resting pulse rates (beats per minute) of 19 middle 
school students: 


71 104 76 88 78 71 68 86 70 90 74 76 69 68 88 96 | 


Figure 1.7 shows a stemplot of these data. 


Key: 8|2 isa 
student whose 


resting pulse 


rate is 82 beats 


per minute. 


FIGURE 1.7 Stemplot of the resting pulse rates of 19 middle school students. 


According to the American Heart Association, a resting pulse rate 
above 100 beats per minute is considered high for this age group. We can 
see that 2/19 = 0.105 = 10.5% of these students have high resting pulse 
rates by this standard. Also, the distribution of pulse rates for these 19 
students is skewed to the right (toward the larger values). 

Stemplots give us a quick picture of a distribution that includes the 
individual observations in the graph. It is fairly easy to make a stemplot by 
hand for small sets of quantitative data. 


HOW TO MAKE A STEMPLOT 


Make stems. Separate each observation into a stem, consisting of all 
but the final digit, and a leaf, the final digit. Write the stems in a 
vertical column with the smallest at the top. Draw a vertical line at 
the right of this column. Do not skip any stems, even if there is no 
data value for a particular stem. 


Add leaves. Write each leaf in the row to the right of its stem. 


Order leaves. Arrange the leaves in increasing order out from the 
stem. 


Add a key. Provide a key that identifies the variable and explains 
what the stems and leaves represent. 
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EXAMPLE | Wear your helmets! (>) 
Making and interpreting stemplots 


Pete Saloutos/AGE Fotostock 


PROBLEM: Many athletes (and their parents) worry about the risk of 
concussions when playing sports. A football coach plans to obtain 
specially made helmets for his players that are designed to reduce the 
chance of getting a concussion. Here are the measurements of head 
circumference (in inches) for the 30 players on the team: 


2a 2222/7 220223 22,0225) 21,5227 25,6206 23,0242 23.5208 
24.0 22.7 22.6 23.9 22.5 23.1 21.9 21.0 22.4 23.5 22.5 23.9 23.4 21.6 23.3 


a. Make a stemplot of these data. 


b. Describe the shape of the distribution. Are there any obvious 


outliers? 
SOLUTION: 
eh anes ‘Key: 23ls is a | 
21 | 05679 
player with | 
22 | 0234556677F 
a head | 


23 | 001345599 
circumference 
24 | 02 . 
of 23.5 inches. 


To make the stemplot: 


e Make stems. The smallest head circumference is 20.8 inches and 
the largest is 25.6 inches. We use the first two digits as the stem 
and the final digit as the leaf. So we need stems from 20 to 25. 
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e Add leaves. 


e Order leaves. 


e Adda key. 


b. The distribution of head circumferences for the 30 players on the 
team is roughly symmetric, with a single peak on the 22-inch stem. 
There are no obvious outliers. 
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We can get a better picture of the head circumference data by splitting 
stems. In Figure 1.8(a), leaf values from 0 to 9 are placed on the same 
stem. Figure 1.8(b) shows another stemplot of the same data. This time, 
values with leaves from 0 to 4 are placed on one stem, while those with 
leaves from 5 to 9 are placed on another stem. Now we can see the shape 
of the distribution even more clearly—including the possible outlier at 
25.6 inches. 


20|88 20| 88 
21|}05679 21)0 
22102345566777 Key: 23)5 is a player with a head 2115679 
231001345599 circumference of 23.5 inches. 2210234 
24 22)5566777 
2516 23|}00134 
23|5599 
24|02 
24 
25 
2516 
(a) (b) 


FIGURE 1.8 Two stemplots showing the head circumference data. The graph 
in (b) improves on the graph in (a) by splitting stems. 
Here are a few tips to consider before making a stemplot: 


e There is no magic number of stems to use. Too few or too many stems 
will make it difficult to see the distribution’s shape. Five stems is a good 
minimum. 


e If you split stems, be sure that each stem is assigned an equal number of 
possible leaf digits. 


e When the data have too many digits, you can get more flexibility by 
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rounding or truncating the data. See Exercises 61 and 62 for an 
illustration of rounding data before making a stemplot. 


You can use a back-to-back stemplot with common stems to compare 
the distribution of a quantitative variable in two groups. The leaves are 
placed in order on each side of the common stem. For example, Figure 1.9 
shows a back-to-back stemplot of the 19 middle school students’ resting 
pulse rates and their pulse rates after 5 minutes of running. 


Resting After exercise 


9888 | 6 
8664110 7 
8862 | 8) 6788 
60} 9] 02245899 

4110} 044 
11] 8 
0/12] 44 


Key: 8|2 is a student 
whose pulse rate is 
82 beats per minute. 
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FIGURE 1.9 Back-to-back stemplot of 19 middle school students’ resting 
pulse rates and their pulse rates after 5 minutes of running. 


CHECK YOUR UNDERSTANDING 


(\ 


\ 


y 


1. Write a few sentences comparing the distributions of resting and 
after-exercise pulse rates in Figure 1.9. 


Multiple Choice: Select the best answer for Questions 2-4. 


Here is a stemplot of the percent of residents aged 65 and older in the 50 
states and the District of Columbia: 


Key: 8/8 represents a state 
in which 8.8% of residents 
are 65 and older. 


6 

7 

8 

9 

10 

11 

12 | 012223444457888999 
13 | 01233333444899 
14 | 02666 

15 | 23 

1618 


2. The low outlier is Alaska. What percent of Alaska residents are 65 or 
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older? 
a. 0.68 
b. 6.8 
c. 8.8 
d. 16.8 
e. 68 
3. Ignoring the outlier, the shape of the distribution is 
a. skewed to the right. 
b. skewed to the left. 
c. skewed to the middle. 
d. double-peaked. 
e. roughly symmetric. 
4. The center of the distribution is close to 
a. 11.6%. 
b. 12.0%. 
c. 12.8%. 
d. 13.3%. 
e. 6.8% to 16.8%. 


Histograms 


You can use a dotplot or stemplot to display quantitative data. Both graphs 
show every individual data value. For large data sets, this can make it 
difficult to see the overall pattern in the graph. We often get a clearer 
picture of the distribution by grouping together nearby values. Doing so 
allows us to make a new type of graph: a histogram. 


DEFINITION Histogram 


A histogram shows each interval of values as a bar. The heights of 
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the bars show the frequencies or relative frequencies of values in each 
interval. 


Figure 1.10 shows a dotplot and a histogram of the durations (in 
minutes) of 220 eruptions of the Old Faithful geyser. Notice how the 
histogram groups together nearby values. 


= 
1 


Frequency 
| 


L5 2.0 2.5 3.0 35 4.0 4.5 5.0 iS 2.0 2.5 3.0 3.5 4.0 4.5 5.0 
(a) Duration (min) (b) Duration (min) 


FIGURE 1.10 (a) Dotplot and (b) histogram of the duration (in minutes) of 
220 eruptions of the Old Faithful geyser. 


It is fairly easy to make a histogram by hand. Here’s how you do it. 


HOW TO MAKE A HISTOGRAM 


e Choose equal-width intervals that span the data. Five intervals is a 
good minimum. 


e Make a table that shows the frequency (count) or relative frequency 
(percent or proportion) of individuals in each interval. Put values that 
fall on an interval boundary in the interval containing larger values. 


e Draw and label the axes. Draw horizontal and vertical axes. Put the 
name of the quantitative variable under the horizontal axis. To the 
left of the vertical axis, indicate whether the graph shows the 
frequency (count) or relative frequency (percent or proportion) of 
individuals in each interval. 
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e Scale the axes. Place equally spaced tick marks at the smallest value 
in each interval along the horizontal axis. On the vertical axis, start at 
0 and place equally spaced tick marks until you exceed the largest 
frequency or relative frequency in any interval. 


e Draw bars above the intervals. Make the bars equal in width and 
leave no gaps between them. Be sure that the height of each bar 
corresponds to the frequency or relative frequency of individuals in 
that interval. An interval with no data values will appear as a bar of 
height 0 on the graph. 


It is possible to choose intervals of unequal widths when making a 
histogram. Such graphs are beyond the scope of this book. 


EXAMPLE | How much tax? (>) 


Making and interpreting histograms 


PROBLEM: Sales tax rates vary widely across the United States. Four 
states charge no state or local sales tax: Delaware, Montana, New 
Hampshire, and Oregon. The table shows data on the average total tax 
rate for each of the remaining 46 states and the District of Columbia.22 


State Tax rate (%) 
Alabama 9.0 
Alaska 1.8 
Arizona 8.3 
Arkansas whe) 
California 8.9 
Colorado Tas 
Connecticut 6.4 
Florida Oy7, 
Georgia TD} 
Hawaii 4.4 
Idaho 6.0 
Illinois 8.6 
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Indiana 


Iowa 


Kansas 
Kentucky 
Louisiana 
Maine 
Maryland 
Massachusetts 
Michigan 
Minnesota 
Mississippi 
Missouri 
Nebraska 
Nevada 

New Jersey 
New Mexico 
New York 
North Carolina 
North Dakota 
Ohio 
Oklahoma 
Pennsylvania 
Rhode Island 
South Carolina 
South Dakota 
Tennessee 
Texas 

Utah 
Vermont 
Virginia 
Washington 
West Virginia 
Wisconsin 
Wyoming 


District of Columbia 
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7.0 
6.8 


8.6 
6.0 
9.0 
ooo 
6.0 
O23 
6.0 
fis: 
FeAl 
dee 
6.9 
8.0 
7.0 
Tae 
8.5 
6.9 
6.8 
fal 
8.8 
6.3 
70 
a2 
D8 
obs. 
8.2 
6.7 
6.2 
5.6 
8.9 
6.2 
5.4 
3.4 
5.8 


a. Make a frequency histogram to display the data. 


b. What percent of values in the distribution are less than 6.0? Interpret 
this result in context. 


SOLUTION: 


Number of states 
an © S BS 
| 


Tax rate (%) 


Interval Frequency 
1.0 to < 2.0 1 

2.0 to < 3.0 0 

3.0 to < 4.0 0 

4.0 to < 5.0 1 

5.0 to < 6.0 6 

6.0 to < 7.0 15 

7.0 to < 8.0 it 

8.0 to < 9.0 9 

9.0 to < 10.0 


To make the histogram: 


e Choose equal-width intervals that span the data. The data vary 
from 1.8 percent to 9.5 percent. So we choose intervals of width 
1.0, starting at 1.0%. 


e Make a table. Record the number of states in each interval to 
make a frequency histogram. 


e Draw and label the axes. Don’t forget units (percent) for the 


variable (tax rate). 


e Scale the axes. 
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| e Draw bars. | 


b. 8/47 = 0.170 = 17.0%; 17% of the states (including the District of 
Columbia) have tax rates less than 6%. 
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Figure 1.11 shows two different histograms of the state sales tax data. 
Graph (a) uses the intervals of width 1% from the preceding example. The 
distribution has a single peak in the 6.0 to <7.0 interval. Graph (b) uses 
intervals half as wide: 1.0 to <1.5, 1.5 to <2.0, and so on. Now we see a 
distribution with more than one distinct peak. oo The choice of intervals 
in a histogram can affect the appearance of a distribution. Histograms 
with more intervals show more detail but may have a less clear overall 
pattern. 


16 gj 
7 14 g 7 
< 12 S 6- 
g a 
2 10 5 
S a 
ae 2 4 
= 6 = 3 
5 4 7 24 
Zz. ml 

0 0 

| fi 3 4 5 6 7 8 9 10 | 2 3 4 5 6 7 8 9 10 

(a) Tax rate (%) (b) Tax rate (%) 


FIGURE 1.11 (a) Frequency histogram of the sales tax rate in the states that 
have local or state sales taxes and the District of Columbia with intervals of 
width 1.0%, from the preceding example. (b) Frequency histogram of the data 
with intervals of width 0.5%. 


You can use a graphing calculator, statistical software, or an applet to 
make a histogram. The technology’s default choice of intervals is a good 
starting point, but you should adjust the intervals to fit with common 
sense. 
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2. Technology Corner |MAKING HISTOGRAMS (p>) 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


1. Enter the data from the sales tax example in your Statistics/List 
Editor. 


e Press |STAT| and choose Edit... 


e Type the values into list L1. 


2. Set up a histogram in the Statistics Plots menu. 


° Press (STAT PLOT). 
e Press or |1/ to go into Plot1. 


e Adjust the settings as shown. 


NORMAL FLOAT AUTO REAL RADIAN MP 1 


fathe§ Plot2 Plot3 


3. Use ZoomStat to let the calculator choose intervals and make a 
histogram. 


e Press |ZOOM| and choose ZoomStat. 
e Press |TRACE| to examine the intervals. 
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NORMAL FLOAT AUTO REAL RADIAN MP f 
Ploti:La 


min=1.8 
max<2.3 


4. Adjust the intervals to match those in Figure 1.11(a), and then graph 
the histogram. 


e Press |WINDOW]| and enter the values shown for Xmin, Xmax, 
Xscl, Ymin, Ymax, and Yscl. 


e Press |\GRAPH|. 
e Press |TRACE| to examine the intervals. 


NORMAL FLOAT AUTO REAL RADIAN MP 1 


WINDOW 

Xmin=0 

Xmax=11 

Xscl=1 

Ymin=-5 

Ymax=20 

Yscl=2 

Xres=1 
ox=0.04166666666666 
TraceStep=0. 083333333333... 


5. See if you can match the histogram in Figure 1.11(b). 


NORMAL FLOAT AUTO REAL RADIAN MP 


Ploti:La 


AP® EXAM TIP 
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If you’re asked to make a graph on a free-response question, be sure 
to label and scale your axes. Unless your calculator shows labels and 
scaling, don’t just transfer a calculator screen shot to your paper. 


CHECK YOUR UNDERSTANDING 


A 


y Many people believe that the distribution of IQ scores follows a 
“bell curve,” like the one shown. But is this really how such scores are 
distributed? The IQ scores of 60 fifth-grade students chosen at random 


from one school are shown here.24 


145 139 126 122 125 130 96 110 118 «118 
101) 142 #134 #124 #112 #109 «#134 «#+113~«#281~=«#(113 
123. 94 100 136 109 131 117 «#110 127 += 124 
106 124 #115 #133 «£116 #102 127 «©1117 109 = 137 
117. 90 §=©103 114 139 101 122 105 £97 ~~ 89 
102. 108 110 128 114 #%112 #114 «102 ~ «82 ~=«101 


1. Construct a histogram that displays the distribution of IQ scores 
effectively. 


2. Describe what you see. Is the distribution bell-shaped? 


Using Histograms Wisely 


We offer several cautions based on common mistakes students make when 
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using histograms. 


, o@ Don’t confuse histograms and bar graphs. Although histograms 


resemble bar graphs, their details and uses are different. A histogram 
displays the distribution of a quantitative variable. Its horizontal axis 
identifies intervals of values that the variable takes. A bar graph displays 
the distribution of a categorical variable. Its horizontal axis identifies the 
categories. Be sure to draw bar graphs with blank space between the 
bars to separate the categories. Draw histograms with no space between 
bars for adjacent intervals. For comparison, here is one of each type of 
graph from earlier examples: 

Histogram Bar graph 
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a Oe ; ee Messaging 
Tax rate (%) Preferred communication method 


a oOo Use percents or proportions instead of counts on the vertical 


axis when comparing distributions with different numbers of 
observations. Mary was interested in comparing the reading levels of a 
biology journal and an airline magazine. She counted the number of 
letters in the first 400 words of an article in the journal and of the first 
100 words of an article in the airline magazine. Mary then used 
statistical software to produce the histograms shown in Figure 1.12(a). 
This figure is misleading—it compares frequencies, but the two samples 
were of very different sizes (400 and 100). Using the same data, Mary’s 
teacher produced the histograms in Figure 1.12(b). By using relative 
frequencies, this figure makes the comparison of word lengths in the 
two samples much easier. 
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Journal Magazine Journal Magazine 


Frequency 
Percent 


ae es Cs ey Cl a ee es | 


0 2 4 6 8 10 12 140 2 4 6 8 10 12 14 0 2 4 6 8 10 12 140 2 4 6 8 10 12 14 
(a) Word length Word length (b) Word length Word length 


FIGURE 1.12 Two sets of histograms comparing word lengths in articles 
from a biology journal and from an airline magazine. In graph (a), the vertical 
scale uses frequencies. Graph (b) fixes the problem of different sample sizes 
by using percents (relative frequencies) on the vertical scale. 


3. o@ Just because a graph looks nice doesn’t make it a meaningful 


display of data. The 15 students in a small statistics class recorded the 
number of letters in their first names. One student entered the data into 
an Excel spreadsheet and then used Excel’s “chart maker” to produce 
the graph shown on the left. What kind of graph is this? It’s a bar graph 
that compares the raw data values. But first-name length is a 
quantitative variable, so a bar graph is not an appropriate way to display 
its distribution. The histogram on the right is a much better choice 
because the graph makes it easier to identify the shape, center, and 
variability of the distribution of name length. 
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CHECK YOUR UNDERSTANDING 
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1. Write a few sentences comparing the distributions of word length 


shown in Figure 1.12(b). 


Questions 2 and 3 refer to the following setting. About 3 million first- 
year students enroll in colleges and universities each year. What do they 
plan to study? The graph displays data on the percent of first-year 


students who plan to major in several disciplines.22 
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Percent of students who plan to major 
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o 


Field of study 


2. Is this a bar graph or a histogram? Explain. 


3. Would it be correct to describe this distribution as right-skewed? Why 
or why not? 


Section 1.2 Summary 


e You can use a dotplot, stemplot, or histogram to show the distribution 
of a quantitative variable. A dotplot displays individual values on a 
number line. Stemplots separate each observation into a stem and a one- 
digit leaf. Histograms plot the frequencies (counts) or relative 
frequencies (proportions or percents) of values in equal-width intervals. 


e Some distributions have simple shapes, such as symmetric, skewed to 
the left, or skewed to the right. The number of peaks is another aspect 
of overall shape. So are distinct clusters and gaps. 
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e When examining any graph of quantitative data, look for an overall 
pattern and for clear departures from that pattern. Shape, center, and 
variability describe the overall pattern of the distribution of a 
quantitative variable. Outliers are observations that lie outside the 
overall pattern of a distribution. 


e When comparing distributions of quantitative data, be sure to compare 
shape, center, variability, and possible outliers. 


e Remember: histograms are for quantitative data; bar graphs are for 
categorical data. Be sure to use relative frequencies when comparing 
data sets of different sizes. 


1.2 Technology Corner | » 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 
2. Making histograms Page 43 


Section 1.2 Exercises 


45. pg 31 |») Feeling sleepy? Students in a high school statistics class 


responded to a survey designed by their teacher. One of the survey 
questions was “How much sleep did you get last night?” Here are the 
data (in hours): 


") 6 8 7 8 8 6 6.5 pe. of 9.0 4 3 4 
fs) 6 11 6 3 7 6 10.0 7 8 4.5 a 7 
a. Make a dotplot to display the data. 


b. Experts recommend that high school students sleep at least 9 hours 
per night. What proportion of students in this class got the 
recommended amount of sleep? 


46. Easy reading? Here are data on the lengths of the first 25 words ona 
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47. 


48. 


49. 


randomly selected page from Toni Morrison’s Song of Solomon: 


2 3 4 10 2 11 2 8 4 3 7 2 4 
fs) 3 6 4 4 2 fs) 8 2 3 4 4 


a. Make a dotplot of these data. 


b. Long words can make a book hard to read. What percentage of 
words in the sample have 8 or more letters? 


U.S. women’s soccer—2016 Earlier, we examined data on the 
number of goals scored by the 2016 U.S. women’s soccer team in 20 
games played. The following dotplot displays the goal differential for 
those same games, computed as U.S. goals scored minus opponent 
goals scored. 


° 
° 
e J ° 
T T T T T T T T 
3 4 5 6 7 8 9 LO 
Difference (U.S. — Opponent) in goals scored 
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a. Explain what the dot above 3 represents. 


b. What does the graph tell us about how well the team did in 2016? 
Be specific. 


Fuel efficiency The dotplot shows the difference (Highway — City) in 
EPA mileage ratings, in miles per gallon (mpg) for each of 24 model 
year 2018 cars. 


e 
3 
T T T T T ir 
-2 0 2 o 6 8 10 12 


Difference (Highway — City) in EPA mileage rating (mpg) 


a. Explain what the dot above —3 represents. 


b. What does the graph tell us about fuel economy in the city versus 
on the highway for these car models? Be specific. 


pg 33 |p) Getting older How old is the oldest person you know? 


Prudential Insurance Company asked 400 people to place a blue 
sticker on a huge wall next to the age of the oldest person they have 
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50. 


ol. 


52. 


53. 


4. 
55. 


ever known. An image of the graph is shown here. Describe the shape 
of the distribution. 


Pair-a-dice The dotplot shows the results of rolling a pair of fair, six- 
sided dice and finding the sum of the up-faces 100 times. Describe 
the shape of the distribution. 


—eeeeeees 


e e e J . a J e 

T T T T T T T T T 

2 4 6 8 10 12 
Sum of the up-faces on a pair of dice 


Feeling sleepy? Refer to Exercise 45. Describe the shape of the 
distribution. 


Easy reading? Refer to Exercise 46. Describe the shape of the 
distribution. 
pg 35 |p) U.S. women’s soccer—2016 Refer to Exercise 47. 


Describe the distribution. 
Fuel efficiency Refer to Exercise 48. Describe the distribution. 


pg 36 |») Making money The parallel dotplots show the total 


family income of randomly chosen individuals from Indiana (38 
individuals) and New Jersey (44 individuals). Compare the 
distributions of total family incomes in these two samples. 


rateang@adeaseneees Ss 


New Jersey + is T a 
0 25 50 75 100 125 150 175 
Total family income ($1000s) 
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56. Healthy streams Nitrates are organic compounds that are a main 
ingredient in fertilizers. When those fertilizers run off into streams, 
the nitrates can have a toxic effect on fish. An ecologist studying 
nitrate pollution in two streams measures nitrate concentrations at 42 
places on Stony Brook and 42 places on Mill Brook. The parallel 
dotplots display the data. Compare the distributions of nitrate 
concentration in these two streams. 


Stony Brook 


Mill Brook —; T r —J 
0 2 4 6 8 10 12 14 16 18 20 


Nitrate concentration (mg/L) 

57. Enhancing creativity Do external rewards—things like money, 
praise, fame, and grades—promote creativity? Researcher Teresa 
Amabile recruited 47 experienced creative writers who were college 
students and divided them at random into two groups. The students in 
one group were given a list of statements about external reasons (E) 
for writing, such as public recognition, making money, or pleasing 
their parents. Students in the other group were given a list of 
statements about internal reasons (I) for writing, such as expressing 
yourself and enjoying wordplay. Both groups were then instructed to 
write a poem about laughter. Each student’s poem was rated 
separately by 12 different poets using a creativity scale.2° These 
ratings were averaged to obtain an overall creativity score for each 
poem. Parallel dotplots of the two groups’ creativity scores are shown 
here. 


Reward 


Internal (I) External (E)_ 
. 
. 
— 
. 
. 


. e ° eee e 

7 1 = eee + eee Apesentt e + 

0 5 10 15 20 25 30 
Average rating 


a. Is the variability in creativity scores similar or different for the two 
groups? Justify your answer. 


b. Do the data suggest that external rewards promote creativity? 
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Justify your answer. 


58. Healthy cereal? Researchers collected data on 76 brands of cereal at 
a local supermarket.22 For each brand, the sugar content (grams per 
serving) and the shelf in the store on which the cereal was located (1 


= bottom, 2 = middle, 3 = top) were recorded. A dotplot of the data is 
shown here. 


Shelf 


Sugar content (grams per serving) 


a. Is the variability in sugar content of the cereals on the three shelves 
similar or different? Justify your answer. 


b. Critics claim that supermarkets tend to put sugary cereals where 
kids can see them. Do the data from this study support this claim? 
Justify your answer. (Note that Shelf 2 is at about eye level for kids 
in most supermarkets. ) 


59. pg 38 ph) Snickers® are fun! Here are the weights (in grams) of 17 


Snickers Fun Size bars from a single bag: 


17.1 17.4 16.6 17.4 17.7 17.4 17.3 1757 17.8 
19.2 16.0 15.9 16.5 16.8 16.5 17.1 16.7 

a. Make a stemplot of these data. 

b. What interesting feature does the graph reveal? 


c. The advertised weight of a Snickers Fun Size bar is 17 grams. 
What proportion of candy bars in this sample weigh less than 
advertised? 


60. Eat your beans! Beans and other legumes are a great source of 
protein. The following data give the protein content of 30 different 
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varieties of beans, in grams per 100 grams of cooked beans.22 

7a 8.2 8.9 fo 7.1 8.3 8.7 be rs, 8.2 oad 
9.0 9.0 o.7 9.2 pee) 8.1 9.0 7.8 8.0 7.8 
7.0 7.9 13.5 8.3 6.8 10.6 8.3 7.6 ees 8.1 


a. Make a stemplot of these data. 
b. What interesting feature does the graph reveal? 


c. What proportion of these bean varieties contain more than 9 grams 
of protein per 100 grams of cooked beans? 


61. South Carolina counties Here is a stemplot of the areas of the 46 
counties in South Carolina. Note that the data have been rounded to 
the nearest 10 square miles (mi?). 


3} 9999 
4| 0116689 
5 | 01115566778 
6| 47899 
7} 01245579 
8| 0011 Key: 6/4 represents 
9|13 a county with an area 
10|8 of 635 to 644.99 
11} 233 square miles. 
2 


a. What is the area of the largest South Carolina county? 


b. Describe the distribution of area for the 46 South Carolina 
counties. 


62. Shopping spree The stemplot displays data on the amount spent by 
50 shoppers at a grocery store. Note that the values have been 
rounded to the nearest dollar. 


399 
1345677889 
000123455668888 


WOOAONAUPWN OO 


a. What was the smallest amount spent by any of the shoppers? 


b. Describe the distribution of amount spent by these 50 shoppers. 
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63. Where do the young live? Here is a stemplot of the percent of 


residents aged 25 to 34 in each of the 50 states: 


11 | 44 

11 | 66778 

12 | 0134 

12 | 666778888 

13 | 0000001111444 
13 | 7788999 

14 | 0044 

14 | 567 

15] 11 


16 | 0 
a. Why did we split stems? 


b. Give an appropriate key for this stemplot. 


c. Describe the shape of the distribution. Are there any outliers? 


64. Watch that caffeine! The U.S. Food and Drug Adminis tration 


65. 


(USFDA) limits the amount of caffeine in a 12-ounce can of 
carbonated beverage to 72 milligrams. That translates to a maximum 
of 48 milligrams of caffeine per 8-ounce serving. Data on the caffeine 
content of popular soft drinks (in milligrams per 8-ounce serving) are 
displayed in the stemplot. 


556 

033344 
55667778888899 
113 

55567778 

33 

arg 


FPrWWNPr 


a. Why did we split stems? 
b. Give an appropriate key for this graph. 


c. Describe the shape of the distribution. Are there any outliers? 


Acorns and oak trees Of the many species of oak trees in the United 
States, 28 grow on the Atlantic Coast and 11 grow in California. The 
back-to-back stemplot displays data on the average volume of acorns 
(in cubic centimeters) for these 39 oak species.22 Write a few 
sentences comparing the distributions of acorn size for the oak trees 
in these two regions. 
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Atlantic Coast California 
998643 


88864211111 
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Key: 2/6 = An oak species 
whose acorn volume 


is 2.6 cm}. 


66. Who studies more? Researchers asked the students in a large first- 
year college class how many minutes they studied on a typical 
weeknight. The back-to-back stemplot displays the responses from 
random samples of 30 women and 30 men from the class, rounded to 
the nearest 10 minutes. Write a few sentences comparing the male 
and female distributions of study time. 


O | 03333 
96 | 0 | 56668999 
22222222 | 1 | 02222222 
888888888875555 | 1 | 558 
4440 | 2 | 00344 
2 
3} 0 
613 


[ Key: 2/3 = 230 minutes 


67. pg 42 |») Carbon dioxide emissions Burning fuels in power plants 
and motor vehicles emits carbon dioxide (CO>), which contributes to 
global warming. The table displays CO, emissions per person from 


countries with populations of at least 20 million.22 


a. Make a histogram of the data using intervals of width 2, starting at 
0. 


b. Describe the shape of the distribution. Which countries appear to 
be outliers? 


Country CO2 
Algeria ss 
Argentina 4.5 
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Australia 16.9 


Bangladesh 0.4 
Brazil 22 
Canada 14.7 
China 6.2 
Colombia 1.6 
Congo 0.5 
Egypt 2.6 
Ethiopia 0.1 
France 5.6 
Germany 9.1 
Ghana 0.4 
India 1.7 
Indonesia 1.8 
Iran dah 
Iraq 3.7 
Italy 6.7 
Japan 9.2 
Kenya 0.3 
Korea, North 11.5 
Korea, South 2.9 
Malaysia Ld 
Mexico 3.8 
Morocco 1.6 
Myanmar 0.2 
Nepal 0.1 
Nigeria 0.5 
Pakistan 0.9 
Peru 2.0 
Philippines 0.9 
Poland 8.3 
Romania he 
Russia 12.2 
Saudi Arabia 17.0 
South Africa 9.0 
Spain 5.8 
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Sudan 0.3 


Tanzania 0.2 
Thailand 4.4 
Turkey 4.1 
Ukraine 6.6 
United Kingdom 7.9 
United States 17.6 
Uzbekistan oo) 
Venezuela 6.9 
Vietnam 1.7 


68. Traveling to work How long do people travel each day to get to 
work? The following table gives the average travel times to work (in 
minutes) for workers in each state and the District of Columbia who 
are at least 16 years old and don’t work at home.2! 


AL 23.6 
AK 177 
AZ 25.0 
AR 20.7 
CA 26.8 
CO 23.9 
CT 24.1 
DE 23.6 
FL 25.9 
GA 213 
HI 25.5 
ID 20.1 
IL 27.9 
IN 22.3 
IA 18.2 
KS 18.5 
KY 22.4 
LA 25.1 
ME 22.3 
MD 30.6 
MA 26.6 
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MI 23.4 


MN 22.0 
MS 24.0 
MO 22:9 
MT 17.6 
NE 17.7 
NV 24.2 
NH 24.6 
NJ 29.1 
NM 20.9 
NY 30.9 
NC 23.4 
ND 15.5 
OH 22.1 
OK 20.0 
OR 21.8 
PA 25.0 
RI 22.3 
SC 22.9 
SD 15.9 
TN 23.5 
TX 24.6 
UT 20.8 
VT 21.2 
VA 26.9 
WA 2012 
WV 25.6 
WI 20.8 
WY 179 
DC 2952 


. Make a histogram to display the travel time data using intervals of 
width 2 minutes, starting at 14 minutes. 


. Describe the shape of the distribution. What is the most common 
interval of travel times? 
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69. 


70. 


71. 


DRP test scores There are many ways to measure the reading ability 
of children. One frequently used test is the Degree of Reading Power 
(DRP). In a research study on third-grade students, the DRP was 
administered to 44 students.22 Their scores were as follows. 


40 26 39 14 42 18 25 43 46 2F 19 
47 19 26 35 34 15 44 40 38 31 46 
BZ 25 35 35 33 29 34 41 49 28 52 


47 30 48 22 33 41 51 27 14 54 45 
Make a histogram to display the data. Write a few sentences 
describing the distribution of DRP scores. 


Country music The lengths, in minutes, of the 50 most popular mp3 
downloads of songs by country artist Dierks Bentley are given here. 


4.2 4.0 3.8 oof 4.7 
3.4 4.0 4.4 5.0 4.6 Bet 
4.6 4.4 4.1 3.0 3.2 4.7 
3.5 3.7 4.3 af 4.8 4.4 
4.2 4.7 6.2 4.0 740 oe 
3.4 3.4 pas) Sud 4.0 4.2 
Sid 3.4 3.7 3.5 3.4 Sel 
3.9 Sul 3.8 al ae 3.6 
4.5 3.7 


Make a histogram to display the data. Write a few sentences 
describing the distribution of song lengths. 


Returns on common stocks The retum on a stock is the change in its 
market price plus any dividend payments made. Return is usually 
expressed as a percent of the beginning price. The figure shows a 
histogram of the distribution of monthly returns for the U.S. stock 


market over a 273-month period.22 
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Number of months 


25 20 15 10 5 0 5 10 15 
Monthly percent return on common stocks 


a. Describe the overall shape of the distribution of monthly returns. 
b. What is the approximate center of this distribution? 


c. Explain why you cannot find the exact value for the minimum 
return. Between what two values does it lie? 


d. A return less than 0 means that stocks lost value in that month. 
About what percent of all months had returns less than 0? 
72. Healthy cereal? Researchers collected data on calories per serving 
for 77 brands of breakfast cereal. The histogram displays the data.24 


30 


Frequency 


50 60 70 80 90 100 110 120 130 140 150 160 
Calories 


a. Describe the overall shape of the distribution of calories. 
b. What is the approximate center of this distribution? 


c. Explain why you cannot find the exact value for the maximum 
number of calories per serving for these cereal brands. Between 
what two values does it lie? 
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73. 


74. 


75. 


d. About what percent of the cereal brands have 130 or more calories 
per serving? 


Paying for championships Does paying high salaries lead to more 
victories in professional sports? The New York Yankees have long 
been known for having Major League Baseball’s highest team 
payroll. And over the years, the team has won many championships. 
This strategy didn’t pay off in 2008, when the Philadelphia Phillies 
won the World Series. Maybe the Yankees didn’t spend enough 
money that year. The figure shows histograms of the salary 
distributions for the two teams during the 2008 season. Why can’t 
you use these graphs to effectively compare the team payrolls? 


Yankees Phillies 
i2 4 9 4 
10 |[™) ; 
“Ri! 
o 8 = 6 
oe ) 5 
5 6 — 
o a 4 
2 o 
me 44 m3, 
2- 1 
0 0 
0 5 10 15 20 25 30 35 0 2 4 6 8 10 12 14 
Salary ($ millions) Salary ($ millions) 


Paying for championships Refer to Exercise 73. Here is a better 
graph of the 2008 salary distributions for the Yankees and the 
Phillies. Write a few sentences comparing these two distributions. 


Yankees Phillies 


0 8 16 24 32 0 8 16 24 32 
Salary ($ millions) Salary ($ millions) 

Value of a diploma Do students who graduate from high school earn 
more money than students who do not? To find out, we took a 
random sample of 371 U.S. residents aged 18 and older. The 
educational level and total personal income of each person were 
recorded. The data for the 57 non-graduates (No) and the 314 
graduates (Yes) are displayed in the relative frequency histograms. 
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High school graduate? 


aL 


0 20 40 60 80 100 120 140 160 
Total personal income ($1000s) 


a. Would it be appropriate to use frequency histograms instead of 
relative frequency histograms in this setting? Explain why or why 
not. 


b. Compare the distributions of total personal income for the two 
groups. 


76. Strong paper towels In commercials for Bounty paper towels, the 
manufacturer claims that they are the “quicker picker-upper,” but are 
they also the stronger picker-upper? Two of Mr. Tabor’s statistics 
students, Wesley and Maverick, decided to find out. They selected a 
random sample of 30 Bounty paper towels and a random sample of 30 
generic paper towels and measured their strength when wet. To do 
this, they uniformly soaked each paper towel with 4 ounces of water, 
held two opposite edges of the paper towel, and counted how many 
quarters each paper towel could hold until ripping, alternating brands. 
The data are displayed in the relative frequency histograms. Compare 
the distributions. 
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ree 


78. 


79. 


30% 4 
20% 4 
10% 


Bounty 


0%  -—--_-----—- Se 


Brand 


Generic 


30% 4 
20% 4 


10% 4 


70 80 90 100 110 120 130 140 
Number of quarters 


a. Would it be appropriate to use frequency histograms instead of 
relative frequency histograms in this setting? Explain why or why 
not. 


b. Compare the distributions of number of quarters until breaking for 
the two paper towel brands. 


Birth months Imagine asking a random sample of 60 students from 
your school about their birth months. Draw a plausible (believable) 
graph of the distribution of birth months. Should you use a bar graph 
or a histogram to display the data? 


Die rolls Imagine rolling a fair, six-sided die 60 times. Draw a 
plausible graph of the distribution of die rolls. Should you use a bar 
graph or a histogram to display the data? 


AP® exam scores The table gives the distribution of grades earned 
by students taking the AP® Calculus AB and AP® Statistics exams in 
2016.22 


Grade 
5 4 3 2 1 Total 
Calculus AB 76,486 308,215 
Statistics 29,627 | 44,884 | 51,367 | 32,120 | 48,565 | 206,563 


a. Make an appropriate graphical display to compare the grade 
distributions for AP® Calculus AB and AP® Statistics. 


b. Write a few sentences comparing the two distributions of exam 
grades. 
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80. 


81. 


82. 


Multiple Choice Select the best answer for Exercises 80-85. 


Here are the amounts of money (cents) in coins carried by 10 students 
in a Statistics class: 50, 35, 0, 46, 86, 0, 5, 47, 23, 65. To make a 
stemplot of these data, you would use stems 


a. 0, 2, 3, 4, 6, 8. 

b. 0, 1, 2, 3, 4, 5, 6, 7, 8. 

G. 0,3, 5, 6,7. 

d. 00, 10, 20, 30, 40, 50, 60, 70, 80, 90. 
e. None of these. 


The histogram shows the heights of 300 randomly selected high 
school students. Which of the following is the best description of the 
shape of the distribution of heights? 
70 
60 


50 


Frequency 


52 56 60 64 68 72 76 
Height 


a. Roughly symmetric and single-peaked 
b. Roughly symmetric and double-peaked 
c. Roughly symmetric and multi-peaked 
d. Skewed to the left 


e. Skewed to the right 


You look at real estate ads for houses in Naples, Florida. There are 
many houses ranging from $200,000 to $500,000 in price. The few 
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83. 


84. 


houses on the water, however, are priced up to $15 million. The 
distribution of house prices will be 


a. skewed to the left. 
b. roughly symmetric. 
c. skewed to the right. 
d. single-peaked. 

e. too high. 


The histogram shows the distribution of the percents of women aged 
15 and over who have never married in each of the 50 states and the 
District of Columbia. Which of the following statements about the 
histogram is correct? 


Number of states 


0 T T T T ry 
20 24 28 32 36 40 4d 48 52 
Percent of women over age 15 who never married 


a. The center (median) of the distribution is about 36%. 


b. There are more states with percentages above 32 than there are 
states with percentages less than 24. 


c. It would be better if the values from 34 to 50 were deleted on the 
horizontal axis so there wouldn’t be a large gap. 


d. There was one state with a value of exactly 33%. 


e. About half of the states had percentages between 24% and 28%. 


When comparing two distributions, it would be best to use relative 
frequency histograms rather than frequency histograms when 
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85. 


86. 


a. the distributions have different shapes. 

b. the distributions have different amounts of variability. 

c. the distributions have different centers. 

d. the distributions have different numbers of observations. 


e. at least one of the distributions has outliers. 


Which of the following is the best reason for choosing a stemplot 
rather than a histogram to display the distribution of a quantitative 
variable? 


a. Stemplots allow you to split stems; histograms don’t. 

b. Stemplots allow you to see the values of individual observations. 
c. Stemplots are better for displaying very large sets of data. 

d. Stemplots never require rounding of values. 


e. Stemplots make it easier to determine the shape of a distribution. 
Recycle and Review 


Risks of playing soccer (1.1) A study in Sweden looked at former 
elite soccer players, people who had played soccer but not at the elite 
level, and people of the same age who did not play soccer. Here is a 
two-way table that classifies these individuals by whether or not they 
had arthritis of the hip or knee by their mid-fifties:2° 


Soccer level 


Elite Non-elite Did not play 
Whether Yes 24 
person No 548 
developed 
arthritis 


a. What percent of the people in this study were elite soccer players? 
What percent of the people in this study developed arthritis? 


b. What percent of the elite soccer players developed arthritis? What 
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percent of those who got arthritis were elite soccer players? 


. Researchers suspected that the more serious soccer players were 
more likely to develop arthritis later in life. Do the data confirm 
this suspicion? Calculate appropriate percentages to support your 
answer. 
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SECTION 1.3 Describing Quantitative Data with 
Numbers 


LEARNING TARGETS. By the end of the section, you should be able to: 


e Calculate measures of center (mean, median) for a distribution of 
quantitative data. 


Calculate and interpret measures of variability (range, standard 
deviation, IQR) for a distribution of quantitative data. 


Explain how outliers and skewness affect measures of center and 
variability. 


Identify outliers using the 1.5 x IQR rule. 


Make and interpret boxplots of quantitative data. 


Use boxplots and numerical summaries to compare distributions of 
quantitative data. 


How much offense did the 2016 U.S. women’s soccer team generate? The 
dotplot (reproduced from Section 1.2) shows the number of goals the team 
scored in 20 games played. 


» 
T T T T T T 
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Goals scored 
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The distribution is right-skewed and single-peaked. The games in which 
the team scored 9 and 10 goals appear to be outliers. How can we describe 
the center and variability of this distribution? 


Measuring Center: The Mean 


The most common measure of center is the mean. 
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DEFINITION The mean X X 


The mean X X (pronounced “x-bar”) of a distribution of 
quantitative data is the average of all the individual data values. To 
find the mean, add all the values and divide by the total number of 
observations. 

If the n observations are x1, X9, ..., X,, the mean is given by the 


formula 


X =sum of data valuesnumber of data values=X1+X2+-+Xnn=)y Xin 
x —_ sum ofdata values _ X14 +X9q4+---+Xn = a Xj 
~ number of data values _ n oon 


The & (capital Greek letter sigma) in the formula is short for “add them 
all up.” The subscripts on the observations x; are just a way of keeping the 


n data values distinct. They do not necessarily indicate order or any other 
special facts about the data. 


EXAMPLE |How many goals? (>) 


Calculating the mean 


Kyodo News/Getty Images 


PROBLEM: Here are the data on the number of goals scored in 20 
games played by the 2016 U.S. women’s soccer team: 


Diy 5) lO) Seon a Sos Scale le Le 8 


a. Calculate the mean number of goals scored per game by the team. 
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Show your work. 


b. The earlier description of these data (page 35) suggests that the 
games in which the team scored 9 and 10 goals are possible outliers. 
Calculate the mean number of goals scored per game by the team in 
the other 18 games that season. What do you notice? 


SOLUTION: 
Meo ol ee ee A De ee 6 32062) 
x - §+5+14+104+542414142434+3424+144424+1424+1+49+3 
7 20 
a. 63, a 
20 3.15 goals 


b. The mean for the other 18 games is 


Me=S4+54+145425141422343+251444241 424 1 + 318=4418=2 444 
§+5+14+54+241414+24+34342414+442414+24143 


xe = 
18 


At 


= a 2.44 goals 


These two games increased the team’s mean number of goals scored 
per game by 0.71 goals. 


FOR PRACTICE, TRY EXERCISE 87 


The notation x £ refers to the mean of a sample. Most of the time, the 
data we encounter can be thought of as a sample from some larger 
population. When we need to refer to a population mean, we’ || use the 
symbol p (Greek letter mu, pronounced “mew”). If you have the entire 
population of data available, then you calculate p in just the way you’d 
expect: add the values of all the observations, and divide by the number of 
observations. 
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The preceding example illustrates an important weakness of the mean 
as a measure of center: rT the mean is sensitive to extreme values in a 


distribution. These may be outliers, but a skewed distribution that has no 
outliers will also pull the mean toward its long tail. We say that the mean 
is not a resistant measure of center. 


DEFINITION Resistant 


A Statistical measure is resistant if it isn’t sensitive to extreme 
values. 


The mean of a distribution also has a physical interpretation, as the 
following activity shows. 


ACTIVITY Mean as a “balance point” 


Ann Heath 


In this activity, you’ll investigate an important property of the mean. 


1. Stack 5 pennies on top of the 6-inch mark on a 12-inch ruler. Place a 
pencil under the ruler to make a “seesaw” on a desk or table. Move 
the pencil until the ruler balances. What is the relationship between 
the location of the pencil and the mean of the five data values 6, 6, 6, 
6, and 6? 

2. Move one penny off the stack to the 8-inch mark on your ruler. Now 
move one other penny so that the ruler balances again without 
moving the pencil. Where did you put the other penny? What is the 
mean of the five data values represented by the pennies now? 
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3. Move one more penny off the stack to the 2-inch mark on your ruler. 
Now move both remaining pennies from the 6-inch mark so that the 
ruler still balances with the pencil in the same location. Is the mean of 
the data values still 6? 


4. Discuss with your classmates: Why is the mean called the “balance 
point” of a distribution? 


The activity gives a physical interpretation of the mean as the balance 
point of a distribution. For the data on goals scored in each of 20 games 
played by the 2016 U.S. women’s soccer team, the dotplot balances at 
x =3.15@ = 3.15 goals. 


es s 
oO 
oe a ca a 
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¥ =3.15 


Goals scored 


Measuring Center: The Median 


We could also report the value in the “middle” of a distribution as its 
center. That’s the idea of the median. 


DEFINITION Median 


The median is the midpoint of a distribution, the number such that 
about half the observations are smaller and about half are larger. 


To find the median, arrange the data values from smallest to largest. 


e If the number n of data values is odd, the median is the middle 
value in the ordered list. 


e If the number n of data values is even, the median is the average of 
the two middle values in the ordered list. 


The median is easy to find by hand for small sets of data. For instance, 
here are the data from Section 1.2 on the highway fuel economy ratings for 
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a sample of 25 model year 2018 Toyota 4Runners tested by the EPA: 


224. 224 22.5 23.5 22.3 22.3 22.5 224 22.1 21.5 22.0 22. 
22.8 224 226 22.9 22.5 221. 224 22.2 22.9 226 21.9 22, 


Start by sorting the data values from smallest to largest: 


215: 219: 22.0 22.1, 22:1. 222 222-223 22.5: 22.5 22:4 22, 
224.224 224 22.5 22.5 22.6 22,6 22.7 22.8 22.9 22.9 23, 


There are n = 25 data values (an odd number), so the median is the middle 
(13th) value in the ordered list, the bold 22.4. 


EXAMPLE | How many goals? (») 


Finding the median 


Icon Sports Wire/Getty Images 


PROBLEM: Here are the data on the number of goals scored in 20 
games played by the 2016 U.S. women’s soccer team: 


ba oe oe Oy Poet Wl erlaeD no no eh ee ee De he Os 
Find the median. 


SOLUTION: 


111111222@|255545559 10 


2+2_ 


The median is 2+22=2. 2 
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To find the median, sort the data values from smallest to largest. Because 


there are n = 20 data values (an even number), the median is the average 


of the middle two values in the ordered list. 


FOR PRACTICE, TRY EXERCISE 89 


Comparing the Mean and the Median 


Which measure—the mean or the median—should we report as the center 
of a distribution? That depends on both the shape of the distribution and 
whether there are any outliers. 


e Shape: Figure 1.13 shows the mean and median for dotplots with three 
different shapes. Notice how these two measures of center compare in 
each case. The mean is pulled in the direction of the long tail ina 
skewed distribution. 


Skewed to the left 
Mean < Median 


Median = 19 

* H 

a a a a oe H 

T T T T x T 

13 14 15 16 17 18 19 20 
Mean = 17.97 


(a) Quiz score 


Roughly symmetric 
Mean = Median 


Median = 22.4 


T 
21.5 22.0 22.5 23.0 23.5 


(b) Highway fuel economy (mpg) 


Skewed to the right 
Mean > Median 


Mean = 3.15 
(c) Goals scored 


FIGURE 1.13 Dotplots that show the relationship between the mean and median 
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in distributions with different shapes: (a) Scores of 30 statistics students on a 20- 
point quiz, (b) highway fuel economy ratings for a sample of 25 model year 2018 
Toyota 4Runners, and (c) number of goals scored in 20 games played by the 2016 
U.S. women’s soccer team. 


e Outliers: We noted earlier that the mean is sensitive to extreme values. 
If we remove the two possible outliers (9 and 10) in Figure 1.13(c), the 
mean number of goals scored per game decreases from 3.15 to 2.44. The 
median number of goals scored is 2 whether we include these two 
games or not. The median is a resistant measure of center, but the mean 
is not. 


You can compare how the mean and median behave by using the Mean and Median 


applet at the book’s website, highschool.bfwpub.com/tps6e. 


EFFECT OF SKEWNESS AND OUTLIERS ON 
MEASURES OF CENTER 


e Ifa distribution of quantitative data is roughly symmetric and has no 
outliers, the mean and median will be similar. 


e If the distribution is strongly skewed, the mean will be pulled in the 
direction of the skewness but the median won’t. For a right-skewed 
distribution, we expect the mean to be greater than the median. For a 
left-skewed distribution, we expect the mean to be less than the 
median. 


e The median is resistant to outliers but the mean isn’t. 


The mean and median measure center in different ways, and both are 
useful. In Major League Baseball (MLB), the distribution of player salaries 
is strongly skewed to the right. Most players earn close to the minimum 
salary (which was $507,500 in 2016), while a few earn more than $20 
million. The median salary for MLB players in 2016 was about $1.5 
million—but the mean salary was about $4.4 million. Clayton Kershaw, 
Miguel Cabrera, John Lester, and several other highly paid superstars 
pulled the mean up but that did not affect the median. The median gives us 
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a good idea of what a “typical” MLB salary is. If we want to know the 
total salary paid to MLB players in 2016, however, we would multiply the 
mean salary by the total number of players: ($4.4 million)(862) * $3.8 
billion! 


CHECK YOUR UNDERSTANDING 


‘ 


\ 


y Some students purchased pumpkins for a carving contest. Before 
the contest began, they weighed the pumpkins. The weights in pounds 
are shown here, along with a histogram of the data. 


3.6 40 96 140 11.0 124 130 20 60 66 150 3.4 
12.7 60 28 96 40 61 54 11.9 54 31.0 33.0 


Frequency 


8 
1 
6 
5 
4 
3 
2 
1 
0 


0 5 10 15 20 25 30 35 
Pumpkin weight (Ib) 


1. Calculate the mean weight of the pumpkins. 


2. Find the median weight of the pumpkins. 


3. Would you use the mean or the median to summarize the typical 
weight of a pumpkin in this contest? Explain. 


Measuring Variability: The Range 


Being able to describe the shape and center of a distribution is a great start. 
However, two distributions can have the same shape and center, but still 
look quite different. 
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Figure 1.14 shows comparative dotplots of the length (in millimeters) 
of separate random samples of PVC pipe from two suppliers, A and B.24 
Both distributions are roughly symmetric and single-peaked, with centers 
at about 600 mm, but the variability of these two distributions is quite 
different. The sample of pipes from Supplier A has much more consistent 
lengths (less variability) than the sample from Supplier B. 


5970 598.5 600.0 601.5 603.0 
Length of PVC pipe (mm) 


FIGURE 1.14 Comparative dotplots of the length of PVC pipes in separate 
random samples from Supplier A and Supplier B. 

There are several ways to measure the variability of a distribution. The 
simplest is the range. 


DEFINITION Range 


The range of a distribution is the distance between the minimum 


value and the maximum value. That is, 


Range = Maximum — MinimumRange = Maximum — Minimum 


Here are the data on the number of goals scored in 20 games played by 
the 2016 U.S. women’s soccer team, along with a dotplot: 


951105211233214212i19 3 


+ eeesce 
aca (ached 


I 7 i 
4 5 6 7 8 9 10 
Goals scored 


The range of this distribution is 10 — 1 = 9 goals. Note that oo the 
range of a data set is a single number. In everyday language, people 


Ww 
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sometimes say things like, “The data values range from 1 to 10.” A correct 
statement is “The number of goals scored in 20 games played by the 2016 
U.S. women’s soccer team varies from 1 to 10, a range of 9 goals.” 


The range is not a resistant measure of variability. It depends on only 
the maximum and minimum values, which may be outliers. Look again at 
the data on goals scored by the 2016 U.S. women’s soccer team. Without 
the possible outliers at 9 and 10 goals, the range of the distribution would 
decrease to 5 — 1 = 4 goals. 


The following graph illustrates another problem with the range as a 
measure of variability. The parallel dotplots show the lengths (in 
millimeters) of a sample of 11 nails produced by each of two machines.22 
Both distributions are centered at 70 mm and have a range of 72 — 68 = 4 
mm. But the lengths of the nails made by Machine B clearly vary more 


from the center of 70 mm than the nails made by Machine A. 


Machine 


T T 
68 69 70 71 72 
Length (mm) 


Measuring Variability: The Standard Deviation 


If we summarize the center of a distribution with the mean, then we should 
use the standard deviation to describe the variation of data values around 
the mean. 


DEFINITION Standard deviation 
The standard deviation measures the typical distance of the values 


in a distribution from the mean. 


How do we calculate the standard deviation s, of a quantitative data set 


with n values? Here are the steps. 
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HOW TO CALCULATE THE STANDARD DEVIATION s, 


Find the mean of the distribution. 


Calculate the deviation of each value from the mean: deviation = 
value — mean. 


Square each of the deviations. 


Add all the squared deviations, divide by n — 1, and take the square 
root. 
If the values in a data set are Xj, X9,..., X, the standard deviation is 
given by the formula 
sx=(x1-x )2 + (x2-x )2+-°+(xn-x )2n-1=) (xi-x )2n-1 


(Sa ee _ (De? 


n—1 n—1 


AP® EXAM TIP 


The formula sheet provided with the AP® Statistics exam gives the 
sample standard deviation in the equivalent form 


“2 Se (ai _ z)’. 


_ S = 
sx=In-1F (xi-x 2... Vn-l 


The notation s, refers to the standard deviation of a sample. When we 


need to refer to the standard deviation of a population, we’ ll use the 
symbol o (Greek lowercase sigma). The population standard deviation is 
calculated by dividing the sum of squared deviations by n instead of n - 1 
before taking the square root. 


EXAMPLE | How many friends? (>) 


Calculating and interpreting standard deviation 
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LaraBelova/Getty Images 


PROBLEM: Eleven high school students were asked how many 
“close” friends they have. Here are their responses, along with a dotplot: 


12 2) 28 8 4A 6 


a 
4 
T 


es 
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3 4 5 6 
Number of close friends 
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Calculate the standard deviation. Interpret this value. 


SOLUTION: 


To calculate the standard deviation: 
Find the mean of the distribution. 
Calculate the deviation of each value from the mean: deviation = 
value — mean 
Square each of the deviations. 
Add all the squared deviations, divide by n — 1, and take the square 


root to return to the original units. 


Kee? ee toe ode Eola 
142424+243434343444446_ 


£ 7 3 

et xi-x 71 — % (xi-x )2(2 — 2) 
1 1-3=-2 (2)2=4 

2 2-3=-1 (1)2=1 

2 2-3=-1 ened 

2 2-3=-1 eyed 
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WwW WwW Ww 


/ 18 
= ,/ —— = 1.34 close friends 
\ 11-1 


Sx 
sx=1811-1=1.34 close friends 


Interpretation: The number of close friends these students have typically 
varies by about 1.34 close friends from the mean of 3 close friends. 


FOR PRACTICE, TRY EXERCISE 99 


The value obtained before taking the square root in the standard 
deviation calculation is known as the variance. In the preceding example, 


the sample variance is 
=\2 
(x; —-7) 
_ s2 2d -2)" 18 1.80 
sx2=)\ (xi-x )2n-1=1811-1=1.80 n—1 11-1 


Unfortunately, the units are “squared close friends.” Because variance is 
measured in squared units, it is not a very helpful way to describe the 
variability of a distribution. 


Think About It 


WHY IS THE STANDARD DEVIATION CALCULATED IN 
SUCH A COMPLEX WAY? Add the deviations from the mean in the 
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preceding example. You should get a sum of 0. Why? Because the mean 
is the balance point of the distribution. We square the deviations to 
avoid the positive and negative deviations balancing each other out and 
adding to 0. It might seem strange to “average” the squared deviations 
by dividing by n — 1. We’Il explain the reason for doing this in Chapter 
7. It’s easier to understand why we take the square root: to return to the 
original units (close friends). 


More important than the details of calculating s, are the properties of the 


standard deviation as a measure of variability: 


e s, is always greater than or equal to 0. s, = 0 only when there is no 


variability, that is, when all values in a distribution are the same. 


e Larger values of s, indicate greater variation from the mean of a 


distribution. The comparative dotplot shows the lengths of PVC pipe in 
random samples from two different suppliers. Supplier A’s pipe lengths 
have a standard deviation of 0.681 mm, while Supplier B’s pipe lengths 
have a standard deviation of 2.02 mm. The lengths of pipes from 
Supplier B are typically farther from the mean than the lengths of pipes 
from Supplier A. 


“« * - 4 e J , . es + a e + e J LJ H a 
5970 598.5 600.0 601.5 603.0 
Length of PVC pipe (mm) 


e s, is not a resistant measure of variability. The use of squared 


deviations makes s, even more sensitive than x ® to extreme values in a 


distribution. For example, the standard deviation of the number of goals 
scored in 20 games played by the 2016 U.S women’s soccer team is 
2.58 goals. If we omit the possible outliers of 9 and 10 goals, the 
standard deviation drops to 1.46 goals. 


—eeeseece 
“eeeee 
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e s, measures variation about the mean. It should be used only when 


the mean is chosen as the measure of center. 


In the close friends example, 11 high school students had an average of 
x =32 = 3 close friends with a standard deviation of s, = 1.34. What if a 
12th high school student was added to the sample who had 3 close friends? 
The mean number of close friends in the sample would still be x =3. 
= = 3. How would s, be affected? Because the standard deviation 
measures the typical distance of the values in a distribution from the mean, 
s, would decrease because this 12th value is at a distance of 0 from the 


mean. In fact, the new standard deviation would be 
sx=) (xi-x )2n-1=1812-1=1.28 friends 


} =. 
_ DC? z ) — f wm | nS entis 
Ss = (| = Vigor =) riends 


Measuring Variability: The Interquartile Range (IQR) 


We can avoid the impact of extreme values on our measure of variability 
by focusing on the middle of the distribution. Start by ordering the data 
values from smallest to largest. Then find the quartiles, the values that 
divide the distribution into four groups of roughly equal size. The first 
quartile Q, lies one-quarter of the way up the list. The second quartile is 
the median, which is halfway up the list. The third quartile Q3 lies three- 
quarters of the way up the list. The first and third quartiles mark out the 
middle half of the distribution. 


Second quartile 


First quartile Q, (median) Third quartile Q, 


a ° | ° 6 e * ° | oo i o 
am | tT T T T T T T ~T T T 


18 20 22 24 26 28 30 32 34 36 38 40 
Amount collected ($) 


For example, here are the amounts collected each hour by a charity at a 
local store: $19, $26, $25, $37, $31, $28, $22, $22, $29, $34, $39, and 
$31. The dotplot displays the data. Because there are 12 data values, the 
quartiles divide the distribution into 4 groups of 3 values. 
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DEFINITION Quartiles, First quartile Q,, Third quartile 
Q3 

The quartiles of a distribution divide the ordered data set into four 

groups having roughly the same number of values. To find the 

quartiles, arrange the data values from smallest to largest and find the 

median. 

The first quartile Q; is the median of the data values that are to the 

left of the median in the ordered list. 

The third quartile Q3 is the median of the data values that are to the 

right of the median in the ordered list. 


The interquartile range ([QR) measures the variability in the middle 


half of the distribution. 


DEFINITION Interquartile range (IQR) 


The interquartile range (IQR) is the distance between the first and 
third quartiles of a distribution. In symbols: 


IQR=Q3-Q1UQR = Q3 — Q: 


Notice that the IQR is simply the range of the “middle half” of the 


distribution. 


EXAMPLE | Boys and their shoes? (>) 
Finding the IQR 
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Peter Cade/Getty Images 


PROBLEM: How many pairs of shoes does a typical teenage boy own? 
To find out, two AP® Statistics students surveyed a random sample of 
20 male students from their large high school and recorded the number 
of pairs of shoes that each boy owned. Here are the data, along with a 
dotplot: 


1447 6 5 12388 7 101010114 5 227 5 10357 


eeeee ee @ ® e e 
yo -  a ae a O 
0 5 10 15 20 25 30 35 40 


Number of pairs of shoes 


Find the interquartile range. 


SOLUTION: 


45556777 7@|1010 10 10 11 12 14 22 35 38 
Median = 9 


Sort the data values from smallest to largest and find the median. 


i455 i a 77 eii10 10 10 10 1 [12)14 22 35 381 
l i] 


Find the first quartile Q; and the third quartile Q3. 


IQR= 11.5 — 6.5 = 5 pairs of shoes 
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FOR PRACTICE, TRY EXERCISE 105 


The quartiles and the interquartile range are resistant because they are 
not affected by a few extreme values. For the shoe data, Q3 would still be 
11.5 and the JQR would still be 5 if the maximum were 58 rather than 38. 

Be sure to leave out the median when you locate the quartiles. In the 
preceding example, the median was not one of the data values. For the 
earlier close friends data set, we ignore the circled median of 3 when 
finding Q, and Q3. 


11 2@2 388 3@+4 6 


O; Median O; 


CHECK YOUR UNDERSTANDING 


y Here are data on the highway fuel economy ratings for a sample of 
25 model year 2018 Toyota 4Runners tested by the EPA, along with a 
dotplot: 


DA 22 A 2D B25 oe eee ee ee, 22) 
PP Boe A226. 22.9) 2 oe D2 oy il 224 


21.5 22.0 22.5 23.0 23.5 
Highway fuel economy (mpg) 


1. Find the range of the distribution. 


2. The mean and standard deviation of the distribution are 22.404 mpg 
and 0.363 mpg, respectively. Interpret the standard deviation. 
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3. Find the interquartile range of the distribution. 


4. Which measure of variability would you choose to describe the 
distribution? Explain. 


Numerical Summaries with Technology 


Graphing calculators and computer software will calculate numerical 
summaries for you. Using technology to perform calculations will allow 
you to focus on choosing the right methods and interpreting your results. 


3. Technology Corner |COMPUTING NUMERICAL 
SUMMARIES (>) 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


Let’s find numerical summaries for the boys’ shoes data from the 
example on page 64. We’II start by showing you how to compute 
summary statistics on the TI-83/84 and then look at output from 
computer software. 


I. One-variable statistics on the TI-83/84 
1. Enter the data in list L1. 


2. Find the summary statistics for the shoe data. 


e Press (CALC); choose 1-VarStats. Os 2.55 or later: In the 


dialog box, press (L1) and to specify L1 as the 


List. Leave FreqList blank Arrow down to Calculate and press 


[ENTER|, Older Os: Press (L1) and [ENTER\. 
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x=11.65 

=x=233 
=x7=4401 

Sx=9. 421559822 
ox=9. 183000599 
n=20 

minx=4 
4Q:1=6.5 


e Press [¥] to see the rest of the one-variable statistics. 


TSx=9. 421559822 
ox=9. 183000599 
=20 

minx=4 

Q1=6.5 

Med=9 

Q3=11.5 
maxxX=38 


I. Output from statistical software We used Minitab statistical 
software to calculate descriptive statistics for the boys’ shoes data. 
Minitab allows you to choose which numerical summaries are 
included in the output. 


Descriptive Statistics: Shoes 


Variable N Mean StDev Minimum Q1 Median Q3 Maximum 
Shoes 20 11.65 9.42 4.00 6.25 9.00 11.75 38.00 


Note: The TI-83/84 gives the first and third quartiles of the boys’ shoes 
distribution as Q; = 6.5 and Q3 = 11.5. Minitab reports that Q; = 6.25 


and Q3 = 11.75. What happened? Minitab and some other software use 


slightly different rules for locating quartiles. Results from the various 
rules are usually close to each other. Be aware of possible differences 
when calculating quartiles as they may affect more than just the IQR. 


Identifying Outliers 


Besides serving as a measure of variability, the interquartile range (IQR) is 
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used as a “ruler” for identifying outliers. 


HOW TO IDENTIFY OUTLIERS: THE 1.5 x IQR RULE 


Call an observation an outlier if it falls more than 1.5 x IQR above the 


third quartile or below the first quartile. That is, 
Low outliers <Q; — 1.5 x IQR High outliers > Q3 + 1.5 x IQR 


Here are sorted data on the highway fuel economy ratings for a sample of 
25 model year 2018 Toyota 4Runners tested by the EPA, along with a 
dotplot: 


215 209 220: 221 22.1 22.2 22.2 22,322.35 22.5: 224-22, 
224 224 22.4 22.5 22.5 22.6 22.6. 22.7 22.8 22.9 22.9 23, 


T T T T T T T T 
21.5 22.0 22.5 23.0 23.5 
Highway fuel economy (mpg) 


Does the 1.5 x JQR rule identify any outliers in this distribution? If you 
did the preceding Check Your Understanding, you should have found that 
Q, = 22.2 mpg, Q3 = 22.6 mpg, and IQR = 0.4 mpg. For these data, 


High outliers > Q3+1.5xIQR=22.6+1.5x0.4 = 23.2 
High outliers > Q3 + 1.5 x IQR = 22.64 1.5 x 0.4 = 23.2 
and 


Low outliers < Q1-1.5xIQR=22.2-1.5x0.4 = 21.6 
Low outliers < Q) — 1.5 x IQR = 22.2 -—1.5 x 0.4 = 21.6 


The cars with estimated highway fuel economy ratings of 21.5 and 
23.3 are identified as outliers. 


AP® EXAM TIP 


You may be asked to determine whether a quantitative data set has any 
outliers. Be prepared to state and use the rule for identifying outliers. 
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EXAMPLE |How many goals? (>) 
Identifying outliers 


Icon Sports Wire/Getty Images 


PROBLEM: Here are sorted data on the number of goals scored in 20 
games played by the 2016 U.S women’s soccer team, along with a 
dotplot: 


ee es ee 2 De 2 Ao ret eA om ano we. 2K) 
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Identify any outliers in the distribution. Show your work. 


SOLUTION: 


11114 D2 22@i2d33 3@5)s 5 910 | 

aaa ee aaa 
Qa=1 Median=2 Q,=4.5 

IQR = Q3 - Q) =4.5-1=3.5 

Low outliers < Q; — 1.5 x IQR=1-1.5 x 3.5=-4.25 

High outliers > Q3 + 1.5 x IQR =4.5+ 1.5 x 3.5=9.75 


There are no data values less than —4.25, but the game in which the team 
scored 10 goals is an outlier. 
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The game in which the team scored 9 goals is not identified as an outlier 


by the 1.5 x JQR rule. 


FOR PRACTICE, TRY EXERCISE 107 


It is important to identify outliers in a distribution for several reasons: 


. They might be inaccurate data values. Maybe someone recorded a 
value as 10.1 instead of 101. Perhaps a measuring device broke down. 
Or maybe someone gave a silly response, like the student in a class 
survey who claimed to study 30,000 minutes per night! Try to correct 
errors like these if possible. If you can’t, give summary statistics with 
and without the outlier. 


’, They can indicate a remarkable occurrence. For example, in a graph 
of net worth, Bill Gates is likely to be an outlier. 


3. They can heavily influence the values of some summary statistics, 
like the mean, range, and standard deviation. 


Making and Interpreting Boxplots 


You can use a dotplot, stemplot, or histogram to display the distribution of 
a quantitative variable. Another graphical option for quantitative data is a 
boxplot. A boxplot summarizes a distribution by displaying the location of 
5 important values within the distribution, known as its five-number 


summary. 


DEFINITION Five-number summary, Boxplot 


The five-number summary of a distribution of quantitative data 
consists of the minimum, the first quartile Q,, the median, the third 


quartile Q3, and the maximum. 


A boxplot is a visual representation of the five-number summary. 
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A boxplot is sometimes called a box-and-whisker plot. 


Figure 1.15 illustrates the process of making a boxplot. The dotplot in 
Figure 1.15(a) shows the data on EPA estimated highway fuel economy 
ratings for a sample of 25 model year 2018 Toyota 4Runners. We have 
marked the first quartile, the median, and the third quartile with vertical 
blue lines. The process of testing for outliers with the 1.5 x IQR rule is 
shown in red. Because the values of 21.5 mpg and 23.3 mpg are outliers, 
we mark these separately. To get the finished boxplot in Figure 1.15(b), 
we make a box spanning from Q; to Q3 and then draw “whiskers” to the 


smallest and largest data values that are not outliers 


Lower cutoff Upper cutoff 
for outliers for outliers 
' 1.5 X JOR = 0.6 1.5 X JOR = 0.6 | 

a 
IQR =0.4 
Qa 


' 
' 
Q, Median Q, 
' 


' 

' 

' 

' a 

' 99 «24 2996 
' 

' 

' 

' 


21.5 22.0 22.5 23.0 23.5 


(a) Highway fuel economy (mpg) 
Q; Median Q3 
Min 22.2 22.4 22.6 Max 
21.5 23.3 
e . a : - e H - 

SS ee ee a ee 

21.5 22.0 22.5 23.0 23.5 
(b) Highway fuel economy (mpg) 


FIGURE 1.15 A visual illustration of how to make a boxplot for the Toyota 
4Runner highway gas mileage data. (a) Dotplot of the data with the five-number 
summary and 1.5 x IQR marked. (b) Boxplot of the data with outliers identified 
(*). 

As you can see, it is fairly easy to make a boxplot by hand for small 
sets of data. Here’s a summary of the steps. 


HOW TO MAKE A BOXPLOT 


e Find the five-number summary for the distribution. 


e Identify outliers using the 1.5 x IQR rule. 
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e Draw and label the axis. Draw a horizontal axis and put the name of 
the quantitative variable underneath, including units if applicable. 


e Scale the axis. Look at the minimum and maximum values in the 
data set. Start the horizontal axis at a convenient number equal to or 
below the minimum and place tick marks at equal intervals until you 
equal or exceed the maximum. 


e Draw a box that spans from the first quartile (Q,) to the third 
quartile (Q3). 


e Mark the median with a vertical line segment that’s the same height 
as the box. 


e Draw whiskers—lines that extend from the ends of the box to the 
smallest and largest data values that are not outliers. Mark any 
outliers with a special symbol such as an asterisk (*). 


We see from the boxplot in Figure 1.15 that the distribution of 
highway gas mileage ratings for this sample of model year 2018 Toyota 
4Runners is roughly symmetric with one high outlier and one low outlier. 


EXAMPLE | Picking pumpkins » 
Making and interpreting boxplots 


cscredon/Getty Images 


PROBLEM: Some students purchased pumpkins for a carving contest. 
Before the contest began, they weighed the pumpkins. The weights in 
pounds are shown here. 


3.6 40 96 140 11.0 12.4 130 2.0 60 66 15.0 3.4 
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12.7 60 28 96 40 61 54 11.9 5.4 31.0 33.0 


a. Make a boxplot of the data. 


b. Explain why the median and IQR would be a better choice for 
summarizing the center and variability of the distribution of pumpkin 
weights than the mean and standard deviation. 


SOLUTION: 

a ROR ie a OO Eater Ma erin aie Hee nouns Median 
(20 28 34 36 40 GO) 54 54 60 60 61) 
“96 96 11.0 119124 427130 14.0 15.0. 31.0 33.0) 


= 


Os “Max 
TOR Oo = Oy = 1257 = 4.0 =8.7 


Low outliers < Q; — 1.5 x IQR = 4.0 - 1.5 x 8.7 = -9.05 
High outliers > Q3 + 1.5 x IQR = 12.7 + 1.5 x 8.7 = 25.75 
The pumpkins that weighed 31.0 and 33.0 pounds are outliers. 


To make the boxplot: 


e Find the five-number summary. 


Identify outliers. 

Draw and label the axis. 
Scale the axis. 

Draw a box. 

Mark the median. 


Draw whiskers to the smallest and largest data values that are not 
outliers. Mark outliers with an asterisk. 


Pumpkin weight (1b) 
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b. The distribution of pumpkin weights is skewed to the right with two 
high outliers. Because the mean and standard deviation are sensitive 
to outliers, it would be better to use the median and JQR, which are 
resistant. 


We know the distribution is skewed to the right because the left half of 


the distribution varies from 2.0 to 6.6 pounds, while the right half of the 
distribution (excluding outliers) varies from 6.6 to 15.0 pounds. 


FOR PRACTICE, TRY EXERCISE 111 


Boxplots provide a quick summary of the center and variability of a 
distribution. The median is displayed as a line in the central box, the 
interquartile range is the length of the box, and the range is the length of 
the entire plot, including outliers. Note that some statistical software 
orients boxplots vertically. At left is a vertical boxplot of the pumpkin 
weight data from the preceding example. You can see that the graph is 
skewed toward the larger values. 
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Pumpkin weight (Ib) 


Boxplots do not display each individual value in a distribution. And 
oOo boxplots don’t show gaps, clusters, or peaks. For instance, the 
dotplot below left displays the duration, in minutes, of 220 eruptions of the 
Old Faithful geyser. The distribution of eruption durations is clearly 
double-peaked (bimodal). But a boxplot of the data hides this important 
information about the shape of the distribution. 
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CHECK YOUR UNDERSTANDING 


Ryan and Brent were curious about the amount of french fries they 
would get in a large order from their favorite fast-food restaurant, 
Burger King. They went to several different Burger King locations over 
a series of days and ordered a total of 14 large fries. The weight of each 
order (in grams) is as follows: 


165 163 160 159 166 152 166 168 173 171 168 167 170 170 


1. Make a boxplot to display the data. 


2. According to a nutrition website, Burger King’s large fries weigh 160 
grams, on average. Ryan and Brent suspect that their local Burger 
King restaurants may be skimping on fries. Does the boxplot in 
Question 1 support their suspicion? Explain why or why not. 


Comparing Distributions with Boxplots 


Boxplots are especially effective for comparing the distribution of a 
quantitative variable in two or more groups, as seen in the following 


example. 
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EXAMPLE | Which company makes better tablets? > 


Comparing distributions with boxplots 


Peter Cade/Getty Images 


PROBLEM: Ina recent year, Consumer Reports rated many tablet 
computers for performance and quality. Based on several variables, the 
magazine gave each tablet an overall rating, where higher scores 
indicate better ratings. The overall ratings of the tablets produced by 
Apple and Samsung are given here, along with parallel boxplots and 
numerical summaries of the data.22 


Apple 87 87 87 87 86 86 86 86 84 84 
84 84 83 83 83 83 81 79 76 73 

Samsung 88 87 87 86 86 86 86 86 84 84 83 83 
Uf US 13 78 93 3 75s 5 7) VAL 


Apple + ok t—Th 


Manufacturer 


T T T T T 
60 65 70 75 80 85 90 


Overall rating 
x xX Sy. Min Q; Median Q3 Max IQR 
Apple 83.45 3.762 ie 83 84 86 87 3 
Samsung 79.87 6.74 62 75 83 86 88 11 


Compare the distributions of overall rating for Apple and Samsung. 
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SOLUTION: 


Remember to compare shape, outliers, center, and variability! 


Shape: Both distributions of overall ratings are skewed to the left. 


Outliers: | There are two low outliers in the Apple tablet distribution: 
overall ratings of 73 and 76. The Samsung tablet 
distribution has no outliers. 


Center: The Apple tablets had a slightly higher median overall 
rating (84) than the Samsung tablets (83). More 
importantly, about 75% of the Apple tablets had overall 
ratings that were greater than or equal to the median for 
the Samsung tablets. 


Because of the strong skewness and outliers, use the median and IQR 


instead of the mean and standard deviation when comparing center and 
variability. 


Variability: There is much more variation in overall rating among the 
Samsung tablets than the Apple tablets. The JQR for 
Samsung tablets (11) is almost four times larger than the 
IQR for Apple tablets (3). 


FOR PRACTICE, TRY EXERCISE 115 


AP® EXAM TIP 


Use statistical terms carefully and correctly on the AP® Statistics exam. 
Don’t say “mean” if you really mean “median.” Range is a single 


number; so are Q, Q3, and JOR. Avoid poor use of language, like “the 


outlier skews the mean” or “the median is in the middle of the IQR.” 
Skewed is a shape and the JQR is a single number, not a region. If you 
misuse a term, expect to lose some credit. 
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Here’s an activity that gives you a chance to put into practice what you 
have learned in this section. 


ACTIVITY Team challenge: Did Mr. Starnes stack his 
class? 


In this activity, you will work in a team of three or four students to 
resolve a dispute. 


Mr. Starnes teaches AP® Statistics, but he also does the class scheduling 
for the high school. There are two AP® Statistics classes—one taught by 
Mr. Starnes and one taught by Ms. McGrail. The two teachers give the 
same first test to their classes and grade the test together. Mr. Starnes’s 
students earned an average score that was 8 points higher than the 
average for Ms. McGrail’s class. Ms. McGrail wonders whether Mr. 
Starnes might have “adjusted” the class rosters from the computer 
scheduling program. In other words, she thinks he might have “stacked” 
his class. He denies this, of course. 


To help resolve the dispute, the teachers collect data on the 
cumulative grade point averages and SAT Math scores of their students. 
Mr. Starnes provides the GPA data from his computer. The students 
report their SAT Math scores. The following table shows the data for 
each student in the two classes. 

Did Mr. Starnes stack his class? Give appropriate graphical and 


numerical evidence to support your conclusion. Be prepared to defend 
your answer. 


Student Teacher GPA SAT-M 
it Starnes 2.900 670 
2 Starnes 2.860 520 
3 Starnes 2.600 5970 
4 Starnes 3.600 710 
fs) Starnes 3.200 600 
6 Starnes 2.700 590 
Ji Starnes 3.100 640 
8 Starnes 3.085 570 
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9 Starnes 3.750 710 


10 Starnes 3.400 630 
11 Starnes 3.338 630 
12 Starnes 3.560 670 
13 Starnes 3.800 650 
14 Starnes 3.200 660 
15 Starnes 3.100 510 
16 McGrail 2.900 620 
17 McGrail 3.300 590 
18 McGrail 3.980 650 
19 McGrail 2.900 600 
20 McGrail 3.200 620 
21 McGrail 3.500 680 
22 McGrail 2.800 500 
23 McGrail 2.900 502.5 
24 McGrail 3.950 640 
25 McGrail 3.100 630 
26 McGrail 2.850 580 
2, McGrail 2.900 590 
28 McGrail 3.245 600 
29 McGrail 3.000 600 
30 McGrail 3.000 620 
31 McGrail 2.800 580 
32 McGrail 2.900 600 
33 McGrail 3.200 600 


You can use technology to make boxplots, as the following 
Technology Corner illustrates. 


4. Technology Corner |MAKING BOXPLOTS 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


The TI-83/84 can plot up to three boxplots in the same viewing 
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window. Let’s use the calculator to make parallel boxplots of the 
overall rating data for Apple and Samsung tablets. 


1. Enter the ratings for Apple tablets in list L1 and for Samsung in list 
2: 


2. Set up two statistics plots: Plot1 to show a boxplot of the Apple data 
in list L1 and Plot2 to show a boxplot of the Samsung data in list L2. 
The setup for Plot1 is shown. When you define Plot2, be sure to 
change L1 to L2. 


Note: The calculator offers two types of boxplots: one that shows 
outliers and one that doesn’t. We’|l always use the type that identifies 
outliers. 


3. Use the calculator’s Zoom feature to display the parallel boxplots. 
Then Trace to view the five-number summary. 


e Press |ZOOM| and select ZoomStat. 
e Press |TRACE|. 


|AEXSI Plot2 Plot3 

Off 

Type: Let Le chy Bu bh 
Xlist:Li 

Frea :1 

Mark :BU += - 
Color: aU = 


NORMAL FLOAT AUTO REAL RADIAN MP ft] 


Section 1.3 Summary 


190 


A numerical summary of a distribution should include measures of 
center and variability. 


The mean x = and the median describe the center of a distribution in 
different ways. The mean is the average of the observations: x =) xin. 
_ ei 

a a The median is the midpoint of the distribution, the number 
such that about half the observations are smaller and half are larger. 


The simplest measure of variability for a distribution of quantitative data 
is the range, which is the distance from the maximum value to the 
minimum value. 


When you use the mean to describe the center of a distribution, use the 
standard deviation to describe the distribution’s variability. The 
standard deviation s, gives the typical distance of the values in a 


distribution from the mean. In symbols, sx=y (xi-x )2n-1. 


n—1 ' The standard deviation s, is 0 when there is no 
variability and gets larger as variability from the mean increases. 
When you use the median to describe the center of a distribution, use the 


interquartile range to describe the distribution’s variability. The first 
quartile Q, has about one-fourth of the observations below it, and the 


third quartile Q3 has about three-fourths of the observations below it. 


The interquartile range (IQR) measures variability in the middle half of 
the distribution and is found using IQR = Q3 — Q}. 


The median is a resistant measure of center because it is relatively 
unaffected by extreme observations. The mean is not resistant. Among 
measures of variability, the JQR is resistant, but the standard deviation 
and range are not. 


According to the 1.5 x IQR rule, an observation is an outlier if it is less 
than Q; — 1.5 x IQR or greater than Q3 + 1.5 x IQR. 


Boxplots are based on the five-number summary of a distribution, 
consisting of the minimum, Q, the median, Q3, and the maximum. The 


box shows the variability in the middle half of the distribution. The 
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median is marked within the box. Lines extend from the box to the 
smallest and the largest observations that are not outliers. Outliers are 
plotted with special symbols. Boxplots are especially useful for 
comparing distributions. 


1.3 Technology Corners (> 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 
3. Computing numerical summaries Page 66 
4. Making boxplots Page 73 


Section 1.3 Exercises 


87. pg 55 |) Quiz grades Joey’s first 14 quiz grades in a marking 
period were as follows: 


86 84 91 75 78 80 74 
87 76 96 82 90 98 we 


a. Calculate the mean. Show your work. 


b. Suppose Joey has an unexcused absence for the 15th quiz, and he 
receives a score of 0. Recalculate the mean. What property of the 
mean does this illustrate? 


88. Pulse rates Here are data on the resting pulse rates (in beats per 
minute) of 19 middle school students: 


71 104 76 88 78 71 68 86 70 90 
74. 76 69 68 88 96 68 82 120 


a. Calculate the mean. Show your work. 


b. The student with a 120 pulse rate has a medical issue. Find the 
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mean pulse rate for the other 18 students. What property of the 
mean does this illustrate? 


89. pg 57 |p) Quiz grades Refer to Exercise 87. 


a. Find the median of Joey’s first 14 quiz grades. 


b. Find the median of Joey’s quiz grades after his unexcused absence. 
Explain why the 0 quiz grade does not have much effect on the 
median. 


90. Pulse rates Refer to Exercise 88. 


91. 


92. 


a. Find the median pulse rate for all 19 students. 


b. Find the median pulse rate excluding the student with the medical 
issue. Explain why this student’s 120 pulse rate does not have 
much effect on the median. 


Electing the president To become president of the United States, a 
candidate does not have to receive a majority of the popular vote. The 
candidate does have to win a majority of the 538 Electoral College 
votes. Here is a stemplot of the number of electoral votes in 2016 for 
each of the 50 states and the District of Columbia: 


3333333344444 
55566666677788999 
00001111234 

5668 

00 

99 


8 


Key: 15 is a state with 
15 electoral votes. 


MUP EWN Dr AH HOO 


a. Find the median. 


b. Without doing any calculations, explain how the mean and median 
compare. 


Birthrates in Africa One of the important factors in determining 
population growth rates is the birthrate per 1000 individuals in a 
population. The dotplot shows the birthrates per 1000 individuals 
(rounded to the nearest whole number) for 54 African nations. 
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Birthrate (per 1000 population) 
a. Find the median. 


b. Without doing any calculations, explain how the mean and median 
compare. 


93. House prices The mean and median selling prices of existing single- 
family homes sold in September 2016 were $276,200 and 
$234,200.42 Which of these numbers is the mean and which is the 
median? Explain your reasoning. 


94. Mean salary? Last year a small accounting firm paid each of its five 
clerks $32,000, two junior accountants $60,000 each, and the firm’s 
owner $280,000. 


a. What is the mean salary paid at this firm? How many of the 
employees earn less than the mean? What is the median salary? 


b. Write a sentence to describe how an unethical recruiter could use 
Statistics to mislead prospective employees. 


95. Do adolescent girls eat fruit? We all know that fruit is good for us. 
Here is a histogram of the number of servings of fruit per day claimed 


by 74 seventeen-year-old girls in a study in Pennsylvania: 


15 


Number of subjects 


0 1 2 3 = 5 6 7 8 
Servings of fruit per day 


a. Find the median number of servings of fruit per day from the 
histogram. Explain your method clearly. 
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b. Calculate the mean of the distribution. Show your work. 


96. Shakespeare The histogram shows the distribution of lengths of 
words used in Shakespeare’s plays.44 


25 4 


20 = 


Percent 


2 4 6 8 10 
Word length 


a. Find the median word length in Shakespeare’s plays from the 
histogram. Explain your method clearly. 


b. Calculate the mean of the distribution. Show your work. 


97. Quiz grades Refer to Exercise 87. 


a. Find the range of Joey’s first 14 quiz grades and the range of 
Joey’s quiz grades after his unexcused absence. 


b. Explain what part (a) suggests about using the range as a measure 
of variability for a distribution of quantitative data. 


98. Pulse rates Refer to Exercise 88. 


a. Find the range of the pulse rates for all 19 students and the range of 
the pulse rates excluding the student with the medical issue. 


b. Explain what part (a) suggests about using the range as a measure 
of variability for a distribution of quantitative data. 


99. pg 61 |) Foot lengths Here are the foot lengths (in centimeters) 


for arandom sample of seven 14-year-olds from the United Kingdom: 


25 Ze 20 25 24 24 28 


Calculate the standard deviation. Interpret this value. 
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100. Well rested? A random sample of 6 students in a first-period 
Statistics class was asked how much sleep (to the nearest hour) they 
got last night. Their responses were 6, 7, 7, 8, 10, and 10. Calculate 
the standard deviation. Interpret this value. 


101. File sizes How much storage space does your music use? Here is a 
dotplot of the file sizes (to the nearest tenth of a megabyte) for 18 
randomly selected files on Nathaniel’s mp3 player: 


ee @ @ 2 ee a e 9 e 
| a ay a | T T T T Le § T "t T 
1 2 3 4 5 6 7 8 


File size (mb) 


a. The distribution of file size has a mean of x =3.2% = 3.2 
megabytes and a standard deviation of s, = 1.9 megabytes. 


Interpret the standard deviation. 


b. Suppose the music file that takes up 7.5 megabytes of storage 
space is replaced with another version of the file that only takes 
up 4 megabytes. How would this affect the mean and the standard 
deviation? Justify your answer. 


102. Healthy fast food? Here is a dotplot of the amount of fat (to the 
nearest gram) in 12 different hamburgers served at a fast-food 
restaurant: 


e 
e es e & ee ee e . e 
T T T T T 
10 15 20 25 30 35 40 
Fat content (g) 


a. The distribution of fat content has a mean of x =22.83% = 22.83 
grams and a standard deviation of s, = 9.06 grams. Interpret the 


standard deviation. 


b. Suppose the restaurant replaces the burger that has 22 grams of 
fat with a new burger that has 35 grams of fat. How would this 
affect the mean and the standard deviation? Justify your answer. 


103. Comparing SD Which of the following distributions has a smaller 
standard deviation? Justify your answer. 
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E2#s43 67 8 123 43 67 8 
Variable A Variable B 


104. Comparing SD The parallel dotplots show the lengths (in 
millimeters) of a sample of 11 nails produced by each of two 
machines. Which distribution has the larger standard deviation? 
Justify your answer. 


A & » e 
: 
z 
= ns s e ; 
B 0 * ° 9 Ss 
“i. ws on re gor, - 
68 69 70 71 72 


Length (mm) 
105. pg 64 |») File sizes Refer to Exercise 101. Find the interquartile 
range of the file size distribution shown in the dotplot. 


106. Healthy fast food? Refer to Exercise 102. Find the interquartile 
range of the fat content distribution shown in the dotplot. 


107. pg 67 |p) File sizes Refer to Exercises 101 and 105. Identify any 
outliers in the distribution. Show your work. 


108. Healthy fast food? Refer to Exercises 102 and 106. Identify any 
outliers in the distribution. Show your work. 


109. Shopping spree The figure displays computer output for data on the 
amount spent by 50 grocery shoppers. 


x & s, Min Q; Med Q3 ~~ Max 
Amount spent 34.70 21.70 3.11 19.27 27.86 45.40 93.34 
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a. What would you guess is the shape of the distribution based only 
on the computer output? Explain. 


b. Interpret the value of the standard deviation. 


c. Are there any outliers? Justify your answer. 


110. C-sections A study in Switzerland examined the number of cesarean 
sections (surgical deliveries of babies) performed in a year by 
samples of male and female doctors. Here are summary statistics for 
the two distributions: 


x © s Min Q; Med Q3 Max 
Male doctors 41.333 20.607 20 27 34 50 86 
Female doctors 19.1 10.126 5 10 185 29 33 


a. Based on the computer output, which distribution would you 
guess has a more symmetrical shape? Explain your answer. 


b. Explain how the IQRs of these two distributions can be so similar 
even though the standard deviations are quite different. 


c. Does either distribution have any outliers? Justify your answer. 


111. pg 69 |») Don’t call me According to a study by Nielsen Mobile, 


“Teenagers ages 13 to 17 are by far the most prolific texters, 
sending 1742 messages a month.” Mr. Williams, a high school 
Statistics teacher, was skeptical about the claims in the article. So he 
collected data from his first-period statistics class on the number of 
text messages they had sent in the past 24 hours. Here are the data: 


0 7 129 25 @ 5 1 25 98 9 O 26 
8 11872 0 92 52 14 3 3 44 5 42 
a. Make a boxplot of these data. 


b. Use the boxplot you created in part (a) to explain how these data 
seem to contradict the claim in the article. 


112. Acing the first test Here are the scores of Mrs. Liao’s students on 
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their first statistics test: 


93 93 87.5 91 94572 96 95 93.593.5 73 
82. 45 88 80 86 85.5 87.5 81 78 86 89 
92 91 98 85 82.588 94.5 43 


a. Make a boxplot of these data. 


b. Use the boxplot you created in part (a) to describe how the 
students did on Mrs. Liao’s first test. 


113. Electing the president Refer to Exercise 91. Here are a boxplot and 
some numerical summaries of the electoral vote data: 


0 10 20 30 40 50 60 
Electoral votes 
Variable N Mean SD Min Q, Median Q3 Max 
Electoral votes 51 10.55 9.69 3 4 8 12 55 


a. Explain why the median and JQR would be a better choice for 
summarizing the center and variability of the distribution of 
electoral votes than the mean and standard deviation. 


b. Identify an aspect of the distribution that the stemplot in Exercise 
91 reveals that the boxplot does not. 


114. Birthrates in Africa Refer to Exercise 92. Here are a boxplot and 
some numerical summaries of the birthrate data: 


I I T I 
10 20 30 40 50 


Birthrate (per 1000 individuals) 


Variable N Mean SD Min Qi Median Q3 Max 
Birthrate 54 34.91 8.57 14.00 29.00 37.50 41.00 53.00 
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a. Explain why the median and JQR would be a better choice for 
summarizing the center and variability of the distribution of 
birthrates in African countries than the mean and standard 
deviation. 


b. Identify an aspect of the distribution that the dotplot in Exercise 
92 reveals that the boxplot does not. 


115. pg 71 |p) Energetic refrigerators Consumer Reports magazine 


rated different types of refrigerators, including those with bottom 
freezers, those with top freezers, and those with side freezers. One 
of the variables they measured was annual energy cost (in dollars). 
The following boxplots show the energy cost distributions for each 
of these types. Compare the energy cost distributions for the three 
types of refrigerators. 
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116. Income in New England The following boxplots show the total 
income of 40 randomly chosen households each from Connecticut, 
Maine, and Massachusetts, based on U.S. Census data from the 
American Community Survey. Compare the distributions of annual 
incomes in the three states. 


Maine 


Massachusetts 


Connecticut {EH 


I... Ll 
0 50 100 150 200 250 300 350 400 450 500 
Annual household income ($1000s) 


117. Who texts more? For their final project, a group of AP® Statistics 
students wanted to compare the texting habits of males and females. 
They asked a random sample of students from their school to record 
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the number of text messages sent and received over a two-day 
period. Here are their data: 


127 44 28 83 0 6 78 6 
Males 
5 213 73 20 214 28 11 
112 203 102 54 379 305 179 24 
Females 
127 65 41 27 298 6 130 0 


a. Make parallel boxplots of the data. 


b. Use your calculator to compute numerical summaries for both 
samples. 

c. Do these data suggest that males and females at the school differ 
in their texting habits? Use the results from parts (a) and (b) to 
support your answer. 

118. SSHA scores Here are the scores on the Survey of Study Habits and 

Attitudes (SSHA) for a random sample of 18 first-year college 

women: 


154 109 137 115 152 140 154 178 101 
103 126 126 137 165 165 129 200 148 


Here are the SSHA scores for a random sample of 20 first-year 
college men: 


108 140 114 91 180 115 126 
92 169 146 109 132 75 88 
113 151 70 115 187 104 


Note that high scores indicate good study habits and attitudes 
toward learning. 


a. Make parallel boxplots of the data. 


b. Use your calculator to compute numerical summaries for both 
samples. 


c. Do these data support the belief that men and women differ in 
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their study habits and attitudes toward learning? Use your results 
from parts (a) and (b) to support your answer. 


119. Income and education level Each March, the Bureau of Labor 
Statistics compiles an Annual Demographic Supplement to its 
monthly Current Population Survey.“2 Data on about 71,067 
individuals between the ages of 25 and 64 who were employed full- 
time were collected in one of these surveys. The parallel boxplots 
compare the distributions of income for people with five levels of 
education. This figure is a variation of the boxplot idea: because 
large data sets often contain very extreme observations, we omitted 
the individuals in each category with the top 5% and bottom 5% of 
incomes. Also, the whiskers are drawn all the way to the maximum 
and minimum values of the remaining data for each distribution. 


300 4 


Nn 
vv 
Oo 


Total income ($ 1000s) 


“pena 4 


T 
Not HS Some __ Bachelor’s Advanced 
HS grad grad college 


Education level 


Use the graph to help answer the following questions. 
a. What shape do the distributions of income have? 


b. Explain how you know that there are outliers in the group that 
earned an advanced degree. 


c. How does the typical income change as the highest education 
level reached increases? Why does this make sense? 


d. Describe how the variability in income changes as the highest 
education level reached increases. 


120. Sleepless nights Researchers recorded data on the amount of sleep 
reported each night during a week by a random sample of 20 high 
school students. Here are parallel boxplots comparing the 
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distribution of time slept on all 7 nights of the study:4 


Time slept (h) 
fo.) 
! 


dies! a 


0 T T T 3 T 
Mon Tues Wed = Thurs Fri Sat Sun 


Night of the week 


Use the graph to help answer the following questions. 


Which distributions have a clear left-skewed shape? 
Which outlier stands out the most, and why? 


How does the typical amount of sleep that the students got 
compare on these seven nights? 


. On which night was there the most variation in how long the 


students slept? Justify your answer. 


121. SD contest This is a standard deviation contest. You must choose 
four numbers from the whole numbers 0 to 10, with repeats allowed. 


a. 


Cc. 


Choose four numbers that have the smallest possible standard 
deviation. 


Choose four numbers that have the largest possible standard 
deviation. 


Is more than one choice possible in either part (a) or (b)? Explain. 


122. What do they measure? For each of the following summary 
Statistics, decide (i) whether it could be used to measure center or 
variability and (ii) whether it is resistant. 


Q1+Q3 


a. Q1+Q32. 2 


b. 


Max—Min 
Max — Min2 2 
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Multiple Choice Select the best answer for Exercises 123-126. 


123. If a distribution is skewed to the right with no outliers, which 
expression is correct? 


a. Mean < median 
b. mean * median 
c. mean = median 
d. mean > median 


e. We can’t tell without examining the data. 


124. The scores on a statistics test had a mean of 81 and a standard 
deviation of 9. One student was absent on the test day, and his score 
wasn’t included in the calculation. If his score of 84 was added to 
the distribution of scores, what would happen to the mean and 
standard deviation? 


a. Mean will increase, and standard deviation will increase. 
b. Mean will increase, and standard deviation will decrease. 
c. Mean will increase, and standard deviation will stay the same. 
d. Mean will decrease, and standard deviation will increase. 


e. Mean will decrease, and standard deviation will decrease. 


125. The stemplot shows the number of home runs hit by each of the 30 
Major League Baseball teams in a single season. Home run totals 
above what value should be considered outliers? 


09 | 15 
10 | 3789 
11 | 47 
12) 19 


14 | 89 

15 | 34445 

16 | 239 

17 | 223 

18 | 356 

19 | 1 = 
3 Key: 14|8 is a 

21 0 team with 148 
2 home runs. 
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a. 173 
b. 210 
c,. 222 
d. 229 


e. 257 


126. Which of the following boxplots best matches the distribution shown 
in the histogram? 


Ga on te belo 
4 6 8 10 12 


T 
0 2 4 6 8 10 12 0 2 


(b) Data (e) Data 
-+—i—- 

0 2 & 6 8.10 1 

(c) Data 


Recycle and Review 


127. How tall are you? (1.2) We used Census At School’s “Random Data 
Selector” to choose a sample of 50 Canadian students who 
completed a survey in a recent year. Here are the students’ heights 
(in centimeters): 


166.5 170.0 178.0 163.0 150.5 169.0 173.0 169.0 171.0 166.0 
190.0 183.0 178.0 161.0 171.0 170.0 191.0 168.5 178.5 173.0 
175.0 160.5 166.0 164.0 163.0 174.0 160.0 174.0 182.0 167.0 
166.0 170.0 170.0 181.0 171.5 160.0 178.0 157.0 165.0 187.0 
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168.0 157.5 145.5 156.0 182.0 1685 177.0 162.5 160.5 185.5 


Make an appropriate graph to display these data. Describe the shape, 
center, and variability of the distribution. Are there any outliers? 


128. Success in college (1.1) The Freshman Survey asked first-year 
college students about their “habits of mind”—specific behaviors 
that college faculty have identified as being important for student 
success. One question asked students, “How often in the past year 
did you revise your papers to improve your writing?” Another 
asked, “How often in the past year did you seek feedback on your 
academic work?” The figure is a bar graph comparing the percent of 
males and females who answered “frequently” to these two 
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questions.“2 
{i Male [J Female 


60 5 54.9% 


% “Frequently” 


Revise papers to Seek feedback 
improve writing on work 


What does the graph reveal about the habits of mind of male and 


female college freshmen? 
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— FRAPPY! FREE RESPONSE 


AP® PROBLEM, 
YAY! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, concise 
response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you 
use, because you will be scored on the correctness of your methods 
as well as on the accuracy and completeness of your results and 
explanations. 


Using data from the 2010 census, a random sample of 348 U.S. 
residents aged 18 and older was selected. Among the variables 
recorded were gender (male or female), housing status (rent or own), 
and marital status (married or not married). 

The two-way table below summarizes the relationship between 
gender and housing status. 


Male Female Total 
Own 132 122 254 
Rent 50 44 94 
Total 182 166 348 


a. What percent of males in the sample own their home? 


b. Make a graph to compare the distribution of housing status for 
males and females. 
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c. Using your graph from part (b), describe the relationship 
between gender and housing status. 


d. The two-way table below summarizes the relationship between 
marital status and housing status. 


Married Not Married Total 
Own 172 82 254 
Rent 40 54 94 
Total 212 136 348 


For the members of the sample, is the relationship between 
marital status and housing status stronger or weaker than the 
relationship between gender and housing status that you 
described in part (c)? Justify your choice using the data 
provided in the two-way tables. 


After you finish, you can view two example solutions on the book’s 
website (highschool.bfwpub.com/tps6e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 


“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide a scoring rubric. Score your response and note what, if 
anything, you would do differently to improve your own score. 
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Chapter 1 Review 


Introduction: Data Analysis: Making Sense of Data 


In this brief section, you learned several fundamental concepts that will be 
important throughout the course: the idea of a distribution and the 
distinction between quantitative and categorical variables. You also 
learned a strategy for exploring data: 


e Begin by examining each variable by itself. Then move on to study 
relationships between variables. 


e Start with a graph or graphs. Then add numerical summaries. 


Section 1.1: Analyzing Categorical Data 


In this section, you learned how to display the distribution of a single 
categorical variable with bar graphs and pie charts and what to look for 
when describing these displays. Remember to properly label your graphs! 
Poor labeling is an easy way to lose points on the AP® Statistics exam. 
You should also be able to recognize misleading graphs and be careful to 
avoid making misleading graphs yourself. 


Next, you learned how to investigate the relationship between two 
categorical variables. Using a two-way table, you learned how to calculate 
and display marginal and joint relative frequencies. Conditional relative 
frequencies and side-by-side or segmented bar graphs allow you to look 
for an association between the variables. If there is no association between 
the two variables, comparative bar graphs of the distribution of one 
variable for each value of the other variable will be identical. If differences 
in the corresponding conditional relative frequencies exist, there is an 
association between the variables. That is, knowing the value of one 
variable helps you predict the value of the other variable. 


Section 1.2: Displaying Quantitative Data with Graphs 


In this section, you learned how to create three different types of graphs 
for a quantitative variable: dotplots, stemplots, and histograms. Each of the 
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graphs has distinct benefits, but all of them are good tools for examining 
the distribution of a quantitative variable. Dotplots and stemplots are 
handy for small sets of data. Histograms are the best choice when there are 
a large number of observations. On the AP® exam, you will be expected to 
create each of these types of graphs, label them properly, and comment on 
their characteristics. 


When you are describing the distribution of a quantitative variable, you 
should look at its graph for the overall pattern (shape, center, variability) 
and striking departures from that pattern (outliers). Use the acronym 
SOCV (shape, outliers, center, variability) to help remember these four 
characteristics. When comparing distributions, you should include explicit 
comparison words for center and variability such as “is greater than” or “is 
approximately the same as.” When asked to compare distributions, a very 
common mistake on the AP® exam is describing the characteristics of each 
distribution separately without making these explicit comparisons. 


Section 1.3: Describing Quantitative Data with Numbers 


To measure the center of a distribution of quantitative data, you learned 
how to calculate the mean and the median of a distribution. You also 
learned that the median is a resistant measure of center, but the mean isn’t 
resistant because it can be greatly affected by skewness or outliers. 


To measure the variability of a distribution of quantitative data, you 
learned how to calculate the range, standard deviation, and interquartile 
range. The standard deviation is the most commonly used measure of 
variability and approximates the typical distance of a value in the data set 
from the mean. The standard deviation is not resistant—it is heavily 
affected by extreme values. The interquartile range (QR) is a resistant 
measure of variability because it ignores the upper 25% and lower 25% of 
the distribution, but the range isn’t resistant because it uses only the 
minimum and maximum value. 


To identify outliers in a distribution of quantitative data, you learned 
the 1.5 x JQR rule. You also learned that boxplots are a great way to 
visually summarize a distribution of quantitative data. Boxplots are helpful 
for comparing the center (median) and variability (range, IQR) for multiple 
distributions. Boxplots aren’t as useful for identifying the shape of a 
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distribution because they do not display peaks, clusters, gaps, and other 
interesting features. 


What Did You Learn? 


Related 

Example on _— Relevant Chapter 
Learning Target Section Page(s) Review Exercise(s) 
Identify the Intro 2 R11 
individuals and 
variables in a set of 
data. 
Classify variables as Intro 2 R1.1 
categorical or 
quantitative. 
Make and interpret bar 1.1 11 R1.2 
graphs for categorical 
data. 
Identify what makes 1.1 12 R1.3 
some graphs of 
categorical data 
misleading. 
Calculate marginal 1.1 15 R14 


and joint relative 
frequencies from a 
two-way table. 
Calculate conditional 1.1 17 R1.4, R1.5 
relative frequencies 

from a two-way table. 

Use bar graphs to 1.1 20 RLS 
compare distributions 

of categorical data. 

Describe the nature of i 20 R15 
the association 

between two 

categorical variables. 


Make and interpret 12 Dotplots: 31 R16, R1.7 
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dotplots, stemplots, Stemplots: 38 


and histograms of Histograms: 
quantitative data. 42 
Identify the shape of a 12 33 
distribution from a 

graph. 

Describe the overall 12 35 


pattern (shape, center, 
and variability) of a 
distribution and 
identify any major 
departures from the 
pattern (outliers). 


Compare distributions 1.2 36 
of quantitative data 

using dotplots, 

stemplots, and 


histograms. 
Calculate measures of 1.3 Mean: 55 
center (mean, median) Median: 57 


for a distribution of 
quantitative data. 


Calculate and interpret 1.3 SD: 61 
measures of variability IQR: 64 
(range, standard 

deviation, IQR) for a 

distribution of 

quantitative data. 


Explain how outliers 1.3 69 
and skewness affect 
measures of center and 


variability. 

Identify outliers using 13 67 
the 1.5 x JQR rule. 

Make and interpret 13 69 


boxplots of 
quantitative data. 


Use boxplots and 1 71 
numerical summaries 
to compare 
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distributions of 
quantitative data. 
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Chapter 1 Review Exercises 


These exercises are designed to help you review the important ideas and 
methods of the chapter. 


R1.1 |») Who buys cars? A car dealer keeps records on car buyers for 


future marketing purposes. The table gives information on the last 4 


buyers. 

Buyer’s 

distance 

from 

Buyer’s dealer Car Model 
name Zip code Gender = (mi) model year Price 
P. Smith 27514 M 13 Fiesta 2018 $26,375 
K. Ewing 27510 M 10 Mustang 2015 $39,500 
ie 27516 F 2 Fusion 2016 $38,400 
Shipman 
S. Reice 27243 F 4 F-150 2016 $56,000 


a. Identify the individuals in this data set. 


b. What variables were measured? Classify each as categorical or 
quantitative. 


R1.2 |») I want candy! Mr. Starnes bought some candy for his APp® 


Statistics class to eat on Halloween. He offered the students an 
assortment of Snickers®, Milky Way®, Butterfinger®, Twix®, and 3 
Musketeers® candies. Each student was allowed to choose one 
option. Here are the data on the type of candy selected. Make a 
relative frequency bar graph to display the data. Describe what you 


see. 

Twix Snickers Butterfinger 
Butterfinger Snickers Snickers 

3 Musketeers Snickers Snickers 
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Butterfinger Twix Twix 


Twix Twix Twix 
Snickers Snickers Twix 
Snickers Milky Way Twix 

Twix Twix Butterfinger 
Milky Way Butterfinger 3 Musketeers 
Milky Way Butterfinger Butterfinger 


R1.3 |p) Pd die without my phone! In a survey of over 2000 U.S. 


teenagers by Harris Interactive, 47% said that “their social life 
would end or be worsened without their cell phone.”4© One survey 
question asked the teens how important it is for their phone to have 
certain features. The following figure displays data on the percent 
who indicated that a particular feature is vital. 


Percent who said vital 


Make/receive Send/receive Camera  Send/receive 
calls text messages pictures 


Feature 


a. Explain how the graph gives a misleading impression. 


b. Would it be appropriate to make a pie chart to display these data? 
Why or why not? 
R1.4 |») Facebook and age Is there a relationship between Facebook use 


and age among college students? The following two-way table 
displays data for the 219 students who responded to the survey.*4 


Age 
Younger (18— | Middle (23— | Older (28 and 
22) 27) up) 


Facebook Yes 21 
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user? No 4 21 46 


a. What percent of the students who responded were Facebook 
users? 


b. What percent of the students in the sample were aged 28 or 
older? 


c. What percent of the students who responded were older 
Facebook users? 


d. What percent of the Facebook users in the sample were younger 
students? 


R1.5 |) Facebook and age Refer to the preceding exercise. 


a. Find the distribution of Facebook use for each of the three age 
groups. Make a segmented bar graph to compare these 
distributions. 


b. Describe what the graph in part (a) reveals about the association 
between age and Facebook use. 


R1.6 |p) Density of the earth In 1798, the English scientist Henry 


Cavendish measured the density of the earth several times by careful 
work with a torsion balance. The variable recorded was the density 
of the earth as a multiple of the density of water. Here are 
Cavendish’s 29 measurements:“2 

5.90 9.61 488 507 5.26 555 5.36 5.29 5.58 5.65 
5.07 9.03 5.62 5.29 544 534 5.79 510 5.27 5.39 
5.42 9.47 5.63 534 546 530 5.75 568 5.85 


a. Make a stemplot of the data. 
b. Describe the distribution of density measurements. 


c. The currently accepted value for the density of earth is 5.51 times 
the density of water. How does this value compare to the mean of 
the distribution of density measurements? 


R1.7 |») Guinea pig survival times Here are the survival times (in days) 
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of 72 guinea pigs after they were injected with infectious bacteria in 
a medical experiment.22 Survival times, whether of machines under 
stress or cancer patients after treatment, usually have distributions 
that are skewed to the right. 


43 45 53 96 56 57 458 66 £467 73 74 79 
80 80 ~=—s 81 81 81 82 83 83 84 88 89 91 
91 92 92 97 99 99 100 100 101 102 102 = 102 
103. 104 107 108 #109 #113 «#114 #+:118 «121 #123 «126 = «128 
137. 138) «©1389 «144 «145 «147 «#156 162 174 178 179 184 
191 198 211 214 243 249 329 380 403 511 522 598 


a. Make a histogram of the data. Does it show the expected right 
skew? 


b. Now make a boxplot of the data. 


c. Compare the histogram from part (a) with the boxplot from part 
(b). Identify an aspect of the distribution that one graph reveals 
but the other does not. 


R1.8 |») Household incomes Rich and poor households differ in ways 
that go beyond income. Here are histograms that compare the 
distributions of household size (number of people) for low-income 
and high-income households.22 Low-income households had annual 


incomes less than $15,000, and high-income households had annual 
incomes of at least $100,000. 
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Percent 


Household size, low income 


Percent 
! 


1 2 3 4 5 6 


Household size, high income 


a. About what percent of each group of households consisted of 
four or more people? 


b. Describe the similarities and differences in these two 
distributions of household size. 


Exercises R1.9 and R1.10 refer to the following setting. Do you like to eat 
tuna? Many people do. Unfortunately, some of the tuna that people eat 
may contain high levels of mercury. Exposure to mercury can be 
especially hazardous for pregnant women and small children. How much 
mercury is safe to consume? The Food and Drug Administration will take 
action (like removing the product from store shelves) if the mercury 
concentration in a 6-ounce can of tuna is 1.00 ppm (parts per million) or 
higher. 

What is the typical mercury concentration in cans of tuna sold in 
stores? A study conducted by Defenders of Wildlife set out to answer this 
question. Defenders collected a sample of 164 cans of tuna from stores 
across the United States. They sent the selected cans to a laboratory that is 
often used by the Environmental Protection Agency for mercury testing. 


R1.9 |») Mercury in tuna Here are a dotplot and numerical summaries 
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of the data on mercury concentration in the sampled cans (in parts 
per million, ppm): 


Variable 
Mercury 
Variable 
Mercury 


Seeceesescesoos 
Secececccecocoos 


e@eeeoeoeoeooeooo eee ee & J J = - J 2 2 
0.00 0.25 0.50 0.75 1.00 1.25 1.50 
Mercury concentration (ppm) 


N Mean SD 
164 0.285 0.300 
Qi Med Q3 

0.071 0.180 0.380 


a. Interpret the standard deviation. 


b. Determine whether there are any outliers. 


c. Explain why the mean is so much larger than the median of the 
distribution. 


R1.10 |») Mercury in tuna Is there a difference in the mercury 


concentration of light tuna and albacore tuna? Use the parallel 


boxplots and the computer output to write a few sentences 


comparing the two distributions. 


Type 
Albacore 
Light 
Type 


Albacore 
iT 
$ 


Type 


Light 


T ae | T T a | T 


T T T 
02 04 0.6 08 10 12 1.4 1.6 
Mercury concentration (ppm) 


T 
0.0 


N Mean SD 
20 0.401 0.152 
144 0.269 0.312 
Med 
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Min 
0.170 
0.012 

Max 


Q1 Q3 
Albacore 0.293 0.400 0.460 0.730 


Light 0.059 0.160 0.347 1.500 
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Chapter 1 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T1.1 You record the age, marital status, and earned income of a sample 
of 1463 women. The number and type of variables you have 
recorded are 


a. 3 quantitative, 0 categorical. 
b. 

C. 
d. 


e. 


4 quantitative, 0 categorical. 
3 quantitative, 1 categorical. 
2 quantitative, 1 categorical. 


1 quantitative, 2 categorical. 


T1.2 The students in Mr. Tyson’s high school statistics class were 
recently asked if they would prefer a pasta party, a pizza party, or a 
donut party. The following bar graph displays the data. 


Frequency 


Donut Pasta Pizza 


Preferred party 


This graph is misleading because 


a. it should be a histogram, not a bar graph. 
b. 

C. 
d. 


e. 


there should not be gaps between the bars. 
the bars should be arranged in decreasing order by height. 
the vertical axis scale should start at 0. 


preferred party should be on the vertical axis and number of 
students should be on the horizontal axis. 


T1.3 Forty students took a statistics test worth 50 points. The dotplot 
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displays the data. The third quartile is 


a. 45. 
b. 32. 
c. 44. 
d. 23. 
e. 43. 


Questions T1.4—T1.6 refer to the following setting. Realtors collect data in 
order to serve their clients more effectively. In a recent week, data on the 
age of all homes sold in a particular area were collected and displayed in 
this histogram. 


05 1 2 35 45 55 65 75 
Age of home (yr) 


T1.4 Which of the following could be the median age? 
a. 19 years 
b. 34 years 
c. 24 years 
d. 39 years 
e. 29 years 
T1.5 Which of the following is most likely true? 
a. Mean > median, range < JQR 
b. mean < median, range < JQR 


c. mean > median, range > IQR 
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d. mean < median, range > IQR 
e. mean = median, range > IQR 


T1.6 The standard deviation of the distribution of house age is about 16 
years. Interpret this value. 


a. The age of all houses in the sample is within 16 years of the 
mean. 


b. The gap between the youngest and oldest house is 16 years. 


c. The age of all the houses in the sample is 16 years from the 
mean. 


d. The gap between the first quartile and the third quartile is 16 
years. 


e. The age of the houses in the sample typically varies by about 
16 years from the mean age. 


T1.7 The mean salary of all female workers is $35,000. The mean salary 
of all male workers is $41,000. What must be true about the mean 
salary of all workers? 


a. It must be $38,000. 

b. It must be larger than the median salary. 

It could be any number between $35,000 and $41,000. 
d. It must be larger than $38,000. 

e. It cannot be larger than $40,000. 


e) 


Questions T1.8 and 11.9 refer to the following setting. A survey was 
designed to study how business operations vary by size. Companies were 
classified as small, medium, or large. Questionnaires were sent to 200 
randomly selected businesses of each size. Because not all questionnaires 
are returned, researchers decided to investigate the relationship between 
the response rate and the size of the business. The data are given in the 
following two-way table. 


Business size 
Small Medium Large 
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Response? Yes 125 81 40 


No 793 119 160 


T1.8 What percent of all small companies receiving questionnaires 
responded? 


a. 12.5% 
b. 50.8% 
c. 20.8% 
d. 62.5% 
e. 33.3% 


T1.9 Which of the following conclusions seems to be supported by the 
data? 


a. There are more small companies than large companies in the 
survey. 


b. Small companies appear to have a higher response rate than 
medium or big companies. 


c. Exactly the same number of companies responded as didn’t 
respond. 


d. Overall, more than half of companies responded to the survey. 


e. If we combined the medium and large companies, then their 
response rate would be equal to that of the small companies. 


T1.10 An experiment was conducted to investigate the effect of a new 
weed killer to prevent weed growth in onion crops. Two chemicals 
were used: the standard weed killer (S) and the new chemical (N). 
Both chemicals were tested at high and low concentrations on 50 
test plots. The percent of weeds that grew in each plot was 
recorded. Here are some boxplots of the results. 


224 


N—low conc. 
S—low conc. 


N—high conc. 


S—high conc. 


Percent of weeds that grew 


Which of the following is not a correct statement about the results 
of this experiment? 


a. 


At both high and low concentrations, the new chemical results 
in better weed control than the standard weed killer. 


. For both chemicals, a smaller percentage of weeds typically 


grew at higher concentrations than at lower concentrations. 


The results for the standard weed killer are less variable than 
those for the new chemical. 


. High and low concentrations of either chemical have 


approximately the same effects on weed growth. 


Some of the results for the low concentration of weed killer 
show a smaller percentage of weeds growing than some of the 
results for the high concentration. 


Section II: Free Response Show all your work. Indicate clearly the 
methods you use, because you will be graded on the correctness of your 
methods as well as on the accuracy and completeness of your results and 
explanations. 


T1.11 You are interested in how many contacts older adults have in their 
smartphones. Here are data on the number of contacts for a random 
sample of 30 elderly adults with smartphones in a large city: 


7 
42 
72 


a. 


20 24 25 25 28 28 30 oe 35 
43 44 45 46 47 48 48 50 ol 
75 a7 78 7 83 87 88 135 151 


Construct a histogram of these data. 


b. Are there any outliers? Justify your answer. 


c. Would it be better to use the mean and standard deviation or the 
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median and IQR to describe the center and variability of this 
distribution? Why? 


T1.12 A study among the Pima Indians of Arizona investigated the 
relationship between a mother’s diabetic status and the number of 
birth defects in her children. The results appear in the two-way 


table. 
Diabetic status 
Nondiabetic | Prediabetic | Diabetic 
: None 38 
Number of birth 
defects One or 31 13 9 
more 


a. What proportion of the women in this study had a child with 
one or more birth defects? 


b. What percent of the women in this study were diabetic or 
prediabetic, and had a child with one or more birth defects? 


c. Make a segmented bar graph to display the distribution of 
number of birth defects for the women with each of the three 
diabetic statuses. 


d. Describe the nature of the association between mother’s 
diabetic status and number of birth defects for the women in this 
study. 


T1.13 The back-to-back stemplot shows the lifetimes of several Brand X 
and Brand Y batteries. 
Brand X 


2 Key: 4\2 represents 
6 420-429 hours. 


223334 
56889 
0 


7 
4 
2 
2 
3 
3 
4 
4 
5 
5 


a. What is the longest that any battery lasted? 


b. Give a reason someone might prefer a Brand X battery. 
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c. Give areason someone might prefer a Brand Y battery. 


T1.14 Catherine and Ana suspect that athletes (i.e., students who have 
been on at least one varsity team) typically have a faster reaction 
time than other students. To test this theory, they gave an online 
reflex test to 33 varsity athletes at their school and 29 other 
students. Here are parallel boxplots and numerical summaries of 
the data on reaction times (in milliseconds) for the two groups of 
students. Write a few sentences comparing the distribution of 
reaction time for the two types of students. 


Student 


T T T T T T T 
150 200 250 300 350 400 450 500 


Reaction time (msec) 


Student n Mean StDev Min Qi Med Q3 Max 


Other 29) 297.3 = 65.9 197.0 255.0 292.0 325.0 478.0 
Athlete 33 270.1 97.7 189.6 236.0 261.0 300.0 398.0 
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Chapter 1 Project American Community 
Survey 


Each month, the U.S. Census Bureau selects a random sample of about 
300,000 U.S. households to participate in the American Community 
Survey (ACS). The chosen households are notified by mail and invited to 
complete the survey online. The Census Bureau follows up on any 
uncompleted surveys by phone or in person. Data from the ACS are used 
to determine how the federal government allocates over $400 billion in 
funding for local communities. 

The file acs survey ch1 project.xls, which can be accessed from the 
book’s website at highschool.bfwpub.com/tps6e, contains data for 3000 
randomly selected households in one month’s ACS survey. Download the 
file to a computer for further analysis using the application specified by 
your teacher. 

Each row in the spreadsheet describes a household. A serial number 
that identifies the household is in the first column. The other columns 
contain values of several variables. See the code sheet on the book’s 
website for details on how each variable is recorded. Note that all the 
categorical variables have been coded to have numerical values in the 
spreadsheet. 


Use the files provided to answer the following questions. 


. How many variables are recorded? Classify each one as categorical or 
quantitative. 


’, Examine the distribution of location (division or region) for the 
households in the sample. Make a bar graph to display the data. Then 
calculate numerical summaries (counts, percents, or proportions). 
Describe what you see. 


3. Explore the relationship between two categorical variables of interest to 
you. Summarize the data in a two-way table. Then calculate appropriate 
conditional relative frequencies and make a side-by-side or segmented 
bar graph. Write a few sentences comparing the distributions. 
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|. Analyze the distribution of household income (HINCP) using 
appropriate graphs and numerical summaries. 


». Compare the distribution of a quantitative variable that interests you in 
two or more groups. For instance, you might compare the distribution of 
number of people in a family (NPF) by region. Make appropriate graphs 
and calculate numerical summaries. Then write a few sentences 
comparing the distributions. 
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Chapter 2 Modeling 


Distributions of Data 
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AfriPics.com/Alamy 


ta 
LCi 


231 


Chapter 2 Review Exercises 
Chapter 2 AP® Statistics Practice Test 
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INTRODUCTION 


Suppose Emily earns 43 out of 50 points on a statistics test. Should she be 
satisfied or disappointed with her performance? That depends on how her 
score compares with the scores of the other students who took the test. If 
43 is the highest score, Emily might be very pleased. Maybe her teacher 
will “scale” the grades so that Emily’s 43 becomes an “A.” But if Emily’s 
43 falls below the class average, she may not be so happy. 

Section 2.1 focuses on describing the location of an individual within a 
distribution. We begin by discussing a familiar measure of location: 
percentiles. Next, we introduce a new type of graph that is useful for 
displaying percentiles. Then we consider another way to describe an 
individual’s location that is based on the mean and standard deviation. In 
the process, we examine the effects of transforming data on the shape, 
center, and variability of a distribution. 

Sometimes it is helpful to use graphical models called density curves to 
describe the location of individuals within a distribution, rather than 
relying on actual data values. Such models are especially helpful when 
data fall in a bell-shaped pattern called a Normal distribution. Section 2.2 
examines the properties of Normal distributions and shows you how to 
perform useful calculations with them. 


ACTIVITY Where do I stand? 


In this activity, you and your classmates will explore ways to describe 
where you stand (literally!) within a distribution. 
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1. Your teacher will mark out a number line on the floor with a scale 
running from about 58 to 78 inches. 


2. Make a human dotplot. Each member of the class should stand at the 
appropriate location along the number line scale based on height (to 
the nearest inch). 


3. Your teacher will make a copy of the dotplot on the board for your 
reference. Describe the class’s distribution of heights. 


4. What percent of the students in the class have heights less than yours? 
This percentile is one way to measure your location in the distribution 
of heights. 


5. Calculate the mean and standard deviation of the class’s distribution 
of height from the dotplot. Confirm these values with your 
classmates. 


6. Does your height fall above or below the mean? How far above or 
below the mean is it? How many standard deviations above or below 
the mean is it? This standardized score (also called a z-score) is 
another way to measure your location in the class’s distribution of 
heights. 


7. Class discussion: What would happen to the class’s distribution of 
height if you converted each data value from inches to centimeters? 
(There are 2.54 centimeters in 1 inch.) How would this change of 
units affect the shape, center, variability, and the measures of location 
(percentile and z-score) that you calculated? 


Want to know more about where you stand—in terms of height, 
weight, or even body mass index? Do a web search for “Clinical Growth 


234 


Charts” at the National Center for Health Statistics site, 
www.cdc.gov/nchs. 


Za0 


SECTION 2.1 Describing Location in a Distribution 


LEARNING TARGETS By the end of the section, you should be able to: 


e Find and interpret the percentile of an individual value in a 
distribution of data. 


e Estimate percentiles and individual values using a cumulative 
relative frequency graph. 


e Find and interpret the standardized score (z-score) of an individual 
value in a distribution of data. 


e Describe the effect of adding, subtracting, multiplying by, or 
dividing by a constant on the shape, center, and variability of a 
distribution of data. 


There are 25 students in Mr. Tabor’s statistics class. He gives them a first 
test worth 50 points. Here are the students’ scores: 


35 18 37 38 42 41 25 37 36 32 12 43 #31 
29 32 48 44 45 38 40 45 38 38 40 22 


The score marked in red is Emily’s 43. How did she perform on this test 
relative to her classmates? 


Figure 2.1 displays a dotplot of the class’s test scores, with Emily’s 
score marked in red. The distribution is skewed to the left with some 
possible low outliers. From the dotplot, we can see that Emily’s score is 
above the mean (balance point) of the distribution. We can also see that 
Emily did better on the test than most other students in the class. 


Test score 
FIGURE 2.1 Dotplot of scores (out of 50 points) on Mr. Tabor’s first statistics 
test. Emily’s score of 43 is marked in red. 
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Measuring Location: Percentiles 


One way to describe Emily’s location in the distribution of test scores is to 
calculate her percentile. 


DEFINITION Percentile 


An individual’s percentile is the percent of values in a distribution 
that are less than the individual’s data value. 


Using the dotplot, we see that Emily’s 43 is the fifth highest score in 
the class. Because 20 of the 25 observations (80%) are below her score, 
Emily is at the 80th percentile in the class’s test score distribution. 


@ Be careful with your language when describing percentiles. 


Percentiles are specific locations in a distribution, so an observation isn’t 
“in” the 80th percentile. Rather, it is “at” the 80th percentile. 


EXAMPLE | Mr. Tabor’s first test (») 


Finding and interpreting percentiles 


antoniodiaz/Shutterstock.com 


PROBLEM: Refer to the dotplot of 25 scores on Mr. Tabor’s first 
statistics test. 


Test score 


23/7 


a. Find the percentile for Jacob, who scored 18 on the test. 


b. Maria’s test score is at the 48th percentile of the distribution. 
Interpret this value in context. What score did Maria earn? 


SOLUTION: 


a. 1/25 = 0.04, so Jacob scored at the 4th percentile on this test. 


Only 1 of the 25 scores in the class is less than Jacob’s 18. 


b. 48% of students in the class earned a lower test score than Maria. 
Because (0.48)(25)=12(9-48) (25) = 12 Maria scored higher than 12 
of the 25 students. Maria earned a 38 on the test. 


Three other students in the class scored a 38 on the test. These 


students’ scores are also at the 48th percentile because 12 of the 25 


students in the class earned lower scores. 


FOR PRACTICE, TRY EXERCISE 1 


Note: Some people define the pth percentile of a distribution as the 
value with p percent of observations less than or equal to it. Using this 
alternative definition of percentile, it is possible for an individual to fall at 
the 100th percentile. If we used this definition, Jacob’s score of 18 would 
fall at the 8th percentile (2 of 25 scores were less than or equal to 18). 
Calculating percentiles is not an exact science, especially with small data 
sets! 


The median of a distribution is roughly the 50th percentile. For 
instance, 38 is the median score on Mr. Tabor’s first test. As you saw in 
part (b) of the example, Maria’s score of 38 put her at the 48th percentile 
of the distribution. The first quartile Q; is roughly the 25th percentile of a 
distribution because it separates the lowest 25% of values from the upper 
75%. Likewise, the third quartile Q3 is roughly the 75th percentile. 


og A high percentile is not always a good thing. For example, a 
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man whose cholesterol level is at the 90th percentile for his age group may 
need treatment for his high cholesterol! 


Cumulative Relative Frequency Graphs 


There are some interesting graphs that can be made with percentiles. One 
of the most common graphs starts with a frequency table for a quantitative 
variable. For instance, the frequency table on the next page summarizes 
the ages of the first 45 U.S. presidents when they took office. 


Age Frequency 
40-44 2 
45-49 7 
50-54 13 
59-59 12 
60-64 
65-69 
70-74 


Let’s expand this table to include columns for relative frequency, 
cumulative frequency, and cumulative relative frequency. 


e To get the values in the relative frequency column, divide the count in 
each interval by 45, the total number of presidents. Multiply by 100 to 
convert to a percent. 


e To fill in the cumulative frequency column, add the counts in the 
frequency column for the current interval and all intervals with smaller 
values of the variable. 


e For the cumulative relative frequency column, divide the entries in the 
cumulative frequency column by 45, the total number of presidents. 
Multiply by 100 to convert to a percent. 


Here is the original frequency table augmented with the relative 
frequency, cumulative frequency, and cumulative relative frequency 
columns: 


Cumulative Cumulative relative 
Age Frequency’ Relativefrequency frequency frequency 
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40-44 2 2/45 = 0.044 = 2 2/45 = 0.044 = 


4.4% 4.4% 
45-49 7 7/45 = 0.156 = a 9/45 = 0.200 = 
15.6% 20.0% 
50-54 13 13/45 = 0.289 = 22 22/45 = 0.489 = 
28.9% 48.9% 
55-59 T2 12/45 = 0.267 = 34 34/45 = 0.756 = 
26.7% 75.6% 
60-64 7 7/45 = 0.156 = 41 41/45 = 0.911 = 
15.6% 91.1% 
65-69 3 3/45 = 0.067 = 44 44/45 = 0.978 = 
6.7% 97.8% 
70-74 1 1/45 = 0.022 = 45 45/45 = 1.000 = 
2.2% 100% 


Now we can make a cumulative relative frequency graph. 


DEFINITION Cumulative relative frequency graph 


A cumulative relative frequency graph plots a point corresponding 
to the cumulative relative frequency in each interval at the smallest 
value of the next interval, starting with a point at a height of 0% at the 
smallest value of the firstinterval. Consecutive points are then 
connected with a line segment to form the graph. 


Some people refer to cumulative relative frequency graphs as ogives (pronounced “o- 


jives”). 


Figure 2.2 shows the completed cumulative relative frequency graph 
for the presidential age at inauguration data. Notice the following details: 


e The leftmost point is plotted at a height of 0% at age=408e = 40) the 
smallest value in the first interval. This point tells us that none of the 
first 45 U.S. presidents took office before age 40. 


e The next point to the right is plotted at a height of 4.4% at age=45. 
age = 45. This point tells us that 4.4% of these presidents (i.e., two of 
them) were inaugurated before they were 45 years old. 
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e The graph grows very gradually at first because few presidents were 
inaugurated when they were in their 40s. Then the graph gets very steep 
beginning at age 50 because most U.S. presidents were in their 50s 
when they were inaugurated. The rapid growth in the graph slows at age 
60. 


e The rightmost point on the graph is plotted above age 75 and has 
cumulative relative frequency 100%. That’s because 100% of these U.S. 
presidents took office before age 75. 


_* 


— 
S 
—) 
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Cumulative relative frequency (%) 


Cee oe T T CC" T 
40 45 50 55 60 65 70 75 


Age at inauguration (years) 
FIGURE 2.2 Cumulative relative frequency graph of the ages of U.S. presidents 
when they took office. 


A cumulative relative frequency graph can be used to describe the 
position of an individual within a distribution or to locate a specified 
percentile of the distribution. 


EXAMPLE | Ages of U.S. presidents (>) 


Interpreting a cumulative relative frequency graph 
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PROBLEM: Use the graph in Figure 2.2 to help you answer each 
question. 


a. Was Barack Obama, who was first inaugurated at age 47, unusually 
young? 


b. Estimate and interpret the 65th percentile of the distribution. 


SOLUTION: 


Cumulative relative frequency (%) 


40 45 az 50 55 60 65 70 75 


Age at inauguration (years) 


Barack Obama’s inauguration age places him at about the 11th 
percentile. About 11% of the first 45 U.S. presidents first took office 
at a younger age than Obama did. So Obama was fairly young, but 
not unusually young, when he took office. 


To find President Obama’s location in the distribution, draw a 
vertical line up from his age (47) on the horizontal axis until it meets 
the graph. Then draw a horizontal line from this point to the vertical 
axis. 
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Cumulative relative frequency (%) 


I i.9 1 


40 45 «SO SSop 60 65 FO FF 
Age at inauguration (years) 


The 65th percentile is about 58 years old. About 65% of the first 45 
U.S. presidents were younger than 58 when they took office. 


The 65th percentile of the distribution is the age with cumulative 
relative frequency 65%. To find this value, draw a horizontal line 


across from the vertical axis at a height of 65% until it meets the 
graph. Then draw a vertical line from this point down to the 
horizontal axis. 


FOR PRACTICE, TRY EXERCISE 9 


CHECK YOUR UNDERSTANDING 


(\ 


7 


J 


1. Multiple choice: Select the best answer. Mark receives a score report 
detailing his performance on a statewide test. On the math section, 
Mark earned a raw score of 39, which placed him at the 68th 
percentile. This means that 


a. Mark did better than about 39% of the students who took the test. 
b. Mark did worse than about 39% of the students who took the test. 
c. Mark did better than about 68% of the students who took the test. 
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d. Mark did worse than about 68% of the students who took the test. 
e. Mark got more than half of the questions correct on this test. 


2. Mrs. Munson is concerned about how her daughter’s height and 
weight compare with those of other girls of the same age. She uses an 
online calculator to determine that her daughter is at the 87th 
percentile for weight and the 67th percentile for height. Explain to 
Mrs. Munson what these values mean. 


Questions 3 and 4 relate to the following setting. The graph displays 
the cumulative relative frequency of the lengths of phone calls made 
from the mathematics department office at Gabalot High last month. 


S 
S 


foe] 
So 


g 


Cumulative relative frequency (%) 


Call length (min) 


3. About what percent of calls lasted less than 30 minutes? 30 minutes 
or more? 


4. Estimate Q), Qs, and the IQR of the distribution of phone call length. 


Measuring Location: z-Scores 


A percentile is one way to describe the location of an individual in a 
distribution of quantitative data. Another way is to give the standardized 
score (z-score) for the observed value. 


DEFINITION Standardized score (z-score) 


The standardized score (z-score) for an individual value in a 
distribution tells us how many standard deviations from the mean the 
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value falls, and in what direction. To find the standardized score (z- 
score), compute 


value — mean 


*» 


“~~ 


z=value — meanstandard deviation standard deviation 


Values larger than the mean have positive z-scores. Values smaller 
than the mean have negative z-scores. 


Let’s return to the data from Mr. Tabor’s first statistics test. Figure 2.3 
shows a dotplot of the data, along with numerical summaries. 


Test score 
n X s, Min Q; Med Q; Max 
25 35.44 8.77 12 315 38 415 48 


FIGURE 2.3 Dotplot and summary statistics of the scores on Mr. Tabor’s first 
statistics test. Emily’s score of 43 is marked in red on the dotplot. 


The relationship between the mean and the median is about what you’d expect in 


this left-skewed distribution. 


Where does Emily’s 43 (marked in red on the dotplot) fall in the 
distribution? Her standardized score (z-score) is 


z=value — meanstandard deviation=43 — 35.448.77=0.86 
value — mean _ 43 — 35.44 © 0.86 


z standarddeviation 8.77 


yA 
“~ 


That is, Emily’s test score is 0.86 standard deviations above the mean 
score of the class. 


EXAMPLE |Mr. Tabor’s first test, again (») 


Finding and interpreting z-scores 
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PROBLEM: Use Figure 2.3 to answer each of the following questions. 
a. Find and interpret the z-score for Jacob, who earned an 18 on the test. 


b. Tamika had a standardized score of 0.292. Find Tamika’s test score. 


SOLUTION: 


_ 18-3544 99 
a. Z=18 — 35.448.77=-1.99 8.77 


Jacob’s test score is 1.99 standard deviations below the class mean of 
35.44, 


Be sure to interpret the magnitude (number of standard deviations) 


and direction (greater than or less than the mean) of a z-score in 
context. 


b. 0.292=value — 35.448.770.292(8.77)+35.44=value38 = value 


value — 35.44 


0.292 = 

8.77 
0.292(8.77) + 35.44 = value 
38 = value 


Tamika’s test score was 38. 


Be sure to show your work when finding the value that corresponds 


to a given z-score. 


FOR PRACTICE, TRY EXERCISE 13 
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We often standardize observed values to express them on a common 


scale. For example, we might compare the heights of two children of 
different ages or genders by calculating their z-scores. 


At age 9, Jordan is 55 inches tall. Her height puts her at a z-score of 1. 
That is, Jordan is 1 standard deviation above the mean height of 9-year- 
old girls. 


Zayne’s height at age 11 is 58 inches. His corresponding z-score is 0.5. 
In other words, Zayne is 1/2 standard deviation above the mean height 
of 11-year-old boys. 


Even though Zayne’s height is larger, Jordan is taller relative to girls her 
age than Zayne is relative to boys his age. The standardized heights tell us 
where each child stands (pun intended!) in the distribution for his or her 
age group. 


CHECK YOUR UNDERSTANDING 


( 
J 


1. Mrs. Navard’s statistics class has just completed the first three steps 


of the “Where do I stand?” activity (page 90). The figure shows a 
dotplot of the distribution of height for the class, along with summary 
statistics from computer output. 


: . 
60 62 64 66 68 70 72 74 
Height (in.) 


» Min Q; Med Q; Max 


Variable nm <X Ss 
Height 25 67 4.29 60 63 66 #70 75 


Lynette, a student in the class, is 62 inches tall. Find and interpret her 
Z-SCOTre. 


. Brent is a member of the school’s basketball team and is 74 inches 


tall. The mean height of the players on the team is 76 inches. Brent’s 
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height translates to a z-score of —0.85 in the team’s distribution of 
height. What is the standard deviation of the team members’ heights? 


Transforming Data 


To find the standardized score (z-score) for an individual observation, we 
transform this data value by subtracting the mean and dividing the 
difference by the standard deviation. Transforming converts the 
observation from the original units of measurement (e.g., inches) to a 
standardized scale. 


There are other reasons to transform data. We may want to change the 
units of measurement for a data set from kilograms to pounds (1 kg*2.2 lb 
lkg = 2.21b), or from Fahrenheit to Celsius [ °C=59 (°F — 32) ]. 
|"C = $("F - 32)], oe 

: Or perhaps a measuring device is calibrated wrong, 
so we have to add a constant to each data value to get accurate 
measurements. What effect do these kinds of transformations—adding or 
subtracting; multiplying or dividing—have on the shape, center, and 
variability of a distribution? 


EFFECT OF ADDING OR SUBTRACTING A CONSTANT Recall 
that Mr. Tabor gave his class of 25 statistics students a first test worth 50 
points. Here is a dotplot of the students’ scores along with some numerical 
summaries. 


Test score 


n xX Sy Min Q; Med Q; Max 
25 35.44 877 12 315 38 41.5 48 


Suppose Mr. Tabor was nice and added 5 points to each student’s test 
score. How would this affect the distribution of scores? Figure 2.4 shows 
graphs and numerical summaries for the original test scores and adjusted 
scores. 
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Original 


Adjusted we J e e ee ed @eeeees e 
10 15 20 25 30 35 40 45 50 55 
Test score 
n x Ss; Min Q, Med Q; Max JOR Range 


Original 25 35.44 877 12 315 38 415 48 10 36 

Adjusted 25 4044 877 17 365 43 465 53 10 36 
FIGURE 2.4 Dotplots and summary statistics for the original scores and adjusted 
scores (with 5 points added) on Mr. Tabor’s statistics test. 


From both the graph and summary statistics, we can see that the 
measures of center (mean and median) and location (min, Q), Q3, and 


max) increased by 5 points. The shape of the distribution did not change. 
Neither did the variability of the distribution—the range, the standard 
deviation, and the interquartile range (IQR) all stayed the same. 


As this example shows, adding the same positive number to each value 
in a data set shifts the distribution to the right by that number. Subtracting 
a positive constant from each data value would shift the distribution to the 
left by that constant. 


THE EFFECT OF ADDING OR SUBTRACTING A 
CONSTANT 


Adding the same positive number a to (subtracting a from) each 
observation: 


e Adds a to (subtracts a from) measures of center and location (mean, 
five-number summary, percentiles) 


e Does not change measures of variability (range, JQR, standard 
deviation) 


e Does not change the shape of the distribution 


EXAMPLE | How wide is this room? (>) 
Effect of adding/subtracting a constant 
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PROBLEM: Soon after the metric system was introduced in Australia, 
a group of students was asked to guess the width of their classroom to 
the nearest meter. A dotplot of the data and some numerical summaries 


are shown. 


The actual width of the room was 13 meters. We can examine the 
distribution of students’ errors by defining a new variable as follows: 
error=guess—13.etror = guess — 13. Note that a negative value for error 
indicates that a student’s guess for the width of the room was too small. 


0 10 20 30 40 
Guess (m) 


n x Ss, Min Q, Med Q; Max JOR Range 
Guess(m) 44 16.02 714 8 11 15 17 40° 6 32 


a. What shape does the distribution of error have? 
b. Find the mean and the median of the distribution of error. 


c. Find the standard deviation and interquartile range (IQR) of the 
distribution of error. 


SOLUTION: 


a. The same shape as the original distribution of guesses: skewed to the 
right with two distinct peaks. 


b. Mean:16.02—13=3.02 meters; Median: 15-13=2 meters. 
Mean : 16.02 — 13 = 3.02 meters; Median : 15 — 13 = 2 meters. 
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c. The same as for the original distribution of guesses: 


Standard deviation=7.14 meters, IQR=6 meters 
Standard deviation = 7.14 meters, IQR = 6 meters 


It is not a surprise that the mean is greater than the median in this right- 


skewed distribution. 


FOR PRACTICE, TRY EXERCISE 21 


Figure 2.5 confirms the results of the example. 


Guess (m) 


n x Ss; Min Q,; Med Q; Max JOR Range 
Guess(m) 44 16.02 7.14 8 11 15 17 £440 6 32 
Error(m) 44 3.02 7.14 -5 2 2 4 27 6 32 
FIGURE 2.5 Dotplots and summary statistics for the Australian students’ guesses 
of classroom width and the errors in their guesses, in meters. 


What about outliers? You can check that the four highest guesses— 
which are 27, 35, 38, 40 meters—are outliers by the 1.5xIQRI-9 x IQR 
rule. The same individuals will still be outliers in the distribution of error, 
but each of their values will be decreased by 13 meters: 14, 22, 25, and 27 
meters. 


EFFECT OF MULTIPLYING OR DIVIDING BY A CONSTANT 
Suppose that Mr. Tabor wants to convert his students’ adjusted test scores 
to percents. Because the test was out of 50 points, he should multiply each 
score by 2 to be counted out of 100 points instead. Here are graphs and 
numerical summaries for the adjusted scores and the doubled scores: 
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Adjusted a & ee @ ee cese seecee 6 


Doubled o 6 e o 8 se ee 
10 20 30 40 50 60 70 80 100 110 
Test score 


n x Sy Min Q; Med @Q; Max JOR Range 


Adjusted 25 4044 877 17 365 43 465 53 10 36 
Doubled 25 80.88 17.54 34 73 86 93 106 20 72 


From the graphs and summary statistics, we can see that the measures 
of center, location, and variability have all doubled, just like the individual 
observations. But the shape of the two distributions is the same. 


EFFECT OF MULTIPLYING OR DIVIDING BY A 
CONSTANT 


Multiplying (or dividing) each observation by the same positive number 
b: 


e Multiplies (divides) measures of center and location (mean, five- 


number summary, percentiles) by b 


e Multiplies (divides) measures of variability (range, IQR, standard 
deviation) by b 


e Does not change the shape of the distribution 


It is not common to multiply (or divide) each observation in a data set by a negative 
number b. Doing so would multiply (or divide) the measures of variability by the 
absolute value of b. We can’t have a negative amount of variability! Multiplying or 
dividing by a negative number would also affect the shape of the distribution, as all 


values would be reflected over the y axis. 


EXAMPLE | How far off were our guesses? » 
Effect of multiplying/dividing by a constant 
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PROBLEM: Refer to the preceding example. The graph and numerical 
summaries describe the distribution of the Australian students’ guessing 
errors (in meters) when they tried to estimate the width of their 
classroom. 


T T T T T T T T 
-5 0 5 10 aS 20: 2: <0 


Error (m) 


n x Ss; Min Q,; Med Q; Max JQR_ Range 
44 3.02 714 -5 2 2 4 27 6 32 


Because the students are having some difficulty with the metric system, 
it may not be helpful to tell them that their guesses tended to be about 2 
meters too high. Let’s convert the error data to feet before we report 
back to them. 


There are roughly 3.28 feet in a meter. For the student whose error was 
—5 meters, that translates to 


.28 feet 
—5 meters x atid at —16.4 feet 


—5 metersx3.28 feet1 meter=—16.4 feet 1 meter 


To change the units of measurement from meters to feet, we multiply 
each of the error values by 3.28. 


a. What shape does the resulting distribution of error have? 
b. Find the median of the distribution of error in feet. 


c. (c) Find the interquartile range (IQR) of the distribution of error in 
feet. 


SOLUTION: 
a. The same shape as the original distribution of guesses: skewed to the 
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right with two distinct peaks. 
b. Median=2x3.28=6.56 feetMedian = 2 x 3.28 = 6.56 feet 
c. IQR=6x3.28=19.68 feet1QR = 6 x 3.28 = 19.68 feet 


FOR PRACTICE, TRY EXERCISE 25 


Figure 2.6 confirms the results of the example. 


nex Sy Min Q; Med Q; Max JOR Range 
Error(m) 44 3.02 714 5 —2 2 4 27 6 32 
Error (ft) 44 9.91 23.42 -164 -656 656 13.12 8856 19.68 104.96 


FIGURE 2.6 Dotplots and summary statistics for the errors in Australian students’ 
guesses of classroom width, in meters and in feet. 


PUTTING IT ALL TOGETHER: ADDING/SUBTRACTING AND 
MULTIPLYING/DIVIDING What happens if we transform a data set by 
both adding or subtracting a constant and multiplying or dividing by a 
constant? We just use the facts about transforming data that we’ve already 
established. 


EXAMPLE |Too cool at the cabin? » 


Analyzing the effects of transformations 
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gulfix/Getty Images 


PROBLEM: During the winter months, the temperatures at the 
Starnes’s Colorado cabin can stay well below freezing (32°F, or 0°C) 
for weeks at a time. To prevent the pipes from freezing, Mrs. Starnes 
sets the thermostat at 50°F. She also buys a digital thermometer that 
records the indoor temperature each night at midnight. Unfortunately, 
the thermometer is programmed to measure the temperature in degrees 
Celsius. Following are a dotplot and numerical summaries of the 
midnight temperature readings for a 30-day period. 


Use the fact that °F=(9/5)°C+32 F = (9/5) “C + 32 to help you answer 
the following questions. 


a. Find the mean temperature in degrees Fahrenheit. Does the 
thermostat setting seem accurate? 


b. Calculate the standard deviation of the temperature readings in 
degrees Fahrenheit. Interpret this value in context. 


e 
° ee a 
T T 


I} 
0 2 4 6 8 10 12 14 16 
Temperature (°C) 


n Mean SD Min Q; Med Q; Max 
30 843 2.27 3.00 7.00 8.50 10.00 14.00 


SOLUTION: 


a. Mean=(9/5)(8.43)+32=47.17 FMean = (9/5) (8.43) + 32 = 47.17 °F | 
The thermostat doesn’t seem to be very accurate. It is set at 50°F, but 
the mean temperature over the 30-day period is about 47°F. 


— SSS! 
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Multiplying each observation by 9/5 multiplies the standard deviation 
by 9/5. However, adding 32 to each observation doesn’t affect the 
variability. 


b. SD=(9/5)(2.27)=4.09°F, 5D = (9/5) (2.27) = 4.09°F. The temperature 
readings typically vary from the mean by about 4°F. That’s a lot of 
variation! 


FOR PRACTICE, TRY EXERCISE 29 


Many other types of transformations can be very useful in analyzing 
data. We have only studied what happens when you transform data by 
adding, subtracting, multiplying, or dividing by a constant. 


CONNECTING TRANSFORMATIONS AND z-SCORES What 
happens if we standardize all the values in a distribution of quantitative 
data? Here is a dotplot of the original test scores for the 25 students in Mr. 
Tabor’s statistics class, along with some numerical summaries: 


r n x s 
10 15 20 25 30 35 40 45 50 25 35.44 8.77 
Test score 


We calculate the z-score for each student using 


score — 35.44 


z=score — 35.448.77 8.77 
In other words, we subtract 35.44 from each student’s test score and then 
divide by 8.77. What effect would these transformations have on the 
shape, center, and variability of the distribution? 


Here is a dotplot of the class’s z-scores. Let’s describe the distribution. 


° 

® ee 

2 e e s a ee eese 
I I | I I I 


T 
3.0 2.5 2.0 1.5 1.0 05 0.0 0.5 1.0 1.5 2.0 
z-score 


e Shape: The shape of the distribution of z-scores is the same as the 
shape of the original distribution of test scores—skewed to the left. 
Neither subtracting a constant nor dividing by a constant would change 
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the shape of the graph. 


e Center: The mean of the distribution of z-scores is 0. Subtracting 35.44 
from each test score would reduce the mean from 35.44 to 0. Dividing 
each of these new data values by 8.77 would divide the new mean of 0 
by 8.77, which still yields a mean of 0. 


e Variability: The standard deviation of the distribution of z-scores is 1. 
Subtracting 35.44 from each test score does not affect the standard 
deviation. However, dividing all of the resulting values by 8.77 would 
divide the original standard deviation of 8.77 by 8.77, yielding 1. 


CHECK YOUR UNDERSTANDING 


( 


», Knoebels Amusement Park in Elysburg, Pennsylvania, has earned 
acclaim for being an affordable, family-friendly entertainment venue. 
Knoebels does not charge for general admission or parking, but it does 
charge customers for each ride they take. How much do the rides cost at 
Knoebels? The figure shows a dotplot of the cost for each of 22 rides in 
a recent year, along with summary statistics. 


s 
e e a 
T T T T = 


2.00 2.25 2.50 2.75 3.00 
Cost ($) 


~) 4eeeece 
t 


e 
N 
VW 

un 
i) 
a 
wn 


n Mean SD Min Q,; Median Q3; Max 
22 1.705 0.447 1.25 15 LS 175 3 


1. Suppose you convert the cost of the rides from dollars to cents 
($1=100 cents), (51 = 100 cents) - Describe the shape, mean, and 
standard deviation of the distribution of ride cost in cents. 


2. Knoebels’ managers decide to increase the cost of each ride by 25 
cents. How would the shape, center, and variability of this 
distribution compare with the distribution of cost in Question 1? 


3. Now suppose you convert the increased costs from Question 2 to z- 
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scores. What would be the shape, mean, and standard deviation of this 
distribution? Explain your answers. 


Section 2.1 Summary 


e Two ways of describing an individual’s location in a distribution are 
percentiles and standardized scores (z-scores). An individual’s 
percentile is the percent of the distribution that is less than the 
individual’s data value. 


e To standardize any data value, subtract the mean of the distribution and 
then divide the difference by the standard deviation. The resulting z- 
score 


value — mean 


z=value — meanstandard deviation standard deviation 


measures how many standard deviations the data value lies above or 
below the mean of the distribution. We can also use percentiles and z- 
scores to compare the relative location of individuals in different 
distributions. 


e A cumulative relative frequency graph allows us to examine location 
in a distribution. The completed graph allows you to estimate the 
percentile for an individual value, and vice versa. 


e It is necessary to transform data when changing units of measurement. 


» When you add a positive constant a to (subtract a from) all the values 
in a data set, measures of center and location—mean, five-number 
summary, percentiles—increase (decrease) by a. Measures of 
variability—range, IQR, SD—do not change. 


« When you multiply (divide) all the values in a data set by a positive 
constant b, measures of center, location, and variability are multiplied 
(divided) by b. 


. Neither of these transformations changes the shape of the distribution. 
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Section 2.1 Exercises 


1. pg 92 |») Shoes How many pairs of shoes does a typical teenage boy 


own? To find out, two AP® Statistics students surveyed a random 
sample of 20 male students from their large high school. Then they 
recorded the number of pairs of shoes that each boy owned. Here is a 
dotplot of the data: 


10 15 20 25 30 35 40 
Number of pairs of shoes 


a. Find the percentile for Jackson, who reported owning 22 pairs of 
shoes. 


b. Raul’s reported number of pairs of shoes is at the 45th percentile of 
the distribution. Interpret this value. How many pairs of shoes does 
Raul own? 
2. Old folks Here is a stemplot of the percents of residents aged 65 and 
older in the 50 states: 


71] 0 Key: 15|2 means 15.2% of 
8 | 8 this state’s residents are 65 
918 or older. 

10 |} 019 

UT 1 46722 


12 | 01122456778999 
13 | 0001223344455689 


14 | 023568 
15 | 24 
16|,9 


a. Find the percentile for Colorado, where 10.1% of the residents are 
aged 65 and older. 


b. Rhode Island is at the 80th percentile of the distribution. Interpret 
this value. What percent of Rhode Island’s residents are aged 65 
and older? 

3. Wear your helmet! Many athletes (and their parents) worry about the 
risk of concussions when playing sports. A football coach plans to 
obtain specially made helmets for his players that are designed to 
reduce the chance of getting a concussion. Here are the measurements 
of head circumference (in inches) for the players on the team: 


200 


23.0 2c.2 21.7 22.0 22.3 22.6 22.7 21.5 
22./ 25.6 20.8 23.0 24.2 23.5 20.8 24.0 


22. 22.6 2o:0 22.5 23.1 21.9 21.0 22.4 
23.5 225 23.9 23.4 21.6 23.3 


a. Antawn, the team’s starting quarterback, has a head circumference 
of 22.4 inches. What is Antawn’s percentile? 


b. Find the head circumference of the player at the 90th percentile of 


the distribution. 


4. Don’t call me According to a study by Nielsen Mobile, “Teenagers 


6. 


re 


ages 13 to 17 are by far the most prolific texters, sending 1742 
messages a month.” Mr. Williams, a high school statistics teacher, was 
skeptical about the claims in the article. So he collected data from his 
first-period statistics class on the number of text messages they had 
sent over the past 24 hours. Here are the data: 


a. Sunny was the student who sent 42 text messages. What is Sunny’s 
percentile? 


b. Find the number of text messages sent by Joelle, who is at the 16th 


percentile of the distribution. 
Setting speed limits According to the Los Angeles Times, speed limits 
on California highways are set at the 85th percentile of vehicle speeds 
on those stretches of road. Explain to someone who knows little 
Statistics what that means. 
Blood pressure Larry came home very excited after a visit to his 
doctor. He announced proudly to his wife, “My doctor says my blood 
pressure is at the 90th percentile among men like me. That means I’m 
better off than about 90% of similar men.” How should his wife, who 
is a Statistician, respond to Larry’s statement? 
Growth charts We used an online growth chart to find percentiles for 
the height and weight of a 16-year-old girl who is 66 inches tall and 
weighs 118 pounds. According to the chart, this girl is at the 48th 
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8. 


9. 


percentile for weight and the 78th percentile for height. Explain what 
these values mean in plain English. 

Run fast Peter is a star runner on the track team. In the league 
championship meet, Peter records a time that would fall at the 80th 
percentile of all his race times that season. But his performance places 
him at the 50th percentile in the league championship meet. Explain 
how this is possible. (Remember that shorter times are better in this 
case!) 

pg 94 |») Run fast! As part of a student project, high school 


students were asked to sprint 50 yards and their times (in seconds) 
were recorded. A cumulative relative frequency graph of the sprint 
times is shown here. 


100 4 2 


0 a as a ee 
5.8 63 68 7.3 78 83 88 93 9.8 10.3 
Sprint time (sec) 


Cumulative relative frequency (%) 


a. One student ran the 50 yards in 8 seconds. Is a sprint time of 8 
seconds unusually slow? 


b. Estimate and interpret the 20th percentile of the distribution. 


10. Household incomes The cumulative relative frequency graph 


describes the distribution of median household incomes in the 50 states 


in a recent year.2 


100 = 


0 | caer lee emma cams’) amar Fame] Ean 
35 40 45 50 55 60 65 70 75 
Median household income ($1000s) 


Cumulative relative frequency (%) 
3 
l 
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a. The median household income in North Dakota that year was 
$55,766. Is North Dakota an unusually wealthy state? 


b. Estimate and interpret the 90th percentile of the distribution. 

11. Foreign-born residents The cumulative relative frequency graph 
shows the distribution of the percent of foreign-born residents in the 
50 states. 


100 


Cumulative relative frequency (%) 


0 5 10 15 20 25 
Percent foreign-born residents 


a. Estimate the interquartile range (IQR) of this distribution. Show 
your method. 


b. What is the percentile for Arizona, which had 15.1% foreign-born 
residents that year? 


c. Explain why the graph is fairly flat between 20% and 27.5%. 


d. Draw the histogram that corresponds to this graph. 
12. Shopping spree The figure is a cumulative relative frequency graph of 
the amount spent by 50 consecutive grocery shoppers at a store. 


100 + 
90 4 a 


—— 


eed ie ode gle> eagle cle. aia il 
0 10 20 30 40 50 60 70 80 90 100 


Amount spent ($) 


Cumuhtive relative frequency (%) 


a. Estimate the interquartile range (IQR) of this distribution. Show 
your method. 
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b. What is the percentile for the shopper who spent $19.50? 
c. Explain why the graph is steepest between $10 and $30. 


d. Draw the histogram that corresponds to this graph. 
13. pg 96 |») Foreign-born residents Refer to Exercise 11. Here are 


summary statistics for the percent of foreign-born residents in the 50 


States: 
n Mean SD Min Qi Med Q3 Max 
50 8.73 6.12 1.3 4.1 6.2 13.4 27k 


a. Find and interpret the z-score for Montana, which had 1.9% foreign- 
born residents. 


b. New York had a standardized score of 2.10. Find the percent of 
foreign-born residents in New York at that time. 
14. Household incomes Refer to Exercise 10. Here are summary statistics 
for the state median household incomes: 


n Mean SD Min Qi Med Q3 Max 
50 51,742.44 8210.64 36,641 46,071 50,009 57,020 71,836 


a. Find and interpret the z-score for North Carolina, with a median 
household income of $41,553. 


b. New Jersey had a standardized score of 1.82. Find New Jersey’s 
median household income for that year. 
15. Shoes Refer to Exercise 1. Jackson, who reported owning 22 pairs of 
shoes, has a standardized score of z=1.102 = 1.10. 


a. Interpret this z-score. 


b. The standard deviation of the distribution of number of pairs of 
shoes owned in this sample of 20 boys is 9.42. Use this information 
along with Jackson’s z-score to find the mean of the distribution. 

16. Don’t call me Refer to Exercise 4. Alejandro, who sent 92 texts, has a 

standardized score of z=1.892 = 1.89. 


a. Interpret this z-score. 


b. The standard deviation of the distribution of number of text 
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messages sent over the past 24 hours by the students in Mr. 
Williams’s class is 34.15. Use this information along with 
Alejandro’s z-score to find the mean of the distribution. 

17. Measuring bone density Individuals with low bone density 
(osteoporosis) have a high risk of broken bones (fractures). Physicians 
who are concerned about low bone density in patients can refer them 
for specialized testing. Currently, the most common method for testing 
bone density is dual-energy X-ray absorptiometry (DEXA). The bone 
density results for a patient who undergoes a DEXA test usually are 
reported in grams per square centimeter (g/cm) and in standardized 
units. 

Judy, who is 25 years old, has her bone density measured using 
DEXA. Her results indicate bone density in the hip of 948 g/cm? and a 
standardized score of z=—1.45.2 = —1.49. The mean bone density in 
the hip is 956 g/cm? in the reference population of 25-year-old women 
like Judy.2 


a. Judy has not taken a statistics class in a few years. Explain to her in 
simple language what the standardized score reveals about her bone 
density. 


b. Use the information provided to calculate the standard deviation of 
bone density in the reference population. 

18. Comparing bone density Refer to Exercise 17. Judy’s friend Mary 
also had the bone density in her hip measured using DEXA. Mary is 
35 years old. Her bone density is also reported as 948 g/cm2, but her 
standardized score is z=0.50.2 = 0.50. The mean bone density in the 
hip for the reference population of 35-year-old women is 944 
grams/cm2. 


a. Whose bones are healthier for her age: Judy’s or Mary’s? Justify 
your answer. 


b. Calculate the standard deviation of the bone density in Mary’s 
reference population. How does this compare with your answer to 


Exercise 17(b)? Are you surprised? 
19. SAT versus ACT Eleanor scores 680 on the SAT Mathematics test. 


The distribution of SAT Math scores is symmetric and single-peaked 
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with mean 500 and standard deviation 100. Gerald takes the American 
College Testing (ACT) Mathematics test and scores 29. ACT scores 
also follow a symmetric, single-peaked distribution—but with mean 
21 and standard deviation 5. Find the standardized scores for both 
students. Assuming that both tests measure the same kind of ability, 
who has the higher score? 

20. Comparing batting averages Three landmarks of baseball 
achievement are Ty Cobb’s batting average of 0.420 in 1911, Ted 
Williams’s 0.406 in 1941, and George Brett’s 0.390 in 1980. These 
batting averages cannot be compared directly because the distribution 
of major league batting averages has changed over the years. The 
distributions are quite symmetric, except for outliers such as Cobb, 
Williams, and Brett. While the mean batting average has been held 
roughly constant by rule changes and the balance between hitting and 
pitching, the standard deviation has dropped over time. Here are the 


facts: 
Decade Mean Standard deviation 
1910s 0.266 0.0371 
1940s 0.267 0.0326 
1970s 0.261 0.0317 


Find the standardized scores for Cobb, Williams, and Brett. Who had 
the best performance for the decade he played? 
21. pg 99 |») Long jump A member of a track team was practicing the 


long jump and recorded the distances (in centimeters) shown in the 
dotplot. Some numerical summaries of the data are also provided. 


I 
565 570 575 580 585 
Long-jump distance (cm) 


n Mean SD Min Q, Med Q3 Max 
40 577.3 4.713 564 574.5 577 581.5 586 


After chatting with a teammate, the jumper realized that he measured 
his jumps from the back of the board instead of the front. Thus, he had 
to subtract 20 centimeters from each of his jumps to get the correct 
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22. 


23. 


24. 


measurement for each jump. 


a. What shape would the distribution of corrected long jump distance 
have? 


b. Find the mean and median of the distribution of corrected long- 
jump distance. 


c. Find the standard deviation and interquartile range (IQR) of the 
distribution of corrected long-jump distance. 

Step right up! A dotplot of the distribution of height for Mrs. 

Navard’s class is shown, along with some numerical summaries of the 

data. 


60 62 64 66 68 70 72 74 
Height (in.) 

n xX  §s, Min Q; Med Q3; Max 

25 67 4.29 60 63 66 70 75 


Suppose that Mrs. Navard has the entire class stand on a 6-inch-high 
platform and then asks the students to measure the distance from the 
top of their heads to the ground. 


a. What shape would this distribution of distance have? 
b. Find the mean and median of the distribution of distance. 


c. Find the standard deviation and interquartile range (IQR) of the 
distribution of distance. 

Teacher raises A school system employs teachers at salaries between 
$38,000 and $70,000. The teachers’ union and school board are 
negotiating the form of next year’s increase in the salary schedule. 
Suppose that every teacher is given a $1000 raise. What effect will this 
raise have on each of the following characteristics of the resulting 
distribution of salary? 


a. Shape 
b. Mean and median 


c. Standard deviation and interquartile range (IQR) 
Used cars, cheap! A used-car salesman has 28 cars in his inventory, 
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with prices ranging from $11,500 to $25,000. For a Labor Day sale, he 
reduces the price of each car by $500. What effect will this reduction 
have on each of the following characteristics of the resulting 
distribution of price? 


a. Shape 
b. Mean and median 


c. Standard deviation and interquartile range ([QR) 
25. pg 100 |») Long jump Refer to Exercise 21. Suppose that the 


corrected long-jump distances are converted from centimeters to 
meters (note that 100 cm=1 m100 cm = 1 m), 


a. What shape would the resulting distribution have? Explain your 
answer. 


b. Find the mean of the distribution of corrected long-jump distance in 
meters. 


c. Find the standard deviation of the distribution of corrected long- 
jump distance in meters. 
26. Step right up! Refer to Exercise 22. Suppose that the distances from 
the tops of the students’ heads to the ground are converted from inches 
to feet (note that 12 in.=1 ft),12 in. = 1 ft 


a. What shape would the resulting distribution have? Explain your 
answer. 


b. Find the mean of the distribution of distance in feet. 


c. Find the standard deviation of the distribution of distance in feet. 

27. Teacher raises Refer to Exercise 23. Suppose each teacher receives a 
5% raise instead of a $1000 raise. What effect will this raise have on 
each of the following characteristics of the resulting salary 
distribution? 


a. Shape 
b. Median 


c. Interquartile range (IQR) 
28. Used cars, cheap! Refer to Exercise 24. Suppose each car’s price is 
reduced by 10% instead of $500. What effect will this discount have 
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on each of the following characteristics of the resulting price 
distribution? 


a. Shape 
b. Median 


c. Interquartile range (IQR) 
29. pg 101 |») Cool pool? Coach Ferguson uses a thermometer to 


measure the temperature (in degrees Fahrenheit) at 20 different 
locations in the school swimming pool. An analysis of the data yields a 
mean of 77°F and a standard deviation of 3°F. (Recall that °C=59 °F 


‘ia 3 160 
~ig99hecall that C=5 F-5 ) 


a. Find the mean temperature reading in degrees Celsius. 


b. Calculate the standard deviation of the temperature readings in 
degrees Celsius. 

30. Measure up Clarence measures the diameter of each tennis ball in a 
bag with a standard ruler. Unfortunately, he uses the ruler incorrectly 
so that each of his measurements is 0.2 inch too large. Clarence’s data 
had a mean of 3.2 inches and a standard deviation of 0.1 inch. 

(Recall that 1 in.=2.54 cm.Recall that 1 in. = 2.54 cm.) 


a. Find the mean of the corrected measurements in centimeters. 


b. Calculate the standard deviation of the corrected measurements in 
centimeters. 

31. Taxi! In 2016, taxicabs in Los Angeles charged an initial fee of $2.85 
plus $2.70 per mile. In equation form, Fare Fare=2.85+2.7(miles). 
Fare = 2.85 + 2.7(miles). at the end of a month, a businessman 
collects all his taxicab receipts and calculates some numerical 
summaries. The mean fare he paid was $15.45 with a standard 
deviation of $10.20. What are the mean and standard deviation of the 
lengths of his cab rides in miles? 

32. Quiz scores The scores on Ms. Martin’s statistics quiz had a mean of 
12 and a standard deviation of 3. Ms. Martin wants to transform the 
scores to have a mean of 75 and a standard deviation of 12. What 
transformations should she apply to each test score? Explain your 
answer. 
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Multiple Choice Select the best answer for Exercises 33-38. 


33. 


34. 


35. 


Jorge’s score on Exam 1 in his statistics class was at the 64th 
percentile of the scores for all students. His score falls 


a. between the minimum and the first quartile. 
b. between the first quartile and the median. 

c. between the median and the third quartile. 

d. between the third quartile and the maximum. 


e. at the mean score for all students. 


When Sam goes to a restaurant, he always tips the server $2 plus 10% 
of the cost of the meal. If Sam’s distribution of meal costs has a mean 
of $9 and a standard deviation of $3, what are the mean and standard 
deviation of his tip distribution? 


a. $2.90, $0.30 
b. $2.90, $2.30 
c. $9.00, $3.00 
d. $11.00, $2.00 
e. $2.00, $0.90 


Scores on the ACT college entrance exam follow a bell-shaped 
distribution with mean 21 and standard deviation 5. Wayne’s 
standardized score on the ACT was —0.6. What was Wayne’s actual 
ACT score? 


a 3 
b. 13 
c. 16 
d. 18 


e. 24 
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36. George’s average bowling score is 180; he bowls in a league where 
the average for all bowlers is 150 and the standard deviation is 20. 
Bill’s average bowling score is 190; he bowls in a league where the 
average is 160 and the standard deviation is 15. Who ranks higher in 
his own league, George or Bill? 


a. Bill, because his 190 is higher than George’s 180. 
b. Bill, because his standardized score is higher than George’s. 


c. Bill and George have the same rank in their leagues, because both 
are 30 pins above the mean. 


d. George, because his standardized score is higher than Bill’s. 


e. George, because the standard deviation of bowling scores is higher 
in his league. 


Exercises 37 and 38 refer to the following setting. The number of absences 
during the fall semester was recorded for each student in a large 
elementary school. The distribution of absences is displayed in the 
following cumulative relative frequency graph. 
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Number of absences 

37. What is the interquartile range (IQR) for the distribution of absences? 
a 1 
b. 2 
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e. 14 


38. If the distribution of absences was displayed in a histogram, what 


would be the best description of the histogram’s shape? 


a. Symmetric 
b. Uniform 

c. Skewed left 
d. Skewed right 


e. Cannot be determined 


Recycle and Review Exercises 39 and 40 refer to the following setting. 
We used Census At School’s Random Data Selector to choose a sample of 
50 Canadian students who completed a survey in a recent year. 


39. 


40. 


Travel time (1.2) The dotplot displays data on students’ responses to 
the question “How long does it usually take you to travel to school?” 
Describe the distribution. 


obo SESS. eS a 
0 20 40 60 80 100 
Travel time (min) 
Lefties (1.1) Students were asked, “Are you right-handed, left- 
handed, or ambidextrous?” The responses (R=right-handed, L=left- 


handed, A=ambidextrous 
R = right-handed, L = left-handed, A = ambidextrous) are shown 


here. 
R R R R R R R R R R R L R R 
R R R R R R R R R R R R R A 
R R R R A R R L R R R R L A 
R R R R R R R R 


a. Make an appropriate graph to display these data. 
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b. Over 10,000 Canadian high school students took the Census At 
School survey that year. What percent of this population would 
you estimate is left-handed? Justify your answer. 
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SECTION 2.2 Density Curves and Normal 
Distributions 


LEARNING TARGETS By the end of the section, you should be able to: 


Use a density curve to model distributions of quantitative data. 


Identify the relative locations of the mean and median of a 
distribution from a density curve. 


Use the 68—95-99.7 rule to estimate (i) the proportion of values in a 
specified interval, or (ii) the value that corresponds to a given 
percentile in a Normal distribution. 


Find the proportion of values in a specified interval in a Normal 
distribution using ‘Table A or technology. 


Find the value that corresponds to a given percentile in a Normal 
distribution using Table A or technology. 


Determine whether a distribution of data is approximately Normal 
from graphical and numerical evidence. 


In Chapter 1, we developed graphical and numerical tools for describing 
distributions of quantitative data. Our work gave us a clear strategy for 
exploring data on a single quantitative variable. 


e Always plot your data: make a graph—usually a dotplot, stemplot, or 
histogram. 


e Look for the overall pattern (shape, center, variability) and for striking 
departures such as outliers. 


e Calculate numerical summaries to describe center and variability. 


In this section, we add one more step to this strategy. 


e When there’s a regular overall pattern, use a simplified model called a 
density curve to describe it. 
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Density Curves 


age footstock/Alamy Stock Photo 


Selena works at a bookstore in the Denver International Airport. She takes 
the airport train from the main terminal to get to work each day. The 
airport just opened a new walkway that would allow Selena to get from the 
main terminal to the bookstore in 4 minutes. She wonders if it will be 
faster to walk or take the train to work. 

Figure 2.7(a) shows a dotplot of the amount of time it has taken Selena 
to get to the bookstore by train each day for the last 1000 days she worked. 
To estimate the percent of days on which it would be quicker for her to 
take the train, we could find the percent of dots (marked in red) that 
represent journey times of less than 4 minutes. Surely, there’s a simpler 
way than counting all those dots! 

Figure 2.7(b) shows the dotplot modeled with a density curve. You 
might wonder why the density curve is drawn at a height of 1/3. That’s so 
the area under the density curve between 2 minutes and 5 minutes is equal 
to 


3x1/3=1,00=100%2 * 1/3 = 1.00 = 100% 


which represents 100% of the observations in the distribution shown in 


Figure 2.7(a). 
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Density curve 


eee 
et Height = = 
cus esas = 3 
Sereses 
: by 
acasmes | Seer ewan ji 
20 25 30 35 40 45 5.0 2.0 2.5 3.0 3.5 4.0 4.5 5.0 
(a) Journey time (min) (b) Journey time (min) 


FIGURE 2.7 (a) Dotplot of Selena’s travel time over the past several years via 
train from the Denver airport main terminal to the bookstore where she works. The 
red dots indicate times when it took her less than 4 minutes to get to work. (b) 
Density curve modeling the dotplot in part (a). The red shaded area estimates the 
proportion of times that it took Selena less than 4 minutes to get to work. 


DEFINITION Density curve 

A density curve is a curve that 

e Is always on or above the horizontal axis 
e Has area exactly 1 underneath it 


The area under the curve and above any interval of values on the 
horizontal axis estimates the proportion of all observations that fall in 
that interval. 


The density curve in Figure 2.7(b) is called a uniform density curve because it has 
constant height. Recall from Chapter 1 that we can describe the dotplot of journey 
times in Figure 2.7(a) as having roughly a uniform distribution. 


The red shaded area under the density curve in Figure 2.7(b) provides a 
good approximation for the proportion or percent of red dots. Because the 
shaded region is rectangular, 


area=base xheight=2 x 1/3=2/3=0.667=66.7% 
area = base x height = 2 x 1/3 = 2/3 = 0.667 = 66.7% 


So we estimate that it would be quicker for Selena to take the train to work 


275 


on about 66.7% of days. In fact, on 669 of the 1000 days, Selena’s journey 
from the terminal to the bookstore took less than 4 minutes. That’s 
669/1000 = 0.669 = 66.9%—-ery close to the estimate we got using the 
density curve. 

No set of quantitative data is exactly described by a density curve. The 
curve is an approximation that is easy to use and accurate enough in most 
cases. The density curve simply smooths out the irregularities in the 
distribution. 


EXAMPLE |That’s so random! >» 


Density curves 


PROBLEM: Suppose you use a calculator or computer random number 
generator to produce a number between 0 and 2 (like 0.84522 or 
1.1111119). The random number generator will spread its output 
uniformly across the entire interval from 0 to 2 as we allow it to 
generate a long sequence of random numbers. 


a. Draw a density curve to model this distribution of random numbers. 
Be sure to include scales on both axes. 


b. About what percent of the randomly generated numbers will fall 
between 0.87 and 1.55? 


c. Estimate the 65th percentile of this distribution of random numbers. 


SOLUTION: 


tT l 
0 2 
Random number 


| The height of the curve needs to be 1/2 so that | 
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area = base x height 


area=basexheight =I =2Zxi/2=1 


b. 
0 0.8% 1.55 2 
Random number 
Area=(1.55-0.87)x1/2=0.34=34% 
Area = (1.55 — 0.87) x 1/2 = 0.34 = 34% 
ran 


Random number 


0.65 = (x — 0) x 1/2 
0.65 = (1/2)x 
0.65=(x—0)x1/20.65=(1/2)x1.30=x !-30 = x 


FOR PRACTICE, TRY EXERCISE 41 


Describing Density Curves 


Density curves come in many shapes. As with the distribution of a 
quantitative variable, we start by looking for rough symmetry or clear 
skewness. Then we identify any clear peaks. Figure 2.8 shows three 
density curves with distinct shapes. 


ay F 


Skewed to the left, Roughly symmetric, Skewed to the right, 


al double-peaked (— peaked 


FIGURE 2.8 Density curves with different shapes. 


Our measures of center and variability apply to density curves as well 
as to distributions of quantitative data. Recall that the mean is the balance 
point of a distribution. Figure 2.9 illustrates this idea for the mean of a 
density curve. 


FIGURE 2.9 The mean of a density curve is its balance point. 


The median of a distribution of quantitative data is the point with half 
the observations on either side. Similarly, the median of a density curve 
is the point with half of the area on each side. 


DEFINITION Mean of a density curve, Median of a 
density curve 


The mean of a density curve is the point at which the curve would 
balance if made of solid material. 


The median of a density curve is the equal-areas point, the point that 
divides the area under the curve in half. 


A symmetric density curve balances at its midpoint because the two 
sides are identical. So the mean and median of a symmetric density curve 
are equal, as in Figure 2.10(a). It isn’t so easy to spot the equal-areas point 
on a skewed density curve. We used technology to locate the median in 
Figure 2.10(b). The mean is greater than the median because the balance 
point of the distribution is pulled toward the long right tail. 
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The long right tail pulls 
the mean to the right 
of the median. 


I Mean 
(a) Median and mean (b) Median 


FIGURE 2.10 (a) Both the median and mean of a symmetric density curve lie at 
the point of symmetry. (b) In a right-skewed density curve, the mean is pulled 
away from the median toward the long tail. 


EXAMPLE | What does the left skew do? > 


Mean versus median 


ABC 
PROBLEM: The density curve that models a distribution of 


quantitative data is shown here. Identify the location of the mean and 
median by letter. Justify your answers. 


SOLUTION: 


Median=BMedian = B, Mean=A.Mean = A. B is the equal-areas point 
of the distribution. The mean will be less than the median due to the 
left-skewed shape. 


Even though C is directly under the peak of the curve, more than half of 


the area is to its left, so C cannot be the median. 


FOR PRACTICE, TRY EXERCISE 45 


2/9 


Because a density curve is an idealized description of a distribution of 
quantitative data, we distinguish between the mean and standard deviation 
of the density curve and the mean x @ and standard deviation s, computed 


from the actual observations. The usual notation for the mean of a density 
curve is p! (the lowercase Greek letter mu). We write the standard deviation 
of a density curve as o (the lowercase Greek letter sigma). 


You probably noticed that we used the same notation for the population mean and 
standard deviation, p and o, in Chapter 1 as we do here for the mean and standard 


deviation of a density curve. 


We can roughly locate the mean p of any density curve by eye, as the 
balance point. No easy way exists to estimate the standard deviation for 
density curves in general. But there is one family of density curves for 
which we can estimate the standard deviation by eye. 


Normal Distributions 


Blend Images/Shutterstock.com 


When we examine a distribution of quantitative data, how does it compare 
with an idealized density curve? Figure 2.11(a) shows a histogram of the 
scores of all seventh-grade students in Gary, Indiana, on the vocabulary 
part of the Iowa Test of Basic Skills (ITBS). The scores are grade-level 
equivalents, so a score of 6.3 indicates that the student’s performance is 
typical for a student in the third month of grade 6. The histogram is 
roughly symmetric, and both tails fall off smoothly from a single center 
peak. There are no large gaps or obvious outliers. 
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The density curve drawn through the tops of the histogram bars in 
Figure 2.11(b) is a good description of the overall pattern of the ITBS 
score distribution. We call it a Normal curve. The distributions described 
by Normal curves are called Normal distributions. In this case, the ITBS 
vocabulary scores of Gary, Indiana, seventh-graders are approximately 
Normally distributed. 


ITBS Seawlaenh score (b) ITBS vocabulary score 
FIGURE 2.11 (a) Histogram of the Iowa Test of Basic Skills (ITBS) vocabulary 
scores of all seventh-grade students in Gary, Indiana. (b) The Normal density curve 
shows the overall shape of the distribution. 


Normal distributions play a large role in statistics, but they are rather 
special and not at all “normal” in the sense of being usual or typical. We 
capitalize Normal to remind you that these density curves are special. 

Look at the two Normal distributions in Figure 2.12. They illustrate 
several important facts: 


e Shape: All Normal distributions have the same overall shape: 
symmetric, single-peaked, and bell-shaped. 


e Center: The mean p is located at the midpoint of the symmetric density 
curve and is the same as the median. 


e Variability: The standard deviation o measures the variability (width) 
of a Normal distribution. 
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FIGURE 2.12 Two Normal curves, showing the mean p and standard deviation o. 


You can estimate o by eye on a Normal curve. Here’s how. Imagine 
that you are skiing down a mountain that has the shape of a Normal 
distribution. At first, you descend at an increasingly steep angle as you go 
out from the peak. 


o™% 
; \ 


/ \ 


Fortunately, before you find yourself going straight down, the slope begins 
to get flatter rather than steeper as you go out and down: 
re, \. 


> a _~ 
— 7 


The points at which this change of curvature takes place are located at 
a distance o on either side of the mean p!. (Advanced math students know 
these as “inflection points.”) You can feel the change as you run a pencil 
along a Normal curve, which will allow you to estimate the standard 
deviation. 


DEFINITION Normal distribution, Normal curve 


A Normal distribution is described by a symmetric, single-peaked, 
bell-shaped density curve called a Normal curve. Any Normal 
distribution is completely specified by two numbers: its mean p and 
standard deviation o. 


The distribution of ITBS vocabulary scores for seventh-grade students 
in Gary, Indiana, is modeled well by a Normal distribution with mean 
u=6.84! = 6-84 and standard deviation o=1.550 = 1.55, The figure shows 
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this distribution with the points 1, 2, and 3 standard deviations from the 
mean labeled on the horizontal axis. 


/ o o 


T T T T T T T 
2.19 3.74 5.29 684 839 9.94 11.49 


ITBS vocabulary score 


You will be asked to make reasonably accurate sketches of Normal 
distributions to model quantitative data sets like the ITBS vocabulary 
scores. The best way to learn is to practice. 


EXAMPLE |Stop the car! (») 


Sketching a Normal distribution 


fstop123/Getty Images 


PROBLEM: Many studies on automobile safety suggest that when 
automobile drivers make emergency stops, the stopping distances follow 
an approximately Normal distribution. Suppose that for one model of 
car traveling at 62 mph under typical conditions on dry pavement, the 
mean stopping distance is p=155 ftl4 = 159 ft with a standard deviation 
of o=3 fto = 3 ft. Sketch the Normal curve that approximates the 
distribution of stopping distance. Label the mean and the points that are 
1, 2, and 3 standard deviations from the mean. 
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SOLUTION: 


T T 
146 149 12 155 158 161 164 


Stopping distance (ft) 


The mean (155) is at the midpoint of the bell-shaped density curve. The 
standard deviation (3) is the distance from the center to the change-of- 
curvature points on either side. Label the mean and the points that are 1, 
2, and 3 SDs from the mean: 


1 SD: 155 — 1(3) = 152 and 155 + 1(3) = 158 
2 SD: 155 — 2(3) = 149 and 155 + 2(3) = 161 
3 SD: 155 — 3(3) = 146 and 155 + 3(3) = 164 


FOR PRACTICE, TRY EXERCISE 47 


oOo Remember that p and o alone do not specify the appearance of 


most distributions. The shape of density curves, in general, does not 
reveal o. These are special properties of Normal distributions. 


Why are Normal distributions important in statistics? Here are three 
reasons. 


.. Normal distributions are good descriptions for some distributions of real 
data. Distributions that are often close to Normal include: 


e Scores on tests taken by many people (such as SAT exams and IQ 
tests) 


e Repeated careful measurements of the same quantity (like the 
diameter of a tennis ball) 


e Characteristics of biological populations (such as lengths of crickets 
and yields of corn) 


’, Normal distributions are good approximations to the results of many 
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kinds of chance outcomes, like the proportion of heads in many tosses 
of a fair coin. 


3. Many of the inference methods in Chapters 8—12 are based on Normal 
distributions. 
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DEA/A. DAGLI ORTI/De Agostini/Getty Images 


Normal curves were first applied to data by the great mathematician 
Carl Friedrich Gauss (1777-1855). He used them to describe the small 
errors made by astronomers and surveyors in repeated careful 
measurements of the same quantity. You will sometimes see Normal 
distributions labeled “Gaussian” in honor of Gauss. His image was even 
featured on a previous German 10 DM bill, along with a sketch of the 
Normal! distribution. 


The 68—95-99.7 Rule 


Earlier, we saw that the distribution of Iowa Test of Basic Skills (ITBS) 
vocabulary scores for seventh-grade students in Gary, Indiana, is 
approximately Normal with mean p=6.844 — 6.84 and standard deviation 
o0=1.550 = 1.55, How unusual is it for a Gary seventh-grader to get an 
ITBS score less than 3.74? The figure shows the Normal density curve for 
this distribution with the area of interest shaded. Note that the boundary 
value, 3.74, is exactly 2 standard deviations below the mean. 
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nas =a) T | on 
2.19 3.74 5.29 6.84 8.39 9.94 11.49 


ITBS vocabulary score 
Calculating the shaded area isn’t as easy as multiplying base x height 
base x height but it’s not as hard as you might think. The following 
activity shows you how to do it. 


ACTIVITY What’s so special about Normal distributions? 


prPLe> 


¢ 


In this activity, you will use the Normal Density Curve applet at 
highschool.bfwpub.com/tps6e to explore an interesting property of 
Normal distributions. 

Change the mean to 6.84 and the standard deviation to 1.55, and 
click on “UPDATE.” (These are the values for the distribution of ITBS 
vocabulary scores of seventh-graders in Gary, Indiana.) A figure like the 
one that follows should appear: 

Use the applet to help you answer the following questions. 


5.290 8.390 


0.64 2.19 3.74 529 684 839 9.94 1149 13.04 


1. What percent of the area under the Normal curve lies within 1 
standard deviation of the mean? That is, about what percent of Gary, 
Indiana, seventh-graders have ITBS vocabulary scores between 5.29 
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and 8.39? 

(b) What percent of the area under the Normal curve lies within 2 
standard deviations of the mean? Interpret this result in context. 
(c) What percent of the area under the Normal curve lies within 3 
standard deviations of the mean? Interpret this result in context. 


2. The distribution of IQ scores in the adult population is approximately 
Normal with mean p = 100 and standard deviation o = 15. Adjust the 
applet to display this distribution. About what percent of adults have 
IQ scores within 1, 2, and 3 standard deviations of the mean? 


3. Adjust the applet to have a mean of 0 and a standard deviation of 1. 
Then click “UPDATE.” The resulting density curve describes the 
standard Normal distribution. What percent of the area under this 
Normal density curve lies within 1, 2, and 3 standard deviations of the 
mean? 


4. Summarize by completing this sentence: “For any Normal 
distribution, the area under the Normal curve within 1, 2, and 3 
standard deviations of the mean is about %, %, and vou 


When you hear the phrase “standard Normal distribution,” think standardized 


scores (z-scores), which have a mean of 0 and a standard deviation of 1. 


Although there are many Normal distributions, they all have properties 
in common. In particular, all Normal distributions obey the 68—95-—99.7 
rule. 


DEFINITION The 68—95-99.7 rule 
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68% of 
values 
95% of 
values 
99.7% of 
values 


Standard deviations 


In a Normal distribution with mean p and standard deviation o: 

e Approximately 68% of the observations fall within o of the mean 
Ll. 

e Approximately 95% of the observations fall within 20 of the mean 
LL. 

e Approximately 99.7% of the observations fall within 30 of the 


mean [. 


This result is known as the 68—95-99.7 rule. 


Some people refer to the 68—95-99.7 rule as the empirical rule (empirical means 


“learned from experience or by observation”). 


By remembering these three numbers, you can quickly estimate 
proportions or percents of observations (areas) using Normal distributions 
and recognize when an observation is unusual. 

Earlier, we asked how unusual it would be for a Gary seventh-grader to 
get an ITBS score less than 3.74. Figure 2.13 gives the answer in graphical 
form. By the 68—95-—99.7 rule, about 95% of these students have ITBS 
vocabulary scores between 3.74 and 9.94, which means that about 5% of 
the students have scores less than 3.74 or greater than 9.94. Due to the 
symmetry of the Normal distribution, about 5%/2 = 2.5%270/2 = 2.5% of 
students have scores less than 3.74. So it is quite unusual for a Gary, 
Indiana, seventh-grader to get an ITBS vocabulary score below 3.74. 
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About 2.5% of scores 
are less than 3.74. 


About 95% of scores 
within 20 of 2. 


2.19 3.74 5.29 6.84 8.39 9.94 11.49 
ITBS vocabulary score 


FIGURE 2.13 Using the 68—95—99.7 rule to estimate the percent of Gary, Indiana, 
seventh-graders with ITBS vocabulary scores less than 3.74. 


How well does the 68—95-—99.7 rule describe the distribution of ITBS 
vocabulary scores for Gary, Indiana, seventh-graders? Well, 900 of the 
947 scores are between 3.74 and 9.94. That’s 95.04%, which is very 
accurate indeed. Of the remaining 47 scores, 20 are below 3.74 and 27 are 
above 9.94. The number of values in each tail is not quite equal, as it 
would be in an exactly Normal distribution. Normal distributions often 
describe real data better in the center of the distribution than in the extreme 
high and low tails. As famous statistician George Box once noted, “All 
models are wrong, but some are useful!” 


EXAMPLE | Stop the car! (») 
Using the 68—95-99.7 rule 


PROBLEM: Many studies on automobile safety suggest that when 
automobile drivers must make emergency stops, the stopping distances 
follow an approximately Normal distribution. Suppose that for one 
model of car traveling at 62 mph under typical conditions on dry 
pavement, the mean stopping distance is p=155 ft! — 155 ft with a 
standard deviation of o=3 fto = 3 ft. 


a. About what percent of cars of this model would take more than 158 
feet to make an emergency stop? Show your method clearly. 


b. A car of this model that takes 158 feet to make an emergency stop 
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would be at about what percentile of the distribution? Justify your 
answer. 


SOLUTION: 


d. About 68% 


146 149° «sn tss ASR 161—«164 
Stopping distance (ft) 


About 16% of cars of this model would take more than 158 feet to 
make an emergency stop. 


Start by sketching a Normal curve and labeling the values 1, 2, and 3 
standard deviations from the mean. Then shade the area of interest. 


Use the 68—95-—99.7 rule and the symmetry of the Normal distribution 
to find the desired area. 


b. About the 84th percentile because about 100%-16%=84% 
100% — 16% = 84% of cars of this model would stop in less than 158 
feet. 


FOR PRACTICE, TRY EXERCISE 51 


oOo Note that the 68—95-99.7 rule applies only to Normal 


distributions. Is there a similar rule that would apply to any distribution? 
Sort of. A result known as Chebyshev’s inequality says that in any 


distribution, the proportion of observations falling within k standard 
1 


1-— 
deviations of the mean is at least 1-1k2 If k = 2, for example, 


1—-+ =0.75 
Chebyshev’s inequality tells us that at least 1-122=0.75 2 of 


the observations in any distribution are within 2 standard deviations of the 
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mean. For Normal distributions, we know that this proportion is much 
higher than 0.75. In fact, it’s approximately 0.95. 


Chebyshev’s inequality is an interesting result, but it is not required for the AP® 


Statistics exam. 


CHECK YOUR UNDERSTANDING 


(\ 


\ 


y The distribution of heights of young women aged 18 to 24 is 

approximately Normal with mean »=64.5 inches! = 64.5 inches ang 

standard deviation o=2.5 incheso = 2.5 inches, 

1. Sketch the Normal curve that approximates the distribution of young 
women’s height. Label the mean and the points that are 1, 2, and 3 
standard deviations from the mean. 


2. About what percent of young women have heights less than 69.5 
inches? Show your work. 


3. Is a young woman with a height of 62 inches unusually short? Justify 
your answer. 


Finding Areas in a Normal Distribution 


Let’s return to the distribution of ITBS vocabulary scores among all Gary, 
Indiana, seventh-graders. Recall that this distribution is approximately 
Normal with mean p=6.84/ = 6-84 and standard deviation o=1.550 = 1.55 
. What proportion of these seventh-graders have vocabulary scores that are 
below sixth-grade level? Figure 2.14 shows the Normal curve with the area 
of interest shaded. We can’t use the 68—95—99.7 rule to find this area 
because the boundary value of 6 is not exactly 1, 2, or 3 standard 
deviations from the mean. 
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| 
2.19 3.74 5.29 6 6.84 8.39 9.94 11.49 
ITBS vocabulary score 


FIGURE 2.14 Normal curve we would use to estimate the proportion of Gary, 
Indiana, seventh-graders with ITBS vocabulary scores that are less than 6—that is, 
below sixth-grade level. 


As the 68—95-—99.7 rule suggests, all Normal distributions are the same 
if we measure in units of size o from the mean p. Changing to these units 
requires us to standardize, just as we did in Section 2.1: 

value — mean rt 


z=value — meanstandard deviation=x- po standarddeviation o 


Recall that subtracting a constant and dividing by a constant don’t change 
the shape of a distribution. If the quantitative variable we standardize has 
an approximately Normal distribution, then so does the new variable z. 
This new distribution of standardized values can be modeled with a 
Normal curve having mean p=04 — 0 and standard deviation o=10 = 1. It 
is called the standard Normal distribution. 


DEFINITION Standard Normal distribution 


The standard Normal distribution is the Normal distribution with 
mean 0 and standard deviation 1. 
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Because all Normal distributions are the same when we standardize, 
we can find areas under any Normal curve using the standard Normal 
distribution. Table A in the back of the book gives areas under the standard 
Normal curve. The table entry for each z-score is the area under the curve 
to the left of z. 


For the ITBS test score data, we want to find the area to the left of 6 
under the Normal distribution with mean 6.84 and standard deviation 1.55. 


See Figure 2.15(a). We start by standardizing the boundary value x=6 
a= 6: 


z=value — meanstandard deviation=6 — 6.841.55=—0.54 


: value—mean _ 6-6.84 | 0.54 
. ~ standard deviation 1.55 | ™ 


Figure 2.15(b) shows the standard Normal distribution with the area to the 
left of z=-0.542 = —0.54 shaded. Notice that the shaded areas in the two 
graphs are the same. 


An A 


5.29 6 AF ae a -1_954 0 


ITBS vocabulary score z-score 


FIGURE 2.15 (a) Normal distribution estimating the proportion of Gary, Indiana, 
seventh-graders who earn ITBS vocabulary scores less than 6th-grade level. (b) 
The corresponding area in the standard Normal distribution. 


To find the area to the left of z=-0.542 = —9.54 locate —0.5 in the left- 
hand column of Table A, then locate the remaining digit 4 as .04 in the top 
row. The entry to the right of —0.5 and under .04 is .2946. This is the area 
we seek. We estimate that about 29.46% of Gary, Indiana, seventh-grader 
scores fall below the sixth-grade level on the ITBS vocabulary test. Note 
that we have made a connection between z-scores and percentiles when 
the shape of a distribution is approximately Normal. 


Z .03 04 05 
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—0.6 .2643 2611 2978 


—0.4 3336 .3300 .3264 


It is also possible to find areas under a Normal curve using technology. 


5. Technology Corner | FINDING AREAS FROM 
VALUES IN A NORMAL DISTRIBUTION (>) 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


The normalcdf command on the TI-83/84 can be used to find areas 
under a Normal curve. The syntax is normalcdf(lower bound, upper 
bound, mean, standard deviation). Let’s use this command to calculate 
the proportion of ITBS vocabulary scores in Gary, Indiana, that are less 
than 6. Note that we can do the area calculation using the standard 
Normal distribution or the Normal distribution with mean 6.84 and 
standard deviation 1.55. 


I. Using the standard Normal distribution: What proportion of 
observations in a standard Normal distribution are less than z=—0.54 
z = —0.54? Recall that the standard Normal distribution has mean 
u=0/ = 9 and standard deviation o=1¢ = 1. 


e Press (Distr) and choose normalcdf(. OS 2.55 or 


later: In the dialog box, enter these values: lower:—1000, upper: 


-0.54, :0/ 9, o:10 : 1, choose Paste, and then press [ENTER | 
Older OS: Complete the command normalcdf(—1000,—0.54,0,1) 


and press [ENTER | 


Note: We chose —1000 as the lower bound because it’s many, many 
standard deviations less than the mean. 


294 


NORMAL FLOAT AUTO REAL RADIAN MP it 


normalcdf (-1000, -@.54.0,1) 
Q@. 2945984889 


I. Using the unstandardized Normal distribution: What proportion of 
observations in a Normal distribution with mean p=6.84/ = 6-84 and 
standard deviation o6=1.550 = 1.55 are less than x=62 = 6? 


e Press (Distr) and choose normalcdf(. OS 2.55 or 


later: In the dialog box, enter these values: lower:—1000, upper:6, 
1:6.8414:6.84 0:1.55¢ : 1.55, choose Paste, and then press 


[ENTER] 


Older OS: Complete the command normalcdf(—1000,6,6.84,1.55) 


and press [ENTER|. 


This answer differs slightly from the one we got using the standard 
Normal distribution because we rounded the standardized score to 
two decimal places: z=-0.542 = —9.54, 

[NORMAL FLOAT AUTO REAL RADIAN MP =F] 


normalcdf ( -1000,6,6.84,1. 
8. 2939314473 


The following box summarizes the process of finding areas in a 
Normal distribution. In Step 2, each method of performing calculations has 
some advantages, so check with your teacher to see which option will be 
used in your class. 
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HOW TO FIND AREAS IN ANY NORMAL 
DISTRIBUTION 


Step 1: Draw a Normal distribution with the horizontal axis labeled 
and scaled using the mean and standard deviation, the boundary 
value(s) clearly identified, and the area of interest shaded. 


Step 2: Perform calculations—show your work! Do one of the 
following: 

(i) Standardize each boundary value and use Table A or technology to 
find the desired area under the standard Normal curve; or 


(ii) Use technology to find the desired area without standardizing. 


Be sure to answer the question that was asked. 


AP® EXAM TIP 


Students often do not get full credit on the AP® Statistics exam because 
they use option (ii) with “calculator-speak” to show their work on 
Normal calculation questions—for example, normalcdf(— 
1000,6,6.84,1.55). This is not considered clear communication. To get 
full credit, follow the two-step process above, making sure to carefully 
label each of the inputs in the calculator command if you use 
technology in Step 2: normalcdf(lower: —1000, upper: 6, mean: 6.84, 
SD:1.55). 


EXAMPLE |Stop the car! (») 
Finding area to the left 
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fizkes/Shutterstock.com 


PROBLEM: As noted in the preceding example, studies on automobile 
safety suggest that stopping distances follow an approximately Normal 
distribution. For one model of car traveling at 62 mph, the mean 
stopping distance is p=155 ft! ~ 155 ft with a standard deviation of 
o=3 fto = 3 ft. Danielle is driving one of these cars at 62 mph when she 
spots a wreck 160 feet in front of her and needs to make an emergency 
stop. About what percent of cars of this model when going 62 mph 
would be able to make an emergency stop in less than 160 feet? Is 


Danielle likely to stop safely? 


SOLUTION: 


Stopping distance (ft) 


1. Draw a normal distribution. Be sure to: 
e Scale the horizontal axis. 


e Label the horizontal axis with the variable name, including units 
of measurement. 


e Clearly identify the boundary value(s). 
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| e Shade the area of interest. | 


, — 160-155 _ 
i. z=160 - 1553=1.67 3 © aie 


Using Table A: Area for z < 1.67 is 0.9525. 


Using technology: normalcdf(lower: —1000, upper: 1.67, mean: 0, 
SD: 1) = 0.9525 


ii. normalcdf(lower: —1000, upper: 160, mean: 155, SD: 3) = 0.9522 


About 95% of cars of this model would be able to make an emergency 
stop within 160 feet. So Danielle is likely to be able to stop safely. 


2. Perform calculations—show your work! 
i. Standardize the boundary value and use Table A or technology 


to find the desired probability; or 
ii. Use technology to find the desired area without standardizing. 


Z 06 07 08 
15 9406 9418 9429 
1.6 9515 ae ae 9535 
1.7 9608 9616 9625 


Be sure to answer the question that was asked. 


FOR PRACTICE, TRY EXERCISE 53 


What percent of cars of this model would be able to make an 
emergency stop in less than 140 feet? The standardized score for x=140 
z = 140 js 

140-155 


z=——_—= -5.00 
z=140 - 1553=-5.00 3 


Table A does not go beyond z=—3.502 = —3-90 and z=3.502 = 3.50 
because it is highly unusual for a value to be more than 3.5 standard 
deviations from the mean in a Normal distribution. For practical purposes, 
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we can act as if there is approximately zero probability outside the range 
of Table A. So there is almost no chance that a car of this model going 62 
mph would be able to make an emergency stop within 140 feet. 


FINDING AREAS TO THE RIGHT IN A NORMAL 
DISTRIBUTION What proportion of Gary, Indiana, seventh-grade scores 
on the ITBS vocabulary test are at least 9? Start with a picture. Figure 
2.16(a) on the next page shows the Normal distribution with mean p=6.84 
t = 6.84 and standard deviation o=1.550 = 1.55 with the area of interest 
shaded. Next, standardize the boundary value: 


z= — = 1.39 
zZz—9 — 6.841.55=1.39 1.55 


Figure 2.16(b) shows the standard Normal distribution with the area to the 
right of z=1.39z = 1.39 shaded. Again, notice that the shaded areas in the 
two graphs are the same. 


co \ . / \ 


2.19 3.74 de aa : 9.94 11.49 4 0 1 439 2 


(a) ITBS vocabulary score z-score 

FIGURE 2.16 (a) Normal distribution eae the proportion of Gary, Indiana, 
seventh-graders who earn ITBS vocabulary scores at the ninth-grade level or 
higher. (b) The corresponding area in the standard Normal distribution. 


To find the area to the right of z=1.392 = 1.39, locate 1.3 in the left- 
hand column of Table A, then locate the remaining digit 9 as .09 in the top 
row. The entry to the right of 1.3 and under .09 is .9177. However, this is 
the area to the left of z=1.39z = 1.39. We can use the fact that the total 
area in the standard Normal distribution is 1 to find that the area to the 
right of z=1.392 = 1.39 is 1-0.9177=0.08231 — 0.9177 = 0.0823, We 
estimate that about 8.23% of Gary, Indiana, seventh-graders earn scores at 
the ninth-grade level or above on the ITBS vocabulary test. 


Zz 07 08 .09 
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LZ .8980 .8997 2015 


1.4 9292 .9306 .9319 


oo A common student mistake is to look up a z-score in Table A 


and report the entry corresponding to that z-score, regardless of 
whether the problem asks for the area to the left or to the right of that 
z-score. This mistake can usually be prevented by drawing a Normal 
distribution and shading the area of interest. Look to see if the area should 
be closer to 0 or closer to 1. In the preceding example, for instance, it 
should be obvious that 0.9177 is not the correct area. 


EXAMPLE | Can Spieth clear the trees? » 
Finding area to the right 


Christian Petersen/Getty Images 


PROBLEM: When professional golfer Jordan Spieth hits his driver, the 
distance the ball travels can be modeled by a Normal distribution with 
mean 304 yards and standard deviation 8 yards. On a specific hole, 
Jordan would need to hit the ball at least 290 yards to have a clear 
second shot that avoids a large group of trees. What percent of Spieth’s 
drives travel at least 290 yards? Is he likely to have a clear second shot? 


SOLUTION: 
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280 Pe) 296 304 312 320 328 
290 


Distance traveled (ya) 


1. Draw a Normal distribution. 


290-304 
| 2-000 = 208 — — > aii 


Using Table A: Area for z < -1.75 is 0.0401. Area for z > -1.75 is 
1—0.0401 = 0.9599. 


Using technology: normalcdf(lower: —1.75, upper: 1000, mean: 0, 
SD: 1) = 0.9599 


ii. normalcdf(lower: 290, upper: 1000, mean: 304, SD: 8) = 0.9599 


2. Perform calculations—show your work! 


i. Standardize and use Table A or technology; or 
ii. Use technology without standardizing. 


About 96% of Jordan Spieth’s drives travel at least 290 yards. So he is 
likely to have a clear second shot. 


Be sure to answer the question that was asked. 


FOR PRACTICE, TRY EXERCISE 55 


Think About It 


WHAT PROPORTION OF JORDAN SPIETH’S DRIVES GO 
EXACTLY 290 YARDS? There is no area under the Normal density 
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curve in the preceding example directly above the point 
290.000000000... . So the answer to our question based on the Normal 
distribution is 0. One more thing: the areas under the curve with x>290 
« = 290 and x>290z > 290 are the same. According to the Normal 
model, the proportion of Spieth’s drives that travel at least 290 yards is 
the same as the proportion that travel more than 290 yards. 


FINDING AREAS BETWEEN TWO VALUES IN A NORMAL 
DISTRIBUTION How do you find the area in a Normal distribution that 
is between two values? For instance, suppose we want to estimate the 
proportion of Gary, Indiana, seventh-graders with ITBS vocabulary scores 
between 6 and 9. Figure 2.17(a) shows the desired area under the Normal 
curve with mean p=6.84/ = 6-84 and standard deviation o=1.550 = 1.55, 
We can use Table A or technology to find the desired area. 


Option (i): If we standardize each boundary value, we get 


Z=6 — 6.841.55=-0.54,  z=9- 6.841.55=1.39 


PA 
~ 


Figure 2.17(b) shows the corresponding area of interest in the standard 
Normal distribution. 


| 
2.19 3.74 5.29 6 684 839 9 994 11.49 


(a) ITBS vocabulary score (b) z-score 

FIGURE 2.17 (a) Normal distribution approximating the proportion of seventh- 
graders in Gary, Indiana, with ITBS vocabulary scores between 6 and 9. (b) The 
corresponding area in the standard Normal distribution. 


Using Table A: The table makes this process a bit trickier because it 
only shows areas to the left of a given z-score. The visual shows one way 
to think about the calculation. 
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0.54 
z-score 1.39 


Area between z= 
—0.54 and z=1.39=Area to the left of z=1.39—Area to the left of z= 
—0.54=0.9177 —0.2946=0.6231 
Area between z = —0.54 and z = 1.39 


= Area to the left of z = 1.39 — Area to the left of z = —0.54 
= 0.9177 — 0.2946 
= 0.6231 


Using technology: normalcdf(lower: —0.54, upper: 1.39, mean: 0, SD: 1) = 
0.6231 


Option (ii): normalcdf(lower: 6, upper: 9, mean: 6.84, SD: 1.55) = 0.6243. 


About 62% of Gary, Indiana, seventh-graders earned grade-equivalent 
scores between 6 and 9. 


EXAMPLE | Can Spieth reach the green? » 


Finding areas between two values 


Ben Stansall/AFP/Getty Images 


PROBLEM: When professional golfer Jordan Spieth hits his driver, the 
distance the ball travels can be modeled by a Normal distribution with 
mean 304 yards and standard deviation 8 yards. On another golf hole, 
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Spieth has the opportunity to drive the ball onto the green if he hits the 
ball between 305 and 325 yards. What percent of Spieth’s drives travel a 
distance that falls in the interval? Is he likely to get the ball on the green 
with his drive? 


SOLUTION: 


280 288 256 so4K 312 sz0} 328 
305 325 
Distance traveled (ya) 


1. Draw a Normal distribution. 


i. Z=305-3048=0.13 = z=325-3048=2.63 


305—304 325—304 
z= BM _013 z= @™ = 263 


5 


Using Table A: 0.9957—0.5517 = 0.4440 

Using technology: 

normalcdf(lower:0.13, upper:2.63, mean:0, SD:1) = 0.4440 
ii. normalcdf(lower:305, upper:325, mean:304, SD:8) = 0.4459 


2. Perform calculations—show your work! 


i. Standardize and use Table A or technology; or 
ii. Use technology without standardizing. 


About 45% of Spieth’s drives travel between 305 and 325 yards. He has 
a fairly good chance of getting the ball on the green—assuming he hits 
the shot straight. 


Be sure to answer the question that was asked. 
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FOR PRACTICE, TRY EXERCISE 57 


CHECK YOUR UNDERSTANDING 


( 


y High levels of cholesterol in the blood increase the risk of heart 

disease. For 14-year-old boys, the distribution of blood cholesterol is 

approximately Normal with mean p=170 milligrams/’ = 170 milligrams 

of cholesterol per deciliter of blood (mg/dl) and standard deviation 

0=30 mg/dl? = 30 mg/dl 6 

1. Cholesterol levels higher than 240 mg/dl may require medical 
attention. What percent of 14-year-old boys have more than 240 
mg/dl of cholesterol? 


2. People with cholesterol levels between 200 and 240 mg/dl are at 
considerable risk for heart disease. What proportion of 14-year-old 
boys have blood cholesterol between 200 and 240 mg/dl? 


Working Backward: Finding Values from Areas 


So far, we have focused on finding areas in Normal distributions that 
correspond to specific values. What if we want to find the value that 
corresponds to a particular area? For instance, suppose we want to estimate 
the 90th percentile of the distribution of ITBS vocabulary scores for Gary, 
Indiana, seventh-graders. Figure 2.18(a) shows the Normal curve with 
mean p=6.84/ = 9.84 and standard deviation o=1.557 = 1.55 that models 
this distribution. We’re looking for the ITBS score x that has 90% of the 
area to its left. Figure 2.18(b) shows the standard Normal distribution with 
the corresponding area shaded. 
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Area = 0.90 Area = 0.90 


219 374 5.29 684 839 * 9.94 1149 3 2 = 0 , 
(a) ITBS vocabulary score (b) z-score 


FIGURE 2.18 (a) Normal distribution showing the 90th percentile of ITBS 
vocabulary scores for Gary, Indiana, seventh-graders. (b) The 90th percentile in the 


standard Normal distribution. 


We can use Table A or technology to find the z-score with an area of 
0.90 to its left. Because Table A gives the area to the left of a specified z- 
score, all we have to do is find the value closest to 0.90 in the middle of 
the table. From the reproduced portion of Table A, you can see that the 
desired value is z=1.282 = 1.28. Then we “unstandardize” to get the 
corresponding ITBS vocabulary score x. 


zZ 07 08 09 
11 8790 8810 8830 
1 8980 re: 9015 
1.3 9147 9162 9177 


2=x — 101.28=x — 6.841.551.28(1.55)+6.84=x8.824=x 


c—ji 
Zz = 
oO 
128 = r—6.84 
1.55 
1.28(1.55)+6.84 = « 
8.824 = zx 


So the 90th percentile of the distribution of ITBS vocabulary scores for 
Gary, Indiana, seventh-graders is 8.824. 


It is also possible to find the 90th percentile of either distribution in 
Figure 2.18 using technology. 


6. Technology Corner | FINDING VALUES FROM 
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AREAS IN A NORMAL DISTRIBUTION |») 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


The TI-83/84 invNorm command calculates the value corresponding to 
a given percentile in a Normal distribution. The syntax is invNorm(area 
to the left, mean, standard deviation). Let’s use this command to 
confirm the 90th percentile for the ITBS vocabulary scores in Gary, 
Indiana. Note that we can do the calculation using the standard Normal 
distribution or the Normal distribution with mean 6.84 and standard 
deviation 1.55. 
i. Using the standard Normal distribution: What is the 90th percentile 
of the standard Normal distribution? 


e Press VARS| (Distr) and choose invNorm(. 


OS 2.55 or later: In the dialog box, enter these values: area:0.90, 


u:0/ * 9, 6:10: 1, choose Paste, and then press [ENTER|, 
Older OS: Complete the command invNorm(0.90,0,1) and press 


[ENTER] 


Note: The most recent TI-84 Plus CE OS has added an option for 
specifying area in the LEFT, CENTER, or RIGHT of the distribution. 
Choose LEFT in this case. 


This result matches what we got using Table A. Now 
“unstandardize” as shown preceding the Technology Corner to get 
x=8.824@ = 8.824, 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


invNorm(@.90,0,1,LEFT) 
1.281551567 


ii. Using the unstandardized Normal distribution: What is the 90th 
percentile of a Normal distribution with mean p = 6.84 and standard 
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deviation 0=1.55¢ = 1.55? 


e Press (Distr) and choose invNorm(. 
OS 2.55 or later: In the dialog box, enter these values: area:0.90, 
:6.84, 0:1.55, choose Paste, and then press [ENTER |. 
Older OS: Complete the command invNorm(0.90,6.84,1.55) and 


press [ENTER| 


Note: The most recent TI-84 Plus CE OS has added an option for 
specifying area in the LEFT, CENTER, or RIGHT of the distribution. 
Choose LEFT in this case. 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


invNorm(@. 90,6.84,1.55,LE)> 
8. 826484928 


The following box summarizes the process of finding a value 
corresponding to a given area in a Normal distribution. In Step 2, each 
method of performing calculations has some advantages, so check with 
your teacher to see which option will be used in your class. 


HOW TO FIND VALUES FROM AREAS IN ANY 
NORMAL DISTRIBUTION 


Step 1: Draw a Normal distribution with the horizontal axis labeled 
and scaled using the mean and standard deviation, the area of interest 
shaded, and unknown boundary value clearly marked. 


Step 2: Perform calculations—show your work! Do one of the 
following: 

(i) Use Table A or technology to find the value of z with the indicated 
area under the standard Normal curve, then “unstandardize” to 
transform back to the original distribution; or 
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(ii) Use technology to find the desired value without standardizing. 
Be sure to answer the question that was asked. 


AP® EXAM TIP 


As noted previously, to make sure that you get full credit on the AP® 
Statistics exam, do not use “calculator-speak” alone—for example, 
invNorm(0.90,6.84,1.55). This is not considered clear communication. 
To get full credit, follow the two-step process above, making sure to 
carefully label each of the inputs in the calculator command if you use 
technology in Step 2: invNorm(area: 0.90, mean: 6.84, SD:1.55). 


EXAMPLE | How tall are 3-year-old girls? (») 


Finding a value from an area 


idk 


Cultura Creative (RF)/Alamy 
Stock Photo 


PROBLEM: According to www.cdc.gov/growthcharts/, the heights of 
3-year-old females are approximately Normally distributed with a mean 
of 94.5 centimeters and a standard deviation of 4 centimeters. Seventy- 
five percent of 3-year-old girls are taller than what height? 


SOLUTION: 
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Area = 0.75 
Area =1- 0.75 = 025 


94.5 BS 102.5 106.5 


Heights of 3-year-old girls (cm) 


If 75% of 3-year-old girls are taller than a certain height, then 25% of 3- 
year-old girls are shorter than that height. So we just need to find the 
25th percentile of this distribution of height. 


1. Draw a Normal distribution. 
From the 68—95—99.7 rule, we know that about 16% of the 
observations in a Normal distribution will fall more than 1 
standard deviation less than the mean. So the 25th percentile will 
be located slightly to the right of 90.5, as shown. 


i. Using Table A: 0.25 area to the left — z = —0.67 
Using technology: invNorm(area: 0.25, mean: 0, SD: 1) = —0.67 
~0.67 = = 
—0.67(4) + 94.5 =x 


~0.67=x-94.54-0.67(4)+94.5=x91.82=x 91.82 = x 
ii. invNorm(area: 0.25, mean: 94.5, SD: 4) = 91.80 
About 75% of 3-year-old girls are taller than 91.80 centimeters. 


2. Perform calculations—show your work! 
i. Use Table A or technology to find the value of z with the 
indicated area under the standard Normal curve, then 


“unstandardize” to transform back to the original distribution; 
or 
ii. Use technology to find the desired value without standardizing. 
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Be sure to answer the question that was asked. 


FOR PRACTICE, TRY EXERCISE 63 


Here’s an activity that gives you a chance to apply what you have 
learned so far in this section. 


ACTIVITY Team challenge: The vending machine 
problem 


Ed Honowitz/The Image Bank/Getty Images 


In this activity, you will work in a team of three or four students to 
resolve a real-world problem. 

Have you ever purchased a hot drink from a vending machine? The 
intended sequence of events goes something like this: You insert your 
money into the machine and select your preferred beverage. A cup falls 
out of the machine, landing upright. Liquid pours out until the cup is 
nearly full. You reach in, grab the piping-hot cup, and drink happily. 

Except sometimes, things go wrong. The machine might swipe your 
money. Or the cup might fall over. More frequently, everything goes 
smoothly until the liquid begins to flow. It might stop flowing when the 
cup is only half full. Or the liquid might keep coming until your cup 
overflows. Neither of these results leaves you satisfied. 


The vending machine company wants to keep its customers happy. 
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So it has decided to hire you as a statistical consultant. The company 
provides you with the following summary of important facts about the 
vending machine: 


e Cups will hold 8 ounces. 


e The amount of liquid dispensed varies according to a Normal 
distribution centered at the mean p that is set in the machine. 


e o=0.2 ounceso = 0.2 ounces, 


If a cup contains too much liquid, a customer may get burned from a 
spill. This could result in an expensive lawsuit for the company. On the 
other hand, customers may be irritated if they get a cup with too little 
liquid from the machine. 

Given these issues, what mean setting for the machine would you 
recommend? Provide appropriate graphical and numerical evidence to 
support your conclusion. Be prepared to defend your answer. 


CHECK YOUR UNDERSTANDING 


A 


y High levels of cholesterol in the blood increase the risk of heart 
disease. For 14-year-old boys, the distribution of blood cholesterol is 
approximately Normal with mean p=1704 — 170 milligrams of 
cholesterol per deciliter of blood (mg/dl) and standard deviation 
0=30 mg/dl7 = 30 mg/dl What cholesterol level would place a 14- 
year-old boy at the 10th percentile of the distribution? 


Assessing Normality 


Normal distributions provide good models for some distributions of 
quantitative data. Examples include SAT and IQ test scores, the highway 
gas mileage of 2018 Corvette convertibles, weights of 9-ounce bags of 
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potato chips, and heights of 3-year-old girls (see Figure 2.19). 


eo 
Fa \ 
\ 
/ 


—=T | | =r 


82.5 86.5 90.5 94.5 98.5 102.5 106.5 
Heights of 3-year-old girls (cm) 


FIGURE 2.19 The heights of 3-year-old girls are approximately Normally 
distributed with a mean of 94.5 centimeters and standard deviation of 4 
centimeters. 


The distributions of other quantitative variables are skewed and 
therefore distinctly non-Normal. Examples include single-family home 
prices in a certain city, survival times of cancer patients after treatment, 
and number of siblings for students in a statistics class. 


While experience can suggest whether or not a Normal distribution is a 
reasonable model in a particular case, it is risky to assume that a 
distribution is approximately Normal without first analyzing the data. As 
in Chapter 1, we start with a graph and then add numerical summaries to 
assess the Normality of a distribution of quantitative data. 


If a graph of the data is clearly skewed, has multiple peaks, or isn’t 
bell-shaped, that’s evidence the distribution is not Normal. Here is a 
dotplot of the number of siblings reported by each student in a statistics 
class. This distribution is skewed to the right and therefore not 
approximately Normal. 


I 
) 4 5 6 7 
Number of siblings 


—-<—-eeeeeeeeeeeeeee 


NO eeeeeeeeeeeee 


Even if a graph of the data looks roughly symmetric and bell-shaped, 
we shouldn’t assume that the distribution is approximately Normal. The 
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68—95-99.7 rule can give additional evidence in favor of or against 
Normality. 

Figure 2.20 shows a dotplot and numerical summaries for data on 
calories per serving in 77 brands of breakfast cereal.2 The graph is roughly 
symmetric, single-peaked, and somewhat bell-shaped. Let’s count the 
number of data values within 1, 2, and 3 standard deviations of the mean: 


eeseeeee 


“eeeeeeseeeeoeeoeese 


ee 
ee @ 
T 


: 
Ld 
T T T 
60 80 100 120 140 160 


Calories 


n Mean SD Min Q, Med Q; Max 

77 106.883 19.484 50 100 110 110 160 
FIGURE 2.20 Dotplot and summary statistics for data on calories per serving in 
77 different brands of breakfast cereal. 


Mean + 1 SD: 106.883 + 1(19.484)87.399 to 126.36763 out of 77 = 
81.8%Mean + 2 SD: 106.883 + 2(19.484)67.915 to 145.85171 out of 77 = 
92.2%Mean + 3 SD: 106.883 + 3(19.484)48.431 to 165.33577 out of 77 = 


100.0% 
Mean + 1 SD: 106.883 + 1(19.484) 87.399 to 126.367 63 out of 77 = 81.8% 


Mean + 2 SD: 106.883 + 2(19.484) 67.915 to 145.851 71 out of 77 = 92.2% 
Mean + 3 SD: 106.883 + 3(19.484) 48.431 to 165.335 77 out of 77 = 100.0% 


In a Normal distribution, about 68% of the values fall within 1 
standard deviation of the mean. For the cereal data, almost 82% of the 
brands had between 87.399 and 126.367 calories. These two percentages 
are far apart. So this distribution of calories in breakfast cereals is not 
approximately Normal. 


EXAMPLE | Are IQ scores Normally distributed? » 
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| Assessing Normality 


Monkey Business Images/Shutterstock.com 


PROBLEM: Many people believe that the distribution of IQ scores 
follows a Normal distribution. Is that really the case? To find out, 
researchers obtained the IQ scores of 60 randomly selected fifth-grade 


students from one school. Here are the data:2 


81 82 89 90 94 96 97 100 101 £101 ~=~ 101 
102. :102——=S 102. 1103) 105~=—s :1106~=S— 108 )»3=6109 =6109 36109110 
110 110 412) d12) 3 gS. aa 14 4 IS «C6 
117) 117)—S 107) 118118122222 123-124 1241124 
1250 1260 «©1270 «©127)=«128) 130s 131) 133) 134) «134 ~—Ss«136 
137. 1389) 1389) 142) «145 


A histogram and summary statistics for the data are shown. Is this 
distribution of IQ scores of fifth-graders at this school approximately 
Normal? Justify your answer based on the graph and the 68—95-99.7 
rule. 
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80 90 100 110 120 130 140 =~ = 150 
IQ score 


n Mean SD Min Q,; Med Q3 Max 
60 114.983 14.801 81 104 114 125.5 145 


SOLUTION: 


The histogram looks roughly symmetric, single-peaked, and somewhat 
bell-shaped. The percents of values within 1, 2, and 3 standard 
deviations of the mean are 


Mean + 1 SD: 114.983 + 1(14.801) 100.182 to 129.78441 out of 60 = 
68.3%Mean + 2 SD: 114.983 + 2(14.801)85.381 to 144.58557 out of 60 
= 95.0% Mean + 3 SD: 114.983 + 


3(14.801)70.580 to 159.38660 out of 60 = 100.0% 
Mean + 1 SD: 114.983 + 1(14.801) 100.182 to 129.784 41 out of 60 = 68.3% 


Mean + 2 SD: 114.983 + 2(14.801) 85.381 to 144.585 57 out of 60 = 95.0% 
Mean + 3 SD: 114.983 + 3(14.801) 70.580 to 159.386 60 out of 60 = 100.0% 


These percents are very close to the 68%, 95%, and 99.7% targets for a 
Normal distribution. The graphical and numerical evidence suggests that 
this distribution of IQ scores is approximately Normal. 


FOR PRACTICE, TRY EXERCISE 75 


AP® EXAM TIP 


Never say that a distribution of quantitative data is Normal. Real-world 
data always show at least slight departures from a Normal distribution. 
The most you can say is that the distribution is “approximately 
Normal.” 
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Because the IQ data come from a random sample, we can use the 
sample mean IQ score to make an inference about the population mean IQ 
score of all fifth-graders at the school. As you will see in later chapters, the 
methods for inference about a population mean work best when the 
population distribution is Normal. Because the distribution of IQ scores in 
the sample is approximately Normal, it is reasonable to believe that the 
population distribution is approximately Normal. 


NORMAL PROBABILITY PLOTS A graph called a Normal 
probability plot (or a Normal quantile plot) provides a good assessment 
of whether or not a distribution of quantitative data is approximately 
Normal. 


DEFINITION Normal probability plot 


A Normal probability plot is a scatterplot of the ordered pair (data 
value, expected z-score) for each of the individuals in a quantitative 
data set. That is, the x-coordinate of each point is the actual data value 
and the y-coordinate is the expected z-score corresponding to the 
percentile of that data value in a standard Normal distribution. 


AP® EXAM TIP 


Normal probability plots are not included on the AP® Statistics topic 
outline. However, these graphs are very useful for assessing Normality. 


You may use them on the AP® Statistics exam if you wish—just be 
sure that you know what you’re looking for. 


Some software plots the data values on the horizontal axis and the z-scores on the 
vertical axis, while other software does just the reverse. The TI-83/84 gives you both 
options. We prefer the data values on the horizontal axis, which is consistent with 


other types of graphs we have made. 


Technology Corner 7 at the end of this subsection shows you how to 
make a Normal probability plot. For now, let’s focus on how to interpret 


317 


Normal probability plots. 


Figure 2.21 shows dotplots and Normal probability plots for each of 
the data sets in this subsection. 


e Panel (a): We confirmed earlier that the distribution of IQ scores is 
approximately Normal. Its Normal probability plot has a linear form. 


e Panel (b): The distribution of number of siblings is clearly right-skewed. 
Its Normal probability plot has a curved form. 


e Panel (c): We determined earlier that the distribution of calories in 
breakfast cereals is not approximately Normal, even though the graph 
looks roughly symmetric and somewhat bell-shaped. Its Normal 
probability plot has a different kind of nonlinear form. 
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FIGURE 2.21 Dotplot and Normal probability plot of (a) IQ scores for 60 
randomly selected fifth-graders from one school, (b) Number of siblings for each 
student in a college statistics class, and (c) Calories per serving in 77 brands of 
breakfast cereal. The distribution of IQ scores in (a) is approximately Normal 
because the Normal probability plot has a linear form. The nonlinear Normal 
probability plots in (b) and (c) confirm that neither of these distributions is 
approximately Normal. 


HOW TO ASSESS NORMALITY WITH A NORMAL 
PROBABILITY PLOT 


If the points on a Normal probability plot lie close to a straight line, the 
data are approximately Normally distributed. A nonlinear form in a 
Normal probability plot indicates a non-Normal distribution. 


When examining a Normal probability plot, look for shapes that show 
clear departures from Normality. Don’t overreact to minor wiggles in the 
plot. We used a TI-84 to generate three different random samples of size 
20 from a Normal distribution. The screen shots show Normal probability 
plots for each of the samples. Although none of the plots is perfectly 
linear, it is reasonable to believe that each sample came from a Normal 
population. 


Sample 1 Sample 2 Sample 3 


EXAMPLE |How Normal are survival times? » 
Interpreting Normal probability plots 


PROBLEM: Researchers recorded the survival times in days of 72 
guinea pigs after they were injected with infectious bacteria in a medical 
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experiment.2 A Normal probability plot of the data is shown. Use the 
graph to determine if the distribution of survival times is approximately 
Normal. 


3 a | 
5 5 : 
in eo e ad 
—) & 
g 17 “ 
% ¢” 
s 0 
v 
b 
| 

é 

eR : 

—3 

0 100 200 300 400 500 600 


Survival time (days) 


SOLUTION: 


The Normal probability plot is clearly curved, indicating that the 
distribution of survival time for the 72 guinea pigs is not approximately 
Normal. 


FOR PRACTICE, TRY EXERCISE 79 


Think About It 


HOW CAN WE DETERMINE SHAPE FROM A NORMAL 
PROBABILITY PLOT? Look at the Normal probability plot of the 
guinea pig survival data in the example. Imagine all the points falling 
down onto the horizontal axis. The resulting dotplot would have many 
values stacked up between 50 and 150 days, and fewer values that are 
further spread apart from 150 to about 600 days. The distribution would 
be skewed to the right due to the greater variability in the upper half of 
the data set. The dotplot of the data confirms our answer. 
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7. Technology Corner |MAKING NORMAL 
PROBABILITY PLOTS (>) 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


Let’s use the TI-83/84 to make a Normal probability plot for the IQ 
score data (page 132). 


1. Enter the data values in list L1. 


e Press |STAT| and choose Edit.... 


e Type the values into list Lj. 


MORMAL FLOAT AUTO REAL DEGREE MP ) 


Plot2 Plot3 


Data List:Li 
Data Axis: Y 
Mark :‘B«- . 
Color ‘i= 


2. Set up a Normal probability plot in the statistics plots menu. 


e Press (STAT PLOT). 
e Press or |1/ to go into Plot1. 


e Adjust the settings as shown. 


ORHAL FLOAT AUTO REAL DEGREE MP 
PloticLa 


Y="2.39398 


3. Use ZoomStat to see the finished graph. 
Interpretation: The Normal probability plot is quite linear, which 
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confirms our earlier belief that the distribution of IQ scores is 
approximately Normal. 


Section 2.2. Summary 


e We can describe the overall pattern of a distribution by a density curve. 
A density curve always remains on or above the horizontal axis and has 
total area 1 underneath it. An area under a density curve estimates the 
proportion of observations that fall in an interval of values. 


e A density curve is an idealized description of the overall pattern of a 
distribution that smooths out the irregularities in the actual data. We 
write the mean of a density curve as p and the standard deviation of a 
density curve as o to distinguish them from the mean x @ and the 
standard deviation s, of the actual data. 


e The mean and the median of a density curve can be located by eye. The 
mean p/ is the balance point of the curve. The median divides the area 
under the curve in half. The standard deviation o cannot be located by 
eye on most density curves. 


e The mean and median are equal for symmetric density curves. The mean 
of a skewed density curve is located farther toward the long tail than the 
median is. 


e Normal distributions are described by a special family of bell-shaped, 
symmetric density curves, called Normal curves. The mean p and 
standard deviation o completely specify a Normal distribution. The 
mean is the center of the curve, and o is the distance from p to the 
change-of-curvature points on either side. 


e The 68—95-99.7 rule describes what percent of observations in any 
Normal distribution fall within 1, 2, and 3 standard deviations of the 
mean. 


e All Normal distributions are the same when observations are 
standardized. If x follows a Normal distribution with mean p and 
standard deviation 0, we can standardize using 


322 


r—p 


Z=X-[lO o 


The variable z has the standard Normal distribution with mean 0 and 
standard deviation 1. 


Table A in the back of the book gives percentiles for the standard 
Normal distribution. You can use Table A or technology to determine 
area for given values of the variable or the value that corresponds to a 
given percentile in any Normal distribution. 


To find the area in a Normal distribution corresponding to given values: 
Step 1: Draw a Normal distribution with the horizontal axis labeled 
and scaled using the mean and standard deviation, the boundary value(s) 
clearly identified, and the area of interest shaded. 
Step 2: Perform calculations—show your work! Do one of the 
following: 

i. Standardize each boundary value and use Table A or technology to 

find the desired area under the standard Normal curve; or 
ii. Use technology to find the desired area without standardizing. 


Be sure to answer the question that was asked. 


To find the value in a Normal distribution corresponding to a given 
percentile (area): 

Step 1: Draw a Normal distribution with the horizontal axis labeled 
and scaled using the mean and standard deviation, the area of interest 
shaded, and unknown boundary value clearly marked. 

Step 2: Perform calculations—show your work! Do one of the 
following: 

i. Use Table A or technology to find the value of z with the indicated 
area under the standard Normal curve, then “unstandardize” to 
transform back to the original distribution; or 

ii. Use technology to find the desired area without standardizing. 
Be sure to answer the question that was asked. 


To assess Normality for a given set of quantitative data, we first observe 
the shape of a dotplot, stemplot, or histogram. Then we can check how 
well the data fit the 68—95—99.7 rule for Normal distributions. Another 
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good method for assessing Normality is to construct a Normal 
probability plot. If the Normal probability plot has a linear form, then 
we can say that the distribution is approximately Normal. 


2.2 ‘Technology Corner (> 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 
5. Finding areas from values in a Normal distribution 


6. Finding values from areas in a Normal Page 121 
distribution Page 128 


7. Making Normal probability plots Page 136 


Section 2.2 Exercises 


41. pg 111 |») Where’s the bus? Sally takes the same bus to work 


every moming. The amount of time (in minutes) that she has to wait 
for the bus can be modeled by a uniform distribution on the interval 
from 0 minutes to 10 minutes. 


a. Draw a density curve to model the amount of time that Sally has to 
wait for the bus. Be sure to include scales on both axes. 


b. On about what percent of days does Sally wait between 2.5 and 5.3 
minutes for the bus? 


c. Find the 70th percentile of Sally’s wait times. 


42. Still waiting for the server? How does your web browser get a file 
from the Internet? Your computer sends a request for the file to a web 
server, and the web server sends back a response. For one particular 
web server, the time (in seconds) after the start of an hour at which a 
request is received can be modeled by a uniform distribution on the 
interval from 0 to 3600 seconds. 
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a. Draw a density curve to model the amount of time after an hour at 
which a request is received by the web server. Be sure to include 
scales on both axes. 


b. About what proportion of requests are received within the first 5 
minutes (300 seconds) after the hour? 


c. Find the interquartile range of this distribution. 


43. Quick, click! An Internet reaction time test asks subjects to click 
their mouse button as soon as a light flashes on the screen. The light 
is programmed to go on at a randomly selected time after the subject 
clicks “Start.” The density curve models the amount of time the 
subject has to wait for the light to flash. 


0 1 2 3 4 5 
Time (sec) until the light flashes 


a. What height must the density curve have? Justify your answer. 


b. About what percent of the time will the light flash more than 3.75 
seconds after the subject clicks “Start”? 


c. Calculate and interpret the 38th percentile of this distribution. 


44. Class is over! Mr. Shrager does not always let his statistics class out 
on time. In fact, he seems to end class according to his own “internal 
clock.” The density curve models the distribution of the amount of 
time after class ends (in minutes) when Mr. Shrager dismisses the 
class. (A negative value indicates he ended class early.) 


-1 0 4 


Dismissal time (minutes after class ends) 


a. What height must the density curve have? Justify your answer. 
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b. About what proportion of the time does Mr. Shrager dismiss class 
within 1 minute of its scheduled end time? 


c. Calculate and interpret the 20th percentile of the distribution. 


45. pg 112 |») Mean and median The figure displays two density 


46. 


47. 


48. 


curves that model different distributions of quantitative data. Identify 
the location of the mean and median by letter for each graph. Justify 
your answers. 


(a) ABC : (b)a Bc 
Mean and median The figure displays two density curves that model 


different distributions of quantitative data. Identify the location of the 
mean and median by letter for each graph. Justify your answers. 


pg 115 |») Potato chips The weights of 9-ounce bags of a 


particular brand of potato chips can be modeled by a Normal 
distribution with mean p=9.124 — 9.12 ounces and standard deviation 
o=0.05 ounceg = 0.05 ounce, Sketch the Normal density curve. Label 
the mean and the points that are 1, 2, and 3 standard deviations from 
the mean. 


Batter up! In baseball, a player’s batting average is the proportion of 
times the player gets a hit out of his total number of times at bat. The 
distribution of batting averages in a recent season for Major League 
Baseball players with at least 100 plate appearances can be modeled 
by a Normal distribution with mean p=0.2614 — 0.261 and standard 
deviation o=0.034¢ = 0.034, Sketch the Normal density curve. Label 
the mean and the points that are 1, 2, and 3 standard deviations from 
the mean. 
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49. 


50. 


ol. 


52. 


53. 


Normal curve Estimate the mean and standard deviation of the 
Normal density curve below. 


== ar (aa a | ae ae ee ee A (| 
3 4 5 € 7 £8 POH Rb 4 SS 16 17 


Normal curve Estimate the mean and standard deviation of the 
Normal density curve below. 


pg 118 |») Potato chips Refer to Exercise 47. Use the 68-95—99.7 
rule to answer the following questions. 


a. About what percent of bags weigh less than 9.02 ounces? Show 
your method clearly. 


b. A bag that weighs 9.07 ounces is at about what percentile in this 
distribution? Justify your answer. 


Batter up! Refer to Exercise 48. Use the 68—95—99.7 rule to answer 
the following questions. 


a. About what percent of Major League Baseball players with 100 
plate appearances had batting averages of 0.363 or higher? Show 
your method clearly. 


b. A player with a batting average of 0.227 is at about what percentile 
in this distribution? Justify your answer. 
pg 122 |») Potato chips Refer to Exercise 47. About what percent 


of 9-ounce bags of this brand of potato chips weigh less than the 
advertised 9 ounces? Is this likely to pose a problem for the company 
that produces these chips? 
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54. Batter up! Refer to Exercise 48. A player with a batting average 
below 0.200 is at risk of sitting on the bench during important games. 
About what percent of players are at risk? 


55. pg 124 |) Watch the salt! A study investigated about 3000 meals 


ordered from Chipotle restaurants using the online site Grubhub. 
Researchers calculated the sodium content (in milligrams) for each 
order based on Chipotle’s published nutrition information. The 
distribution of sodium content is approximately Normal with mean 
2000 mg and standard deviation 500 mg.12 About what percent of the 
meals ordered exceeded the recommended daily allowance of 2400 
mg of sodium? 


56. Blood pressure According to a health information website, the 
distribution of adults’ diastolic blood pressure (in millimeters of 
mercury) can be modeled by a Normal distribution with mean 70 and 
standard deviation 20. A diastolic pressure above 100 for an adult is 
classified as very high blood pressure. About what proportion of 
adults have very high blood pressure according to this criterion? 


97. pg 126 |») Watch the salt! Refer to Exercise 55. About what 
percent of the meals ordered contained between 1200 mg and 1800 


mg of sodium? 


58. Blood pressure Refer to Exercise 56. According to the same health 
information website, a diastolic blood pressure between 80 and 90 
indicates borderline high blood pressure. About what percent of 
adults have borderline high blood pressure? 


59. Standard Normal areas Find the proportion of observations in a 
standard Normal distribution that satisfies each of the following 
statements. 


a. z>-1.662 > —1.66 
b. -1.66<z<2.85—1.66 < z < 2.85 


60. Standard Normal areas Find the proportion of observations in a 
standard Normal distribution that satisfies each of the following 
statements. 
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61. 


62. 


63. 


64. 


65. 


66. 


a. Z<-2.462 < —2.46 
b. 0.89<z<2.460.89 < z < 2.46 


Sudoku Mrs. Starnes enjoys doing Sudoku puzzles. The time she 
takes to complete an easy puzzle can be modeled by a Normal 
distribution with mean 5.3 minutes and standard deviation 0.9 minute. 


a. What proportion of the time does Mrs. Starnes finish an easy 
Sudoku puzzle in less than 3 minutes? 


b. How often does it take Mrs. Starnes more than 6 minutes to 
complete an easy puzzle? 


c. What percent of easy Sudoku puzzles take Mrs. Starnes between 6 
and 8 minutes to complete? 


Hit an ace! Professional tennis player Novak Djokovic hits the ball 
extremely hard. His first-serve speeds can be modeled by a Normal 
distribution with mean 112 miles per hour (mph) and standard 
deviation 5 mph. 


a. How often does Djokovic hit his first serve faster than 120 mph? 
b. What percent of Djokovic’s first serves are slower than 100 mph? 


c. What proportion of Djokovic’s first serves have speeds between 
100 and 110 mph? 

pg 129 |») Sudoku Refer to Exercise 61. Find the 20th percentile 

of Mrs. Starnes’s Sudoku times for easy puzzles. 


Hit an ace! Refer to Exercise 62. Find the 85th percentile of 
Djokovic’s first-serve speeds. 


Deciles The deciles of any distribution are the values at the 10th, 
20th, ... , 90th percentiles. The first and last deciles are the 10th and 
the 90th percentiles, respectively. What are the first and last deciles of 
the standard Normal distribution? 


Outliers The percent of the observations that are classified as outliers 
by the 1.5xIQRI-5 x JQR rule is the same in any Normal 
distribution. What is this percent? Show your method clearly. 
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67. IQ test scores Scores on the Wechsler Adult Intelligence Scale (an 
IQ test) for the 20- to 34-year-old age group are approximately 
Normally distributed with p=1104 = 110 and o=250 = 25, 


a. What percent of people aged 20 to 34 have IQs between 125 and 
150? 


b. MENSA is an elite organization that admits as members people 
who score in the top 2% on IQ tests. What score on the Wechsler 
Adult Intelligence Scale would an individual aged 20 to 34 have to 
earn to qualify for MENSA membership? 


68. Post office A local post office weighs outgoing mail and finds that 
the weights of first-class letters are approximately Normally 
distributed with a mean of 0.69 ounce and a standard deviation of 
0.16 ounce. 


a. Estimate the 60th percentile of first-class letter weights. 


b. First-class letters weighing more than 1 ounce require extra 
postage. What proportion of first-class letters at this post office 
require extra postage? 


Exercises 69 and 70 refer to the following setting. At some fast-food 
restaurants, customers who want a lid for their drinks get them from a 
large stack near the straws, napkins, and condiments. The lids are made 
with a small amount of flexibility so they can be stretched across the 
mouth of the cup and then snugly secured. When lids are too small or too 
large, customers can get very frustrated, especially if they end up spilling 
their drinks. At one particular restaurant, large drink cups require lids with 
a “diameter” of between 3.95 and 4.05 inches. The restaurant’s lid supplier 
claims that the diameter of its large lids follows a Normal distribution with 
mean 3.98 inches and standard deviation 0.02 inch. Assume that the 
supplier’s claim is true. 


69. Put a lid on it! 
a. What percent of large lids are too small to fit? 


b. What percent of large lids are too big to fit? 
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70. 


71. 


des 


ioe 


c. Compare your answers to parts (a) and (b). Does it make sense for 
the lid manufacturer to try to make one of these values larger than 
the other? Why or why not? 


Put a lid on it! The supplier is considering two changes to reduce to 
1% the percentage of its large-cup lids that are too small. One 
strategy is to adjust the mean diameter of its lids. Another option is to 
alter the production process, thereby decreasing the standard 
deviation of the lid diameters. 


a. If the standard deviation remains at o=0.020 = 0.02 inch, at what 
value should the supplier set the mean diameter of its large-cup 
lids so that only 1% are too small to fit? 


b. If the mean diameter stays at p=3.984 — 3.98 inches, what value of 


the standard deviation will result in only 1% of lids that are too 
small to fit? 


c. Which of the two options in parts (a) and (b) do you think is 
preferable? Justify your answer. (Be sure to consider the effect of 
these changes on the percent of lids that are too large to fit.) 


Flight times An airline flies the same route at the same time each 
day. The flight time varies according to a Normal distribution with 
unknown mean and standard deviation. On 15% of days, the flight 
takes more than an hour. On 3% of days, the flight lasts 75 minutes or 
more. Use this information to determine the mean and standard 
deviation of the flight time distribution. 


Brush your teeth The amount of time Ricardo spends brushing his 
teeth follows a Normal distribution with unknown mean and standard 
deviation. Ricardo spends less than 1 minute brushing his teeth about 
40% of the time. He spends more than 2 minutes brushing his teeth 
2% of the time. Use this information to determine the mean and 
standard deviation of this distribution. 


Normal highway driving? The dotplot shows the EPA highway gas 
mileage estimates in miles per gallon (mpg) for a random sample of 

21 model year 2018 midsize cars! Explain why this distribution of 
highway gas mileage is not approximately Normal. 
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Highway gas mileage (mpg) 


74. Normal to be foreign born? The histogram displays the percent of 


73s 


foreign-born residents in each of the 50 states.14 Explain why this 
distribution of the percent of foreign-born residents in the states is not 
approximately Normal. 


20 - 


Number of states 


0 5 10 15 20 25 30 
Percent foreign-born residents 


pg 132 |») Refrigerators Consumer Reports magazine collected 


data on the usable capacity (in cubic feet) of a sample of 36 side-by- 
side refrigerators. Here are the data: 


12.9 13.7 141 142 145 145 146 14.7 15.1 15.22 153 15.3 
15.3 153 155 156 15.6 15.8 160 160 162 162 163 164 
16.5 166 166 166 168 17.0 170 17.2 174 17.4 17.9 18.4 


A histogram of the data and summary statistics are shown here. Is this 
distribution of refrigerator capacities approximately Normal? Justify 
your answer based on the graph and the 68—95—99.7 rule. 
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76. 


74s 


Frequency 


12 13 14 15 16 17 18 19 
Usable capacity (ft) 


n Mean SD Min Q; Med Q3; Max 
36 15.825 1.217 12.9 15.15 159 16.6 18.4 


Big sharks Here are the lengths (in feet) of 44 great white sharks:12 


18.7. 123 186 164 15.7 183 146 158 149 17.6 12.1 
16.4 16.7 17.8 162 126 178 138 12.22 15.2 14.7 12.4 
13.2 158 143 166 94 182 132 136 153 161 13.5 
19.1 16.2 228 168 136 13.2 15.7 19.7) 18.7 13.2 168 


A dotplot of the data and summary statistics are shown below. Is this 
distribution of shark length approximately Normal? Justify your 
answer based on the graph and the 68—95—99.7 rule. 


. 
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8 10 12 14 16 18 20 22 
Length (ft) 
n Mean SD Min Q) Med Q3 Max 


44 15.586 2.00 9.4 13.55 15.75 17.2 22.8 


Is Michigan Normal? We collected data on the tuition charged by 
colleges and universities in Michigan. Here are some numerical 
summaries for the data: 


Mean SD Min Max 
10,614 8049 1873 30,823 


Based on the relationship between the mean, standard deviation, 
minimum, and maximum, is it reasonable to believe that the 
distribution of Michigan tuitions is approximately Normal? Explain 
your answer. 
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78. 


79. 


80. 


Are body weights Normal? The heights of people of the same 
gender and similar ages follow Normal distributions reasonably 
closely. How about body weights? The weights of women aged 20 to 
29 have mean 141.7 pounds and median 133.2 pounds. The first and 
third quartiles are 118.3 pounds and 157.3 pounds. Is it reasonable to 
believe that the distribution of body weights for women aged 20 to 29 
is approximately Normal? Explain your answer. 


pg 135 |») Runners’ heart rates The following figure is a Normal 


probability plot of the heart rates of 200 male runners after 6 minutes 
of exercise on a treadmill.44 Use the graph to determine if this 
distribution of heart rates is approximately Normal. 


3 


2 7 


Expected z-score 


70 80 90 100 110 120 130 140 150 
Heart rate (beats per minute) 
Carbon dioxide emissions The following figure is a Normal 
probability plot of the emissions of carbon dioxide (CO) per person 
in 48 countries.42 Use the graph to determine if this distribution of 
CO), emissions is approximately Normal. 


Expected z-score 
o 
| 
% 
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CO, emissions (metric tons per person) 
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81. Normal states? The Normal probability plot displays data on the 
areas (in thousands of square miles) of each of the 50 states. Use the 
graph to determine if the distribution of land area is approximately 
Normal. 


Expected z-score 


C- @ @e 


200 400 600 

Area (thousands of mi”) 

82. Density of the earth In 1798, the English scientist Henry Cavendish 
measured the density of the earth several times by careful work with a 
torsion balance. The variable recorded was the density of the earth as 
a multiple of the density of water. A Normal probability plot of the 
data is shown.2® Use the graph to determine if this distribution of 
density measurement is approximately Normal. 
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Density of earth 


83. Refrigerators Refer to Exercise 75. 


a. Use your calculator to make a Normal probability plot of the data. 
Sketch this graph on your paper. 


b. What does the graph in part (a) imply about whether the 
distribution of refrigerator capacity is approximately Normal? 
Explain. 


84. Big sharks Refer to Exercise 76. 
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a. Use your calculator to make a Normal probability plot of the data. 
Sketch this graph on your paper. 


b. What does the graph in part (a) imply about whether the 
distribution of shark length is approximately Normal? Explain. 


Multiple Choice Select the best answer for Exercises 85—90. 


85. Two measures of center are marked on the density curve shown. 
Which of the following is correct? 
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a. The median is at the yellow line and the mean is at the red line. 
b. The median is at the red line and the mean is at the yellow line. 
c. The mode is at the red line and the median is at the yellow line. 
d. The mode is at the yellow line and the median is at the red line. 


e. The mode is at the red line and the mean is at the yellow line. 


Exercises 86—88 refer to the following setting. The weights of 
laboratory cockroaches can be modeled with a Normal distribution 
having mean 80 grams and standard deviation 2 grams. The 
following figure is the Normal curve for this distribution of 
weights. 
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86. 


87. 


88. 
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Point C on this Normal curve corresponds to 
a. 84 grams. 
b. 82 grams. 
c. 78 grams. 
d. 76 grams. 


e. 74 grams. 


About what percent of the cockroaches have weights between 76 and 
84 grams? 


a. 99.7% 
b. 95% 
c. 68% 
d. 47.5% 
e. 34% 


About what proportion of the cockroaches will have weights greater 
than 83 grams? 


a. 0.0228 
b. 0.0668 
c. 0.1587 
d. 0.9332 


go/ 


e. 0.0772 


89. A different species of cockroach has weights that are approximately 


90. 


Normally distributed with a mean of 50 grams. After measuring the 
weights of many of these cockroaches, a lab assistant reports that 
14% of the cockroaches weigh more than 55 grams. Based on this 
report, what is the approximate standard deviation of weight for this 
species of cockroaches? 


a. 4.6 
b. 5.0 
c. 6.2 
d. 14.0 


e. Cannot determine without more information. 


The following Normal probability plot shows the distribution of 
points scored for the 551 players in a single NBA season. 


3 a 
2 nil 


Expected z-score 


T T l T 
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Points 


If the distribution of points was displayed in a histogram, what would 
be the best description of the histogram’s shape? 


a. Approximately Normal 

b. Symmetric but not approximately Normal 
c. Skewed left 

d. Skewed right 


e. Cannot be determined 
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Recycle and Review 


91. Making money (2.1) The parallel dotplots show the total family 
income of randomly chosen individuals from Indiana (38 individuals) 
and New Jersey (44 individuals). Means and standard deviations are 
given below the dotplots. 


Indiana **** 


ee ee ee ~ e 
New Jersey pee Socssecrs os ece. rs : e 
0 25,000 50,000 75,000 100,000 125,000 150,000 175,000 


Total family income ($) 


Standard 
deviation 


Indiana $47,400 $29,400 
New Jersey $58,100 $41,900 


Mean 


Consider individuals in each state with total family incomes of 
$95,000. Which individual has a higher income, relative to others in 
his or her state? Use percentiles and z-scores to support your answer. 


92. More money (1.3) Refer to Exercise 91. 


a. How do the ranges of the two distributions compare? Justify your 
answer. 


b. Explain why the standard deviation of the total family incomes in 
the New Jersey sample is so much larger than for the Indiana 
sample. 
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— FRAPPY! FREE RESPONSE 


AP® PROBLEM, 
YAY! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, concise 
response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you 
use, because you will be scored on the correctness of your methods 
as well as on the accuracy and completeness of your results and 
explanations. 


The distribution of scores on a recent test closely followed a Normal 
distribution with a mean of 22 points and a standard deviation of 4 
points. 


a. What proportion of the students scored at least 25 points on 
this test? 


b. What is the 31st percentile of the distribution of test scores? 


c. The teacher wants to transform the test scores so that they have 
an approximately Normal distribution with a mean of 80 points 
and a standard deviation of 10 points. To do this, she will use a 
formula in the form 


new score=a+b (old score)" Seore = 4 + b(old score) 


Find the values of a and b that the teacher should use to 
transform the distribution of test scores. 
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d. Before the test, the teacher gave a review assignment for 
homework. The maximum score on the assignment was 10 
points. The distribution of scores on this assignment had a 
mean of 9.2 points and a standard deviation of 2.1 points. 
Would it be appropriate to use a Normal distribution to 
calculate the proportion of students who scored below 7 points 
on this assignment? Explain your answer. 


After you finish, you can view two example solutions on the book’s 
website (highschool.bfwpub.com/tps6e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 


“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide a scoring rubric. Score your response and note what, if 
anything, you would do differently to improve your own score. 
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Chapter 2 Review 


Section 2.1: Describing Location in a Distribution 


In this section, you learned two different ways to describe the location of 
individuals in a distribution: percentiles and standardized scores (z-scores). 
Percentiles describe the location of an individual by measuring what 
percent of the observations in the distribution have a value less than the 
individual’s value. A cumulative relative frequency graph is a handy tool 
for identifying percentiles in a distribution. You can use it to estimate the 
percentile for a particular value of a variable or to estimate the value of the 
variable at a particular percentile. 


Standardized scores (z-scores) describe the location of an individual in 
a distribution by measuring how many standard deviations the individual is 
above or below the mean. To find the standardized score for a particular 
observation, transform the value by subtracting the mean and then dividing 
the difference by the standard deviation. Besides describing the location of 
an individual in a distribution, you can also use z-scores to compare 
observations from different distributions—standardizing the values puts 
them on a standard scale. 


You also learned to describe the effects on the shape, center, and 
variability of a distribution when transforming data from one scale to 
another. Adding a positive constant to (or subtracting it from) each value 
in a data set changes the measures of center and location, but not the shape 
or variability of the distribution. Multiplying or dividing each value ina 
data set by a positive constant changes the measures of center and location 
and measures of variability, but not the shape of the distribution. 


Section 2.2: Density Curves and Normal Distributions 

In this section, you learned how density curves are used to model 
distributions of data. An area under a density curve estimates the 
proportion of observations that fall in a specified interval of values. The 
total area under a density curve is 1, or 100%. 
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The most commonly used density curve is called a Normal curve. The 
Normal curve is symmetric, single-peaked, and bell-shaped with mean p 
and standard deviation o. For any distribution of data that is approximately 
Normal in shape, about 68% of the observations will be within 1 standard 
deviation of the mean, about 95% of the observations will be within 2 
standard deviations of the mean, and about 99.7% of the observations will 
be within 3 standard deviations of the mean. Conveniently, this 
relationship is called the 68—95—99.7 rule. 


When observations do not fall exactly 1, 2, or 3 standard deviations 
from the mean, you learned how to use Table A or technology to identify 
the proportion of values in any specified interval under a Normal curve. 
You also learned how to use Table A or technology to determine the value 
of an individual that falls at a specified percentile in a Normal distribution. 


On the AP® Statistics exam, it is extremely important that you clearly 
communicate your methods when answering questions that involve a 
Normal distribution. Shading a Normal curve with the mean, standard 
deviation, and boundaries clearly identified is a great start. If you use 
technology to perform calculations, be sure to label the inputs of your 
calculator commands. 

Finally, you learned how to determine if a distribution of data is 
approximately Normal using graphs (dotplots, stemplots, histograms) and 
the 68—95-—99.7 rule. You also learned that a Normal probability plot is a 
great way to determine whether the shape of a distribution is 
approximately Normal. The more linear the Normal probability plot, the 
more Normal the distribution of the data. 


What Did You Learn? 


Related Relevant 
Example Chapter 


on Review 
Learning Target Section Page(s) Exercise(s) 
Find and interpret the percentile of an eel 92 R2.1, 
individual value in a distribution of data. R2.3(c) 
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Estimate percentiles and individual values 
using a cumulative relative frequency 
graph. 

Find and interpret the standardized score 
(z-score) of an individual value in a 
distribution of data. 


Describe the effect of adding, subtracting, 
multiplying by, or dividing by a constant 
on the shape, center, and variability of a 
distribution of data. 


Use a density curve to model distributions 
of quantitative data. 


Identify the relative locations of the mean 
and median of a distribution from a density 
curve. 


Use the 68—95-—99.7 rule to estimate (i) the 
proportion of values in a specified interval, 
or (ii) the value that corresponds to a given 
percentile in a Normal distribution. 


Find the proportion of values in a specified 
interval in a Normal distribution using 
Table A or technology. 


Find the value that corresponds to a given 
percentile in a Normal distribution using 
Table A or technology. 


Determine whether a distribution of data is 
approximately Normal from graphical and 
numerical evidence. 
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Chapter 2 Review Exercises 


These exercises are designed to help you review the important ideas and 
methods of the chapter. 


R2.1 |) Is Paul tall? According to the National Center for Health 


Statistics, the distribution of heights for 15-year-old males has a 
mean of 170 centimeters (cm) and a standard deviation of 7.5 cm. 
Paul is 15 years old and 179 cm tall. 


a. Find the z-score corresponding to Paul’s height. Explain what 
this value means. 


b. Paul’s height puts him at the 85th percentile among 15-year-old 
males. Explain what this means to someone who knows no 
Statistics. 

R2.2 |») Computer use Mrs. Causey asked her students how much time 


they had spent watching television during the previous week. The 
figure shows a cumulative relative frequency graph of her students’ 
responses. 


a. At what percentile is a student who watched TV for 7 hours last 
week? 


b. Estimate from the graph the interquartile range (IQR) for time 
spent watching TV. 
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Cumulative relative frequency (%) 


Time spent watching TV (h/wk) 


R2.3 >» Aussie, Aussie, Aussie A group of Australian students were 


asked to estimate the width of their classroom in feet. Use the 
dotplot and summary statistics to answer the following questions. 


5 Jeecccccce 


» 
o 


Guess (ft) 
Variable Mean SD Min Q, Med Qs Max 
Guess (ft) 66 43.70 12.50 24.00 35.50 42.00 48.00 94.00 


a. Suppose we converted each student’s guess from feet to meters 
(3.28 ft=1 m3.28 ft = 1 m), How would the shape of the 
distribution be affected? Find the mean, median, standard 
deviation, and JQR for the transformed data. 


b. The actual width of the room was 42.6 feet. Suppose we 
calculated the error in each student’s guess as follows: guess — 
42.6. Find the mean and standard deviation of the errors in feet. 


c. Find the percentile for the student who estimated the classroom 
width as 63 feet. 


R2.4 |») Density curves The following figure is a density curve that 


models a distribution of quantitative data. Trace the curve onto your 
paper. 
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a. What percent of observations have values less than 13? Justify 
your answer. 


b. Mark the approximate location of the median. Explain your 
choice of location. 


c. Mark the approximate location of the mean. Explain your choice 
of location. 


R2.5 |») Low-birth-weight babies Researchers in Norway analyzed data 
on the birth weights of 400,000 newborns over a 6-year period. The 
distribution of birth weights is approximately Normal with a mean 


of 3668 grams and a standard deviation of 511 grams./ Babies that 
weigh less than 2500 grams at birth are classified as “low birth 


weight.” 
a. Fill in the blanks: About 99.7% of the babies had birth weights 
between and grams. 


b. What percent of babies will be identified as low birth weight? 
c. Find the quartiles of the birth weight distribution. 
R2.6 \_») Acing the GRE The Graduate Record Examinations (GREs) are 


widely used to help predict the performance of applicants to 
graduate schools. The scores on the GRE Chemistry test are 
approximately Normal with mean = 694 and standard deviation = 
112. 


a. Approximately what percent of test takers earn a score less than 
500 or greater than 900 on the GRE Chemistry test? 


b. Estimate the 99th percentile score on the GRE Chemistry test. 
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R2.7 |) Ketchup A fast-food restaurant has just installed a new 


automatic ketchup dispenser for use in preparing its burgers. The 
amount of ketchup dispensed by the machine can be modeled by a 
Normal distribution with mean 1.05 ounces and standard deviation 
0.08 ounce. 


a. If the restaurant’s goal is to put between 1 and 1.2 ounces of 
ketchup on each burger, about what percent of the time will this 
happen? 


b. Suppose that the manager adjusts the machine’s settings so that 
the mean amount of ketchup dispensed is 1.1 ounces. How much 
does the machine’s standard deviation have to be reduced to 
ensure that at least 99% of the restaurant’s burgers have between 
1 and 1.2 ounces of ketchup on them? 


R2.8 |p) Where the old folks live Here are a stemplot and numerical 


summaries of the percents of residents aged 65 and older in the 50 
states and the District of Columbia. Is this distribution of the percent 
of state residents who are age 65 and older approximately Normal? 
Justify your answer based on the graph and the 68—95-99.7 rule. 


7 Key: 12|2 represents 
a state in which 


0 12.2% of residents 
379 are 65 and older. 
44 


02333445899 
02234555557788889 
012334445689 


exh aod ood aod od od od od 
NOUEPWN-—ODOON 


n Mean SD Min Q; Med Q3 Max 
51 13.255 1.668 7.7 12.4 13.5 143 17.3 


R2.9 |) Assessing Normality Catherine and Ana gave an online reflex 


test to 33 varsity athletes at their school. The following Normal 
probability plot displays the data on reaction times (in milliseconds) 
for these students. Is the distribution of reaction times for these 
athletes approximately Normal? Why or why not? 
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Chapter 2 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T2.1 Many professional schools require applicants to take a 
standardized test. Suppose that 1000 students take such a test. 
Several weeks after the test, Pete receives his score report: he got a 
63, which placed him at the 73rd percentile. This means that 


a. Pete’s score was below the median. 

b. Pete did worse than about 63% of the test takers. 
c. Pete did worse than about 73% of the test takers. 
d. Pete did better than about 63% of the test takers. 
e. Pete did better than about 73% of the test takers. 


T2.2 For the Normal distribution shown, the standard deviation is 
closest to 


Ts p 


0 
1 
c. 2. 
3 
e. 5. 


T2.3 Rainwater was collected in water containers at 30 different sites 
near an industrial complex, and the amount of acidity (pH level) 
was measured. The mean and standard deviation of the values are 
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4.60 and 1.10, respectively. When the pH meter was recalibrated 
back at the laboratory, it was found to be in error. The error can be 
corrected by adding 0.1 pH unit to all of the values and then 
multiplying the result by 1.2. What are the mean and standard 
deviation of the corrected pH measurements? 


a. 5.64, 1.44 
b. 

C. 
d. 


e. 


9.64, 1.32 
5.40, 1.44 
5.40, 1.32 
5.64, 1.20 


T2.4 The figure shows a cumulative relative frequency graph of the 
number of ounces of alcohol consumed per week in a sample of 
150 adults who report drinking alcohol occasionally. About what 
percent of these adults consume between 4 and 8 ounces per week? 


ST p 


e. 


Cumulative relative 
frequency (%) 


20% 
40% 
50% 
60% 
80% 


100 4 


80 4 


mal 
o 


> 
i) 
! 


T T T 


T T T 
4 6 8 10 12 14 16 18 


Consumption (0z) 


T2.5 The average yearly snowfall in Chillyville is approximately 
Normally distributed with a mean of 55 inches. If the snowfall in 
Chillyville exceeds 60 inches in 15% of the years, what is the 


standard deviation? 


a. 4.83 inches 
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b. 5.18 inches 
6.04 inches 
d. 8.93 inches 


A 


e. The standard deviation cannot be computed from the given 
information. 


T2.6 The figure shown is the density curve of a distribution. Seven 
values are marked on the density curve. Which of the following 
statements is true? 


A = B c D EFG 
The mean of the distribution is E. 


The area between B and F is 0.50. 


o f 


c. The median of the distribution is C. 
d. The 3rd quartile of the distribution is D. 
e. The area under the curve between A and G is 1. 


T2.7 If the heights of a population of men are approximately Normally 
distributed, and the middle 99.7% have heights between 5'0” and 
7'0”, what is the standard deviation of the heights in this 
population? 


a 1" 
b. 3” 
c. 4" 
d. 6" 
a 
T2.8 The distribution of the time it takes for different people to solve a 
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certain crossword puzzle is strongly skewed to the right with a 
mean of 30 minutes and a standard deviation of 15 minutes. The 
distribution of z-scores for those times is 


a. Normally distributed with mean 30 and SD 15. 
b. skewed to the right with mean 30 and SD 15. 
c. Normally distributed with mean 0 and SD 1. 
d. skewed to the right with mean 0 and SD 1. 


e. skewed to the right, but the mean and standard deviation cannot 
be determined without more information. 


T2.9 The Environmental Protection Agency (EPA) requires that the 
exhaust of each model of motor vehicle be tested for the level of 
several pollutants. The level of oxides of nitrogen (NOX) in the 
exhaust of one light truck model was found to vary among 
individual trucks according to an approximately Normal 
distribution with mean p=1.45/ = 1-49 grams per mile driven and 
standard deviation o=0.400 = 0.40 gram per mile. Which of the 
following best estimates the proportion of light trucks of this 
model with NOX levels greater than 2 grams per mile? 


a. 0.0228 
b. 0.0846 
c. 0.4256 
d. 0.9154 
e. 0.9772 


T2.10 Until the scale was changed in 1995, SAT scores were based on a 
scale set many years ago. For Math scores, the mean under the old 
scale in the early 1990s was 470 and the standard deviation was 
110. In 2016, the mean was 510 and the standard deviation was 
103. Gina took the SAT in 1994 and scored 500. Her cousin 
Colleen took the SAT in 2016 and scored 530. Who did better on 
the exam, and how can you tell? 


a. Colleen—she scored 30 points higher than Gina. 


b. Colleen—her standardized score is higher than Gina’s. 
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c. Gina—her standardized score is higher than Colleen’s. 
d. Gina—the standard deviation was larger in 2016. 


e. The two cousins did equally well—their z-scores are the same. 


Section IJ: Free Response Show all your work. Indicate clearly the 
methods you use, because you will be graded on the correctness of your 
methods as well as on the accuracy and completeness of your results and 
explanations. 


T2.11 The dotplot gives the sale prices for 40 houses in Ames, Iowa, sold 
during a recent month. The mean sale price was $203,388 with a 
standard deviation of $87,609. 


ees see 
« 

T T T T T T 

50 100 150 200 250 300 350 400 450 

Price ($1000s) 


a. Find the percentile of the house indicated in red on the dotplot. 


b. Calculate and interpret the standardized score (z-score) for the 
house indicated by the red dot, which sold for $234,000. 


T2.12 A study of 12,000 able-bodied male students at the University of 
Illinois found that their times for the mile run were approximately 
Normally distributed with mean 7.11 minutes and standard 


deviation 0.74 minute.48 


a. About how many students ran the mile in less than 6 minutes? 


b. Approximately how long did it take the slowest 10% of students 
to run the mile? 


c. Suppose that these mile run times are converted from minutes to 
seconds. Estimate the percent of students who ran the mile in 
between 400 and 500 seconds. 


T2.13 A study recorded the amount of oil recovered from the 64 wells in 
an oil field, in thousands of barrels. Here are descriptive statistics 
for that set of data from statistical software. 


Descriptive Statistics: Oilprod 
Variable oN Mean Median StDev Min Max Qi Q3 
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Oilprod 64 48.25 37.80 40.24 2.00 20490 21.40 60.75 


Based on the summary Statistics, is the distribution of amount of oil 
recovered from the wells in this field approximately Normal? 


Justify your answer. 
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INTRODUCTION 


Investigating relationships between variables is central to what we do in 
Statistics. When we understand the relationship between two variables, we 
can use the value of one variable to help us make predictions about the 
other variable. In Section 1.1, we explored relationships between 
categorical variables, such as membership in an environmental club and 
snowmobile use for visitors to Yellowstone National Park. The association 
between these two variables suggests that members of environmental clubs 
are less likely to own or rent snowmobiles than nonmembers. 

In this chapter, we investigate relationships between two quantitative 
variables. Does knowing how much money a Major League Baseball team 
spent on its players tell us anything about how many wins that team will 
have? What can we learn about the price of a used car from the number of 
miles it has been driven? Can students with larger hands grab more candy? 
The following activity will help you explore the last question. 


ACTIVITY Candy grab 


In this activity, you will investigate if students with a larger hand span 
can grab more candy than students with a smaller hand span.4 
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Josh Tabor 


. Measure the span of your dominant hand to the nearest half- 
centimeter (cm). Hand span is the distance from the tip of the thumb 
to the tip of the pinkie finger on your fully stretched-out hand. 


. One student at a time, go to the front of the class and use your 
dominant hand to grab as many candies as possible from the 
container. You must grab the candies with your fingers pointing down 
(no scooping!) and hold the candies for 2 seconds before counting 
them. After counting, put the candy back into the container. 


. On the board, record your hand span and number of candies in a table 
with the following headings: 


Hand span (cm) Number of candies 


. While other students record their values on the board, copy the table 
onto a piece of paper and make a graph. Begin by constructing a set 
of coordinate axes. Label the horizontal axis “Hand span (cm)” and 
the vertical axis “Number of candies.” Choose an appropriate scale 
for each axis and plot each point from your class data table as 
accurately as you can on the graph. 


. What does the graph tell you about the relationship between hand 
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span and number of candies? Summarize your observations in a 
sentence or two. 
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SECTION 3.1 Scatterplots and Correlation 


LEARNING TARGETS By the end of the section, you should be able to: 


e Distinguish between explanatory and response variables for 
quantitative data. 


Make a scatterplot to display the relationship between two 
quantitative variables. 


Describe the direction, form, and strength of a relationship 
displayed in a scatterplot and identify unusual features. 


Interpret the correlation. 


Understand the basic properties of correlation, including how the 
correlation is influenced by outliers. 


Distinguish correlation from causation. 


Most statistical studies examine data on more than one variable. 
Fortunately, analysis of relationships between two variables builds on the 
same tools we used to analyze one variable. The principles that guide our 
work also remain the same: 


e Plot the data, then look for overall patterns and departures from those 
patterns. 


e Add numerical summaries. 


e When there’s a regular overall pattern, use a simplified model to 
describe it. 


Explanatory and Response Variables 


In the “Candy grab” activity, the number of candies is the response 
variable. Hand span is the explanatory variable because we anticipate 
that knowing a student’s hand span will help us predict the number of 
candies that student can grab. 
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DEFINITION Response variable, Explanatory variable 


A response variable measures an outcome of a study. An 
explanatory variable may help predict or explain changes in a 
response variable. 


You will often see explanatory variables called independent variables and response 


variables called dependent variables. Because the words independent and dependent 


have other meanings in statistics, we won’t use them here. 


It is easiest to identify explanatory and response variables when we 
initially specify the values of one variable to see how it affects another 
variable. For instance, to study the effect of alcohol on body temperature, 
researchers gave several different amounts of alcohol to mice. Then they 
measured the change in each mouse’s body temperature 15 minutes later. 
In this case, amount of alcohol is the explanatory variable, and change in 
body temperature is the response variable. When we don’t specify the 
values of either variable before collecting the data, there may or may not 
be a clear explanatory variable. 


EXAMPLE | Diamonds and the SAT (>) 


Explanatory or response? 


stori/Deposit Photos 


PROBLEM: Identify the explanatory variable and response variable for 
the following relationships, if possible. Explain your reasoning. 


a. The weight (in carats) and the price (in dollars) for a sample of 
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diamonds. 


b. The SAT math score and the SAT evidence-based reading and 
writing score for a sample of students. 


SOLUTION: 


a. Explanatory: weight; Response: price. The weight of a diamond 
helps explain how expensive it is. 


b. Either variable could be the explanatory variable because each one 
could be used to predict or explain the other. 


FOR PRACTICE, TRY EXERCISE 1 


In many studies, the goal is to show that changes in one or more 
explanatory variables actually cause changes in a response variable. 
However, other explanatory—response relationships don’t involve direct 
causation. In the alcohol and mice study, alcohol actually causes a change 
in body temperature. But there is no cause-and-effect relationship between 
SAT math and evidence-based reading and writing scores. 


Displaying Relationships: Scatterplots 


Although there are many ways to display the distribution of a single 
quantitative variable, a scatterplot is the best way to display the 
relationship between two quantitative variables. 


DEFINITION Scatterplot 


A scatterplot shows the relationship between two quantitative 
variables measured on the same individuals. The values of one 
variable appear on the horizontal axis, and the values of the other 
variable appear on the vertical axis. Each individual in the data set 
appears as a point in the graph. 


Figure 3.1 shows a scatterplot that displays the relationship between 
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hand span (cm) and number of Starburst™ candies for the 24 students in 
Mr. Tyson’s class who did the “Candy grab” activity. As you can see, 
students with larger hand spans were typically able to grab more candies. 


40 4 + 


Number of Starburst candies 
Y , 
=| 


17 18 19 20 21 22 23 24 25 
Hand span (cm) 


FIGURE 3.1 Scatterplot of hand span (cm) and number of Starburst candies 
grabbed by 24 students. Only 23 points appear because two students had hand 
spans of 19 cm and grabbed 28 Starburst candies. 


After collecting data for two quantitative variables, it is easy to make a 
scatterplot. 


HOW TO MAKE A SCATTERPLOT 


e Label the axes. Put the name of the explanatory variable under the 
horizontal axis and the name of the response variable to the left of the 
vertical axis. If there is no explanatory variable, either variable can 
go on the horizontal axis. 


Scale the axes. Place equally spaced tick marks along each axis, 
beginning at a convenient number just below the smallest value of 
the variable and continuing until you exceed the largest value. 


Plot individual data values. For each individual, plot a point 
directly above that individual’s value for the variable on the 
horizontal axis and directly to the right of that individual’s value for 
the variable on the vertical axis. 


The following example illustrates the process of constructing a 
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scatterplot. 


EXAMPLE | Buying wins (») 
Making a scatterplot 


Ezra Shaw/Getty Images 


PROBLEM: Do baseball teams that spend more money on players also 
win more games? The table shows the payroll (in millions of dollars) 
and number of wins for each of the 30 Major League Baseball teams 
during the 2016 regular season.2 Make a scatterplot to show the 
relationship between payroll and wins. 


Team Payroll Wins 
Arizona Diamondbacks 103 69 
Atlanta Braves 122 68 
Baltimore Orioles 157 89 
Boston Red Sox 215 93 
Chicago Cubs 182 103 
Chicago White Sox 141 78 
Cincinnati Reds 114 68 
Cleveland Indians 114 94 
Colorado Rockies 121 7a) 
Detroit Tigers 206 86 
Houston Astros 118 84 
Kansas City Royals 145 81 
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Los Angeles Angels 181 74 


Los Angeles Dodgers 274 91 
Miami Marlins 81 79 
Milwaukee Brewers 75 fis: 
Minnesota Twins 112 59 
New York Mets 150 87 
New York Yankees 227 84 
Oakland Athletics 98 69 
Philadelphia Phillies 117 71 
Pittsburgh Pirates 106 78 
San Diego Padres 127 68 
San Francisco Giants 181 87 
Seattle Mariners 155 86 
St. Louis Cardinals 167 86 
Tampa Bay Rays TA 68 
Texas Rangers 169 95 
Toronto Blue Jays 159 89 
Washington Nationals 163 95 
SOLUTION: 


Number of wins 


{ I 
so 100 150 200 4250 300 
Payroll (+ millions) 


e Label the axes. The explanatory variable is payroll because we think 
it might help explain the number of wins. 


e Scale the axes. 
e Plot individual data values. 
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FOR PRACTICE, TRY EXERCISE 3 


Describing a Scatterplot 


To describe a scatterplot, follow the basic strategy of data analysis from 
Chapter 1: look for patterns and important departures from those patterns. 
The scatterplot in Figure 3.2(a) shows a positive association between 
wins and payroll for MLB teams in 2016. That is, teams that spent more 
money typically won more games. Other scatterplots, such as the one in 


Figure 3.2(b), show a negative association. Teams that allow their 
opponent to score more runs typically win fewer games. 


1104 1104 


. 
100 100 4 


Number of wins 
Number of wins 


T T T T T 
50 100 150 200 250 300 550 600 650 700 750 s00 85) 


) 900 


(a) Payroll (millions of dollars) (b) Runs allowed 

FIGURE 3.2 Scatterplots using data from the 30 Major League Baseball teams in 
2016. (a) There is a positive association between payroll (in millions of dollars) 
and number of wins. (b) There is a negative association between runs allowed and 
number of wins. 


In some cases, there is no association between two variables. For 
example, the following scatterplot shows the relationship between height 
(in centimeters) and the typical amount of sleep on a non-school night (in 
hours) for a sample of students.2 Knowing the height of a student doesn’t 
help predict how much he or she likes to sleep on Saturday night! 
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DEFINITION Positive association, Negative association, 
No association 


Two variables have a positive association when above-average 
values of one variable tend to accompany above-average values of the 
other variable and when below-average values also tend to occur 
together. 


Two variables have a negative association when above-average 
values of one variable tend to accompany below-average values of the 
other variable. 


There is no association between two variables if knowing the value 
of one variable does not help us predict the value of the other 
variable. 


Identifying the direction of an association in a scatterplot is a good 
start, but there are several other characteristics that we need to address 
when describing a scatterplot. 


HOW TO DESCRIBE A SCATTERPLOT 


To describe a scatterplot, make sure to address the following four 
characteristics in the context of the data: 


e Direction: A scatterplot can show a positive association, negative 
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association, or no association. 


e Form: A scatterplot can show a linear form or a nonlinear form. The 
form is linear if the overall pattern follows a straight line. Otherwise, 
the form is nonlinear. 


e Strength: A scatterplot can show a weak, moderate, or strong 
association. An association is strong if the points don’t deviate much 
from the form identified. An association is weak if the points deviate 
quite a bit from the form identified. 


e Unusual features: Look for outliers that fall outside the overall 
pattern and distinct clusters of points. 


AP® EXAM TIP 


When you are asked to describe the association shown in a scatterplot, 
you are expected to discuss the direction, form, and strength of the 
association, along with any unusual features, in the context of the 
problem. This means that you need to use both variable names in your 
description. 


Even though they have opposite directions, both associations in Figure 
3.2 on page 156 have a linear form. However, the association between runs 
allowed and wins is stronger than the relationship between payroll and 
wins because the points in Figure 3.2(b) deviate less from the linear 
pattern. Each scatterplot has one potential outlier: In Figure 3.2(a), the Los 
Angeles Dodgers spent $274 million and had “only” 91 wins. In Figure 
3.2(b), the Texas Rangers gave up 757 runs but had 95 wins. 

oOo Even when there is a clear relationship between two variables 


in a scatterplot, the direction of the association describes only the 
overall trend—not an absolute relationship. For example, even though 
teams that spend more generally have more wins, there are plenty of 
exceptions. The Minnesota Twins spent more money than six other teams, 
but had fewer wins than any team in the league. 


| EXAMPLE | Old Faithful and fertility (») 
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| Describing a scatterplot 


BigshotD3/iStock/Getty Images 


PROBLEM: Describe the relationship in each of the following 


contexts. 

The scatterplot on the left shows the relationship between the 
duration (in minutes) of an eruption and the interval of time until the 
next eruption (in minutes) of Old Faithful during a particular month. 


b. The scatterplot on the right shows the relationship between the 
average income (gross domestic product per person, in dollars) and 
fertility rate (number of children per woman) in 187 countries.4 


a. 
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SOLUTION: 


a. There is a strong, positive linear relationship between the duration of 
an eruption and the interval of time until the next eruption. There are 
two main clusters of points: one cluster has durations around 2 
minutes with intervals around 55 minutes, and the other cluster has 
durations around 4.5 minutes with intervals around 90 minutes. 


Even with the clusters, the overall direction is still positive. In some 


cases, however, the points in a cluster go in the opposite direction of 
the overall association. 


b. There is a moderately strong, negative nonlinear relationship 
between average income and fertility rate in these countries. There is 
a potential outlier with an average income around $30,000 and a 
fertility rate around 4.7. 


The association is called “nonlinear” because the pattern of points is 


clearly curved. 


FOR PRACTICE, TRY EXERCISE 5 


CHECK YOUR UNDERSTANDING 


( 


,, Is there a relationship between the amount of sugar (in grams) and 
the number of calories in movie-theater candy? Here are the data from a 
sample of 12 types of candy.2 


Name Sugar (g) Calories 
Butterfinger Minis 45 450 
Junior Mints 107 970 
M&M’S® 62 480 
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Milk Duds 44 370 


Peanut M&M’S® 79 790 
Raisinets 60 420 
Reese’s Pieces 61 980 
Skittles 87 450 
Sour Patch Kids a2 490 
Swee Tarts 136 680 
Twizzlers 59 460 
Whoppers 48 350 


1. Identify the explanatory and response variables. Explain your 
reasoning. 


2. Make a scatterplot to display the relationship between amount of 
sugar and the number of calories in movie-theater candy. 


3. Describe the relationship shown in the scatterplot. 


8. Technology Corner |MAKING SCATTERPLOTS (») 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


Making scatterplots with technology is much easier than constructing 
them by hand. We’Il use the MLB data from page 155 to show how to 
construct a scatterplot on a TI-83/84. 


1. Enter the payroll values in L1 and the number of wins in L2. 


e Press |STAT| and choose Edit.... 


e Type the values into L1 and L2. 


LaCc=103 
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2. Set up a Scatterplot in the statistics plots menu. 


° Press (STAT PLOT). 
e Press or |1/ to go into Plot1. 


e Adjust the settings as shown. 


NORMAL FLOAT AUTO REAL RADIAN HP n 
PRESS (<] OR (>] TO SELECT AN OPTION 


Plot2 Plot3 


Mark :B] + = - 
Color HEU 


3. Use ZoomStat to let the calculator choose an appropriate window. 


e Press |ZOOM| and choose ZoomStat. 


e oo i . 


AP® EXAM TIP 


If you are asked to make a scatterplot, be sure to label and scale both 


axes. Don’t just copy an unlabeled calculator graph directly onto 
your paper. 


Measuring Linear Association: Correlation 


A scatterplot displays the direction, form, and strength of a relationship 
between two quantitative variables. Linear relationships are particularly 
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important because a straight line is a simple pattern that is quite common. 
A linear relationship is considered strong if the points lie close to a straight 
line and is considered weak if the points are widely scattered about the 
line. Unfortunately, our eyes are not the most reliable tools when it comes 
to judging the strength of a linear relationship. When the association 
between two quantitative variables is linear, we can use the correlation r 
to help describe the strength and direction of the association. 


Some people refer to r as the “correlation coefficient.” 


DEFINITION Correlation r 


For a linear association between two quantitative variables, the 
correlation r measures the direction and strength of the association. 


Here are some important properties of the correlation r: 
e The correlation r is always a number between —1 and 1 (-1<r<1). 


e The correlation r indicates the direction of a linear relationship by its 
sign: r > 0 for a positive association and r < 0 for a negative association. 


e The extreme values r= —1 andr = 1 occur only in the case of a perfect 
linear relationship, when the points lie exactly along a straight line. 


e If the linear relationship is strong, the correlation r will be close to 1 or 
—1. If the linear relationship is weak, the correlation r will be close to 0. 

o@ It is only appropriate to use the correlation to describe 
strength and direction for a linear relationship. This is why the word 
linear kept appearing in the list above! 

Figure 3.3 shows six scatterplots that correspond to various values of r. 
To make the meaning of r clearer, the standard deviations of both variables 
in these plots are equal, and the horizontal and vertical scales are the same. 
The correlation r describes the direction and strength of the linear 
relationship in each scatterplot. 
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Correlation r = -—0.3 
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Correlation r = —0.7 Correlation r = 0.9 Correlation r = —0.99 


FIGURE 3.3 How correlation measures the strength and direction of a linear 


relationship. When the dots are tightly packed around a line, the correlation will be 


close to 1 or -1. 


prPLe> 


ACTIVITY Guess the correlation ¢ 


In this activity, we will have a class competition to see who can best 
guess the correlation. 


Li. 


Load the Guess the Correlation applet at 
www.rossmanchance.com/applets. 


. The teacher will press the “New Sample” button to see a “random” 


scatterplot. As a class, try to guess the correlation. Type the guess in 
the “Correlation guess” box and press “Check Guess” to see how the 
class did. Repeat several times to see more examples. For the 
competition, there will be two rounds. 


. Starting on one side of the classroom and moving in order to the other 


side, the teacher will give each student one new sample and have him 
or her guess the correlation. The teacher will then record how far off 
the guess was from the true correlation. 


. Once every student has made an attempt, the teacher will give each 


student a second sample. This time, the students will go in reverse 


375 


order so that the student who went first in Round 1 will go last in 


Round 2. The student who has the closest guess in either round wins a 
prize! 


Correlation Guessing Game 


Number of points: 100 ae 
New Sample 
OD Edit/Paste Data ad (Ee ‘ 
— e e 
60 e 
Correlation guess: fo | > ak ce 
Check Guess 40 Pu Re - 
e . 
20 - @ 
0 


The following example illustrates how to interpret the correlation. 


EXAMPLE | Payroll and wins (>) 


Interpreting a correlation 


jonathansloane/E+/Getty Images 


PROBLEM: Here is the scatterplot showing the relationship between 


payroll (in millions of dollars) and wins for MLB teams in 2016. For 
these data, r = 0.613. Interpret the value of r. 
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Number of wins 


50 100 150 200 250 300 
Payroll ($ millions) 


SOLUTION: 


The correlation of r = 0.613 confirms that the linear association between 
payroll and number of wins is moderately strong and positive. 


Always include context by using the variable names in your answer. 


FOR PRACTICE, TRY EXERCISE 15 


CHECK YOUR UNDERSTANDING 


A 


y The scatterplot shows the 40-yard-dash times (in seconds) and long- 
jump distances (in inches) for a small class of 12 students. The 
correlation for these data is r = —0.838. Interpret this value. 
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Cautions about Correlation 


While the correlation is a good way to measure the strength and direction 
of a linear relationship, it has several limitations. 


o@ Correlation doesn’t imply causation. In many cases, two 


variables might have a strong correlation, but changes in one variable are 
very unlikely to cause changes in the other variable. Consider the 
following scatterplot showing total revenue generated by skiing facilities 
in the United States and the number of people who died by becoming 
tangled in their bedsheets in 10 recent years.© The correlation for these 
data is r = 0.97. Does an increase in skiing revenue cause more people to 
die by becoming tangled in their bedsheets? We doubt it! 


Number of deaths 
from tangling 
S 
S 


= 


T T T T T 
1600 1800 2000 2200 2400 


Revenue from skiing ($ millions) 


Even though we shouldn’t automatically conclude that there is a cause- 
and-effect relationship between two variables when they have an 
association, in some cases there might actually be a cause-and-effect 
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relationship. You will learn how to distinguish these cases in Chapter 4. 


The following activity helps you explore some additional limitations of 
the correlation. 


ACTIVITY Correlation and Regression applet 


In this activity, you will use an applet to investigate some important 
properties of the correlation. Go to the book’s website at 
highschool.bfwpub.com/tps6e and launch the Correlation and 
Regression applet. 


1. You are going to use the Correlation and Regression applet to make 
several scatterplots that have correlation close to 0.7 


a. Start by putting two points on the graph. What’s the value of the 
correlation? Why does this make sense? 


b. Make a lower-left to upper-right pattern of 10 points with 
correlation about r = 0.7. You can drag points up or down to adjust 
r after you have 10 points. 


c. Make a new scatterplot, this time with 9 points in a vertical stack at 
the left of the plot. Add 1 point far to the right and move it until the 
correlation is close to 0.7. 


d. Make a third scatterplot, this time with 10 points in a curved 
pattern that starts at the lower left and rises to the right. Adjust the 
points up or down until you have a smooth curve with correlation 
close to 0.7. 


Summarize: If you know only that the correlation between two variables 
is r= 0.7, what can you say about the form of the relationship? 

2. Click on the scatterplot to create a group of 7 points in a U shape so 
that there is a strong nonlinear association. What is the correlation? 
Summarize: If you know only that the correlation between two variables 

is r = 0, what can you say about the strength of the relationship? 


3. Click on the scatterplot to create a group of 10 points in the lower-left 
comer of the scatterplot with a strong linear pattern (correlation about 
0.9). 


379 


a. Add 1 point at the upper right that is in line with the first 10. How 
does the correlation change? 


b. Drag this last point straight down. How small can you make the 
correlation? Can you make the correlation negative? 


Summarize: What did you learn from Step 3 about the effect of an 
outlier on the correlation? 


The activity highlighted some important cautions about correlation. 
oOo Correlation does not measure form. Here is a scatterplot showing 
the speed (in miles per hour) and the distance (in feet) needed to come to a 
complete stop when a motorcycle’s brake was applied.2 The association is 
clearly curved, but the correlation is quite large: r = 0.98. In fact, the 
correlation for this nonlinear association is much greater than the 


correlation of r = 0.613 for the MLB payroll data, which had a clear linear 
association. 


Braking distance (ft) 
2 
l 


Speed (mph) 


ow Correlation should only be used to describe linear 
relationships. The association displayed in the following scatterplot is 
extremely strong, but the correlation is r = 0. This isn’t a contradiction 


because correlation doesn’t measure the strength of nonlinear 
relationships. 
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rl The correlation is not a resistant measure of strength. In the 


following scatterplot, the correlation is r = —0.13. But when the outlier is 
excluded, the correlation becomes r = 0.72. 


Like the mean and the standard deviation, the correlation can be 
greatly influenced by outliers. 


EXAMPLE | Nobel chocolate » 


Cautions about correlation 
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amphotora/Getty Images 


PROBLEM: Most people love chocolate for its great taste. But does it 
also make you smarter? A scatterplot like this one recently appeared in 
the New England Journal of Medicine.® The explanatory variable is the 
chocolate consumption per person for a sample of countries. The 
response variable is the number of Nobel Prizes per 10 million residents 
of that country. 


35 
e Switzerland 


r= 0.791 * Sweden 


Denmark 
25 Austria *y 


30 


e Norway 


20 
* United Kingdom 


The Netherlands, United States * Ireland ¢ Germany 
sd 
10 : . 
Belgium e e France 
Canad ° Finland 
anacae . Australia 


Nobel laureates (per 10 million people) 
a 


Portugal Poland » Italy 
Japan ° Greece * Spain 
e 


e 
China Brazil 


0 | 10 15 
Chocolate consumption (kg/yr per person) 


a. If people in the United States started eating more chocolate, could we 
expect more Nobel Prizes to be awarded to residents of the United 
States? Explain. 


b. What effect does Switzerland have on the correlation? Explain. 


SOLUTION 
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a. No; even though there is a strong correlation between chocolate 
consumption and the number of Nobel laureates in a country, 
causation should not be inferred. It is possible that both of these 
variables are changing due to another variable, such as per capita 
income. 


Not all questions about cause and effect include the word cause. 


Make sure to read questions—and reports in the media—very 
carefully. 


b. When Switzerland is included with the rest of the points, it makes the 
association stronger because it doesn’t vary much from the linear 
pattern. This makes the correlation closer to 1. 


FOR PRACTICE, TRY EXERCISES 17 AND 19 


Calculating Correlation 


Now that you understand the meaning and limitations of the correlation, 
let’s look at how it’s calculated. 


HOW TO CALCULATE THE CORRELATION r 


Suppose that we have data on variables x and y for n individuals. The 
values for the first individual are x; and yj, the values for the second 


individual are xj and y9, and so on. The means and standard deviations 
of the two variables are x & and s, for the x-values, and y ¥ and Sy for 
the y-values. The correlation r between x and y is 
r=1n- 1[ (k1-x sx)(yl—-y sy)+(x2- x sx)(y2-y sy)+...+(xn- 
x sx)(yn—y sy) | 


ICS) OFA): 
(22) (52) 
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or, More compactly, 


1 2;—-f Y¥;—-Y 
ee ae Nar 
n—l 87 Sy 


r=1n —- 1)(xi- x sx)(yi-y sy) - 


The formula for the correlation r is a bit complex. It helps us 
understand some properties of correlation, but in practice you should use 
your calculator or software to find r. Exercises 21 and 22 ask you to 
calculate a correlation step by step from the definition to solidify its 
meaning. 

Figure 3.4 shows the relationship between the payroll (in millions of 
dollars) and the number of wins for the 30 MLB teams in 2016. The red 
dot on the right represents the Los Angeles Dodgers, whose payroll was 
$274 million and who won 91 games. 
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FIGURE 3.4 Scatterplot showing the relationship between payroll (in millions of 
dollars) and number of wins for 30 Major League Baseball teams in 2016. The 
point representing the Los Angeles Dodgers is highlighted in red. 


The formula for r begins by standardizing the observations. The value 


| 


xi-x sx $2 


in the correlation formula is the standardized payroll (z-score) of the ith 
team. In 2016, the mean payroll was x_ = $145.033% = $145.033 million 
with a standard deviation of sx = $46.879%2 = $46.879 million. For the 
Los Angeles Dodgers, the corresponding z-score is 
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_ _274-145.033 _, 
2X=274 — 145.03346.879=2.75 — 46.879 
The Dodgers’ payroll is 2.75 standard deviations above the mean. 


Likewise, the value 


Yy-Y 
yi-y sy *Y 
in the correlation formula is the standardized number of wins for the ith 
team. In 2016, the mean number of wins was y = 80.99 = 80.9 with a 


standard deviation of sx = 10.669.’ = 10.669. For the Los Angeles 
Dodgers, the corresponding z-score is 


ne 1.05 
zy=91 - 80.910.669=0.95 " 10.669 


ZUMA Press Inc/Alamy 


The Dodgers’ number of wins is 0.95 standard deviation above the mean. 

Multiplying the Dodgers’ two z-scores, we get a product of (2.75) 
(0.95) = 2.6125. The correlation r is an “average” of the products of the 
standardized scores for all the teams. Just as in the case of the standard 
deviation s,, we divide by 1 fewer than the number of individuals to find 
the average. Finishing the calculation reveals that r = 0.613 for the 30 
MLB teams. 


Some people like to write the correlation formula as 


b td 
r=1n- 1) zxzy n—-1 


to emphasize the product of standardized scores in the calculation. 
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To understand what correlation measures, consider the graphs in 
Figure 3.5 on the next page. At the left is a scatterplot of the MLB data 
with two lines added—a vertical line at the mean payroll and a horizontal 
line at the mean number of wins. Most of the points fall in the upper-right 
or lower-left “quadrants” of the graph. Teams with above-average payrolls 
tend to have above-average numbers of wins, like the Dodgers. Teams 
with below-average payrolls tend to have numbers of wins that are below 
average. This confirms the positive association between the variables. 

110 4 
iow - 


90 4 oe e 


Number of wins 
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FIGURE 3.5 (a) Scatterplot showing the relationship between payroll (in millions 
of dollars) and number of wins for 30 Major League Baseball teams in 2016, with 
lines showing the mean of each variable. (b) Scatterplot showing the relationship 
between the standardized values of payroll and the standardized values of number 


of wins for the same 30 teams. 


Below on the right is a scatterplot of the standardized scores. To get 
this graph, we transformed both the x- and the y-values by subtracting their 
mean and dividing by their standard deviation. As we saw in Chapter 2, 
standardizing a data set converts the mean to 0 and the standard deviation 
to 1. That’s why the vertical and horizontal lines in the right-hand graph 
are both at 0. 


For the points in the upper-right quadrant and the lower-left quadrant, 
the products of the standardized values will be positive. Because most of 
the points are in these two quadrants, the sum of the z-score products will 
also be positive, resulting in a positive correlation r. 


What if there was a negative association between two variables? Most 
of the points would be in the upper-left and lower-right quadrants and their 
z-score products would be negative, resulting in a negative correlation. 
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Additional Facts about Correlation 


Now that you have seen how the correlation is calculated, here are some 
additional facts about correlation. 


. Correlation requires that both variables be quantitative, so that it makes 
sense to do the arithmetic indicated by the formula for r. We cannot 
calculate a correlation between the incomes of a group of people and 
what city they live in because city is a categorical variable. When one or 
both of the variables are categorical, use the term association rather than 
correlation. 


’. Correlation makes no distinction between explanatory and response 
variables. When calculating the correlation, it makes no difference 
which variable you call x and which you call y. Can you see why from 


the formula? 
= l 3 v5 Li Yi y 
n—1 8x Sy 


3. Because r uses the standardized values of the observations, r does not 
change when we change the units of measurement of x, y, or both. The 
correlation between height and weight won’t change if we measure 
height in centimeters rather than inches and measure weight in 
kilograms rather than pounds. 


- 
r=1n-1) (xi-x sx)(yi-y sy) 


|. The correlation r has no unit of measurement. It is just a number. 


EXAMPLE | Long strides (») 


More about correlation 
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Monkey Business Images/Shutterstock.com 


PROBLEM: The following scatterplot shows the height (in inches) and 
number of steps needed for a random sample of 36 students to walk the 
length of a school hallway. The correlation is r = —0.632. 


Number of steps 
wn nn 
i) wm 


Py 
wm 


s 


Height (in.) 


a. Explain why it isn’t correct to say that the correlation is —0.632 steps 
per inch. 


b. What would happen to the correlation if number of steps was used as 
the explanatory variable and height was used as the response 
variable? 


c. What would happen to the correlation if height was measured in 
centimeters instead of inches? Explain. 


SOLUTION: 


a. Because correlation is calculated using standardized values, it 
doesn’t have units. 
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Although it is unlikely that you will need to calculate the correlation by 


hand, understanding how the formula works makes it easier to answer 
questions like these. 


b. The correlation would be the same because correlation doesn’t make 
a distinction between explanatory and response variables. 


c. The correlation would be the same. Because r is calculated using 
standardized values, changes of units don’t affect correlation. 


Changing from inches to centimeters won’t change the locations of the 


points, only the numbers on the horizontal scale. 


FOR PRACTICE, TRY EXERCISE 23 


Section 3.1 Summary 


e A scatterplot displays the relationship between two quantitative 
variables measured on the same individuals. Mark values of one 
variable on the horizontal axis (x axis) and values of the other variable 
on the vertical axis (y axis). Plot each individual’s data as a point on the 
graph. 


e If we think that a variable x may help predict, explain, or even cause 
changes in another variable y, we call x an explanatory variable and y a 
response variable. Always plot the explanatory variable on the x axis of 
a scatterplot. Plot the response variable on the y axis. 


e When describing a scatterplot, look for an overall pattern (direction, 
form, strength) and departures from the pattern (unusual features) and 
always answer in context. 


: Direction: A relationship can have a positive association (when 
larger values of one variable tend to be paired with larger values of the 
other variable), a negative association (when larger values of one 
variable tend to be paired with smaller values of the other variable), or 
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no association (when knowing the value of one variable doesn’t help 
predict the value of the other variable). 


: Form: The form of a relationship can be linear or nonlinear (curved). 


. Strength: The strength of a relationship is determined by how close 
the points in the scatterplot lie to a simple form such as a line. 


» Unusual features: Look for outliers that fall outside the pattern and 
distinct clusters of points. 


e For linear relationships, the correlation r measures the strength and 
direction of the association between two quantitative variables x and y. 


e Correlation indicates the direction of a linear relationship by its sign: r > 
O for a positive association and r < 0 for a negative association. 
Correlation always satisfies —1 < r < 1 with values of r closer to 1 and 
—1 indicating stronger associations. Correlations of r= 1 andr = —1 
occur only when the points on a scatterplot lie exactly on a straight line. 


e Remember these limitations of r: Correlation does not imply causation. 
The correlation is not resistant, so outliers can greatly change the value 
of r. The correlation should only be used to describe linear relationships. 


e Correlation ignores the distinction between explanatory and response 
variables. The value of r does not have units and is not affected by 
changes in the unit of measurement of either variable. 


3.1 Technology Corner » 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 
8. Making scatterplots Page 159 


Section 3.1 Exercises 


1. pg 154 |») Coral reefs and cell phones Identify the explanatory 


variable and the response variable for the following relationships, if 
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possible. Explain your reasoning. 


a. The weight gain of corals in aquariums where the water temperature 
is controlled at different levels 


b. The number of text messages sent and the number of phone calls 
made in a sample of 100 students 
2. Teenagers and corn yield Identify the explanatory variable and the 
response variable for the following relationships, if possible. Explain 
your reasoning. 


a. The height and arm span of a sample of 50 teenagers 


b. The yield of corn in bushels per acre and the amount of rain in the 
growing season 
3. pg 155 |p) Heavy backpacks Ninth-grade students at the Webb 


Schools go on a backpacking trip each fall. Students are divided into 
hiking groups of size 8 by selecting names from a hat. Before leaving, 
students and their backpacks are weighed. The data here are from one 
hiking group. Make a scatterplot by hand that shows how backpack 
weight relates to body weight. 


Body weight (Ib) 120 187 109 103 131 165 158 116 
Backpack weight (Ib) 26 30 26 24 29 35 31 = 28 


4. Putting success How well do professional golfers putt from various 
distances to the hole? The data show various distances to the hole (in 
feet) and the percent of putts made at each distance for a sample of 
golfers.2 Make a scatterplot by hand that shows how the percent of 
putts made relates to the distance of the putt. 


Distance (ft) Percent made 
Z 93.3 

83.1 

74.1 

58.9 

54.8 

53.1 

46.3 

31.8 


Oo DON DUH BW 
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10 33.5 


11 31.6 
12 20st 
13 24.0 
14 31.0 
15 16.8 
16 13.4 
17 15.9 
18 L753 
ig 13.6 
20 15.8 


5. pg 158 |») Olympic athletes The scatterplot shows the relationship 


6. 


between height (in inches) and weight (in pounds) for the members of 
the United States 2016 Olympic Track and Field team.12 Describe the 
relationship between height and weight for these athletes. 


350 - 
300 = 
a 
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= 200 4 2 ° ° fg 
A=] 5 ee 
“J > 3 : 3. 
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Starbucks The scatterplot shows the relationship between the amount 
of fat (in grams) and number of calories in products sold at 


Starbucks.1! Describe the relationship between fat and calories for 
these products. 
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More heavy backpacks Refer to your graph from Exercise 3. 
Describe the relationship between body weight and backpack weight 
for this group of hikers. 

More putting success Refer to your graph from Exercise 4. Describe 
the relationship between distance from hole and percent of putts made 
for the sample of professional golfers. 

Does fast driving waste fuel? How does the fuel consumption of a car 
change as its speed increases? Here are data for a British Ford Escort. 
Speed is measured in kilometers per hour, and fuel consumption is 
measured in liters of gasoline used per 100 kilometers traveled. +2 


Fuel used (L/100 
Speed (km/h) km) 
10 21.00 
20 13.00 
30 10.00 
40 8.00 
50 7.00 
60 5.90 
70 6.30 
80 6.95 
90 igo 
100 B27 
110 9.03 
120 9.87 
130 10.79 
140 11,77 
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150 12.83 


a. Make a scatterplot to display the relationship between speed and 
fuel consumption. 


b. Describe the relationship between speed and fuel consumption. 

10. Do muscles burn energy? Metabolic rate, the rate at which the body 
consumes energy, is important in studies of weight gain, dieting, and 
exercise. We have data on the lean body mass and resting metabolic 
rate for 12 women who are subjects in a study of dieting. Lean body 
mass, given in kilograms, is a person’s weight leaving out all fat. 
Metabolic rate is measured in calories burned per 24 hours. The 
researchers believe that lean body mass is an important influence on 
metabolic rate. 


Mass 36.1 54.6 48.5 42.0 50.6 42.0 40.3 33.1 42.4 345 51.1 41.2 
Rate 995 1425 1396 1418 1502 1256 1189 913 1124 1052 1347 1204 


a. Make a scatterplot to display the relationship between lean body 
mass and metabolic rate. 


b. Describe the relationship between lean body mass and metabolic 
rate. 

11. More Olympics Athletes who participate in the shot put, discus throw, 
and hammer throw tend to have different physical characteristics than 
other track and field athletes. The scatterplot shown here enhances the 
scatterplot from Exercise 5 by plotting these athletes with blue 
squares. How are the relationships between height and weight the 
same for the two groups of athletes? How are the relationships 
different? 


Weight (Ib) 
N 
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L 


= a ca a 
60 62 64 66 68 70 72 74 76 78 80 8&2 
Height (in.) 
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12. More Starbucks How do the nutritional characteristics of food 
products differ from drink products at Starbucks? The scatterplot 
shown here enhances the scatterplot from Exercise 6 by plotting the 
food products with blue squares. How are the relationships between fat 
and calories the same for the two types of products? How are the 
relationships different? 
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13. Manatees Manatees are large, gentle, slow-moving sea creatures found 
along the coast of Florida. Many manatees are injured or killed by 
boats. Here is a scatterplot showing the relationship between the 
number of boats registered in Florida (in thousands) and the number of 
manatees killed by boats for the years 1977 to 2015.42 Is r > 0 orr< 
0? Closer to r= 0 or r = +1? Explain your reasoning. 
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14. Windy city Is it possible to use temperature to predict wind speed? 
Here is a scatterplot showing the average temperature (in degrees 
Fahrenheit) and average wind speed (in miles per hour) for 365 
consecutive days at O’ Hare International Airport in Chicago.44 Isr>0 
or r < 0? Closer to r = 0 or r = +1? Explain your reasoning. 
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15. pg 162 |») Points and turnovers Here is a scatterplot showing the 


relationship between the number of turnovers and the number of points 


scored for players in a recent NBA season.!2 The correlation for these 
data is r = 0.92. Interpret the correlation. 


Points 


0 50 100 150 200 250 300 
Turnovers 


16. Oh, that smarts! Infants who cry easily may be more easily stimulated 
than others. This may be a sign of higher IQ. Child development 
researchers explored the relationship between the crying of infants 4 to 
10 days old and their IQ test scores at age 3 years. A snap of a rubber 
band on the sole of the foot caused the infants to cry. The researchers 
recorded the crying and measured its intensity by the number of peaks 
in the most active 20 seconds. The correlation for these data is r = 
0.45.48 Interpret the correlation. 
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IQ at age 3 years 


0 5 10 15 20 25 30 35 40 
Count of crying peaks 


17. pg 165 |») More turnovers? Refer to Exercise 15. Does the fact 


that r = 0.92 suggest that an increase in turnovers will cause NBA 
players to score more points? Explain your reasoning. 

18. More crying? Refer to Exercise 16. Does the fact that r = 0.45 suggest 
that making an infant cry will increase his or her IQ later in life? 
Explain your reasoning. 

19. pg 165 |») Hot dogs Are hot dogs that are high in calories also high 


in salt? The following scatterplot shows the calories and salt content 
(measured in milligrams of sodium) in 17 brands of meat hot dogs.24 
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a. The correlation for these data is r = 0.87. Interpret this value. 


b. What effect does the hot dog brand with the smallest calorie content 
have on the correlation? Justify your answer. 

20. All brawn? The following scatterplot plots the average brain weight 
(in grams) versus average body weight (in kilograms) for 96 species of 
mammals.48 There are many small mammals whose points overlap at 
the lower left. 
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a. The correlation between body weight and brain weight is r = 0.86. 
Interpret this value. 


b. What effect does the human have on the correlation? Justify your 
answer. 

21. Dem bones Archaeopteryx is an extinct beast that had feathers like a 
bird but teeth and a long bony tail like a reptile. Only six fossil 
specimens are known to exist today. Because these specimens differ 
greatly in size, some scientists think they are different species rather 
than individuals from the same species. If the specimens belong to the 
same species and differ in size because some are younger than others, 
there should be a positive linear relationship between the lengths of a 
pair of bones from all individuals. An outlier from this relationship 
would suggest a different species. Here are data on the lengths (in 
centimeters) of the femur (a leg bone) and the humerus (a bone in the 
upper arm) for the five specimens that preserve both bones:/2 


Femur (x) 38 56 59 64 74 
Humerus (y) 41 63 70 72 84 


a. Make a scatterplot. Do you think that all five specimens come from 
the same species? Explain. 


b. Find the correlation r step by step, using the formula on page 166. 
Explain how your value for r matches your graph in part (a). 

22. Data on dating A student wonders if tall women tend to date taller 
men than do short women. She measures herself, her dormitory 
roommate, and the women in the adjoining dorm rooms. Then she 
measures the next man each woman dates. Here are the data (heights 
in inches): 
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Women (x) 66 64 #466 65 70 65 
Men (y) 72 68 70 68 71 65 


a. Make a scatterplot of these data. Describe what you see. 


b. Find the correlation r step by step, using the formula on page 166. 
Explain how your value for r matches your description in part (a). 
23. pg 169 |») More hot dogs Refer to Exercise 19. 


a. Explain why it isn’t correct to say that the correlation is 0.87 
mg/cal. 


b. What would happen to the correlation if the variables were reversed 
on the scatterplot? Explain your reasoning. 


c. What would happen to the correlation if sodium was measured in 
grams instead of milligrams? Explain your reasoning. 
24. More brains Refer to Exercise 20. 


a. Explain why it isn’t correct to say that the correlation is 0.86 g/kg. 


b. What would happen to the correlation if the variables were reversed 
on the scatterplot? Explain your reasoning. 


c. What would happen to the correlation if brain weight was measured 
in kilograms instead of grams? Explain your reasoning. 

25. Rank the correlations Consider each of the following relationships: 
the heights of fathers and the heights of their adult sons, the heights of 
husbands and the heights of their wives, and the heights of women at 
age 4 and their heights at age 18. Rank the correlations between these 
pairs of variables from largest to smallest. Explain your reasoning. 

26. Teaching and research A college newspaper interviews a 
psychologist about student ratings of the teaching of faculty members. 
The psychologist says, “The evidence indicates that the correlation 
between the research productivity and teaching rating of faculty 
members is close to zero.” The paper reports this as “Professor 
McDaniel said that good researchers tend to be poor teachers, and vice 
versa.” Explain why the paper’s report is wrong. Write a statement in 
plain language (don’t use the word correlation) to explain the 
psychologist’s meaning. 

27. Correlation isn’t everything Marc and Rob are both high school 
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28. 


29. 


30. 


English teachers. Students think that Rob is a harder grader, so Rob 
and Marc decide to grade the same 10 essays and see how their scores 
compare. The correlation is r = 0.98, but Rob’s scores are always 
lower than Marc’s. Draw a possible scatterplot that illustrates this 
situation. 

Limitations of correlation A carpenter sells handmade wooden 
benches at a craft fair every week. Over the past year, the carpenter 
has varied the price of the benches from $80 to $120 and recorded the 
average weekly profit he made at each selling price. The prices of the 
bench and the corresponding average profits are shown in the table. 


Price $80 $90 $100 $110 $120 
Average profit $2400 $2800 $3000 $2800 $2400 


a. Make a scatterplot to show the relationship between price and 
profit. 


b. The correlation for these data is r = 0. Explain how this can be true 


even though there is a strong relationship between price and average 
profit. 


Multiple Choice Select the best answer for Exercises 29-34. 


You have data for many years on the average price of a barrel of oil 
and the average retail price of a gallon of unleaded regular gasoline. If 
you want to see how well the price of oil predicts the price of gas, then 
you should make a scatterplot with as the explanatory 
variable. 


a. the price of oil 
b. the price of gas 

c. the year 

d. either oil price or gas price 


e. time 
In a scatterplot of the average price of a barrel of oil and the average 
retail price of a gallon of gas, you expect to see 


a. very little association. 
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aL 


32. 


C. 
d. 


e. 


a weak negative association. 
a strong negative association. 
a weak positive association. 


a strong positive association. 


The following graph plots the gas mileage (in miles per gallon) of 
various cars from the same model year versus the weight of these cars 
(in thousands of pounds). The points marked with red dots correspond 
to cars made in Japan. From this plot, we may conclude that 


d. 


e. 


Miles per gallon 


T T T 
2.25 3.00 3.75 
Weight (thousands of pounds) 


4.50 


there is a positive association between weight and gas mileage for 
Japanese cars. 


. the correlation between weight and gas mileage for all the cars is 


close to 1. 


there is little difference between Japanese cars and cars made in 
other countries. 


Japanese cars tend to be lighter in weight than other cars. 


Japanese cars tend to get worse gas mileage than other cars. 


If women always married men who were 2 years older than 
themselves, what would be the correlation between the ages of 
husband and wife? 


. 


op 


2 
1 
0.5 
0 


Can’t tell without seeing the data 
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33. The scatterplot shows reading test scores against IQ test scores for 14 
fifth-grade children. There is one low outlier in the plot. What effect 
does this low outlier have on the correlation? 


a. It makes the correlation closer to 1. 

b. It makes the correlation closer to 0 but still positive. 
c. It makes the correlation equal to 0. 

d. It makes the correlation negative. 


e. It has no effect on the correlation. 
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34. If we leave out the low outlier, the correlation for the remaining 13 
points in the preceding figure is closest to 


a. —0.95. 
b. —0.65. 
c. 0. 

d. 0.65. 
e. 0.95. 


Recycle and Review 

35. Big diamonds (1.2) Here are the weights (in milligrams) of 58 
diamonds from a nodule carried up to the earth’s surface in 
surrounding rock. These data represent a population of diamonds 


formed in a single event deep in the earth.22 


13.8 3.7 33.8 118 27.0 189 193 208 254 23.1 7.8 
10.9 9.0 9.0 14.4 6.5 Teo 5.6 18.5 11 11.2 7.0 


402 


7.6 
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4.0 


9.0 
5.1 
0.1 
2a 


9.5 
se. 
4.7 
4.5 


Tad 7.6 =o 6.5 3.4 72 Ie 
3.8 2.1 2.1 4.7 af 3.8 49 24 
1.5 2.0 0.1 0.1 1.6 3.5 3.7 2.6 


Make a histogram to display the distribution of weight. Describe the 


distribution. 


36. Fruit fly thorax lengths (2.2) Fruit flies are used frequently in genetic 
research because of their quick reproductive cycle. The length of the 
thorax (in millimeters) for male fruit flies is approximately Normally 
distributed with a mean of 0.80 mm and a standard deviation of 0.08 


mm. 


a. What proportion of male fruit flies have a thorax length greater than 
1mm? 


b. What is the 30th percentile for male fruit fly thorax lengths? 
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SECTION 3.2 Least-Squares Regression 


LEARNING TARGETS By the end of the section, you should be able to: 


Make predictions using regression lines, keeping in mind the 
dangers of extrapolation. 


Calculate and interpret a residual. 
Interpret the slope and y intercept of a regression line. 


Determine the equation of a least-squares regression line using 
technology or computer output. 


Construct and interpret residual plots to assess whether a regression 
model is appropriate. 


Interpret the standard deviation of the residuals and r2 and use these 
values to assess how well a least-squares regression line models the 
relationship between two variables. 


Describe how the least-squares regression line, standard deviation 


of the residuals, and r2 are influenced by outliers. 


Find the slope and y intercept of the least-squares regression line 
from the means and standard deviations of x and y and their 
correlation. 


Linear (straight-line) relationships between two quantitative variables are 
fairly common. In the preceding section, we found linear relationships in 
settings as varied as Major League Baseball, geysers, and Nobel prizes. 
Correlation measures the strength and direction of these relationships. 
When a scatterplot shows a linear relationship, we can summarize the 
overall pattern by drawing a line on the scatterplot. A regression line 
summarizes the relationship between two variables, but only in a specific 
setting: when one variable helps explain the other. Regression, unlike 
correlation, requires that we have an explanatory variable and a response 
variable. 
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DEFINITION Regression line 


A regression line is a line that describes how a response variable y 
changes as an explanatory variable x changes. Regression lines are 


expressed in the form j=b0+b1x* — bo + b1® here y(pronounced 
“y-hat”) is the predicted value of y for a given value of x. 


It is common knowledge that cars and trucks lose value the more they 
are driven. Can we predict the price of a used Ford F-150 SuperCrew 4 x 4 
truck if we know how many miles it has on the odometer? A random 
sample of 16 used Ford F-150 SuperCrew 4 x 4s was selected from among 
those listed for sale at autotrader.com. The number of miles driven and 
price (in dollars) were recorded for each of the trucks.22 Here are the data: 


Miles driven 70,583 129,484 29,932 29,953 24495 75,678 8359 4447 
Price ($) 21,994 9500 29,875 41,995 41,995 28,986 31,891 37,991 
Miles driven 34,077 58,023 44,447 68,474 144,162 140,776 29,397 131,385 
Price ($) 34,995 29,988 22,896 33,961 16,883 20,897 27,495 13,997 


Figure 3.6 is a scatterplot of these data. The plot shows a moderately 
strong, negative linear association between miles driven and price. There 
are no outliers; however, there are two distinct clusters of trucks: a group 
of 12 trucks between 0 and 80,000 miles driven and a group of 4 trucks 
between 120,000 and 160,000 miles driven. The correlation is r = —0.815. 
The line on the plot is a regression line for predicting price from miles 
driven. 
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FIGURE 3.6 Scatterplot showing the price and miles driven of used Ford F-150s, 
along with a regression line. 


Prediction 


We can use a regression line to predict the value of the response variable 
for a specific value of the explanatory variable. For the Ford F-150 data, 
the equation of the regression line is 


price’ = 38,257—0.1629 (miles driven) 
price = 38,257 — 0.1629 (miles driven) 


If a used Ford F-150 has 100,000 miles driven, substitute x = 100,000 
in the equation. The predicted price is 
price = 38,257—0.1629 (100,000) = $21,967 
price = 38,257 — 0.1629 (100,000) = $21,967 


When we want to refer to the predicted value of a variable, we add a hat on top. 
——e 


Here, price\P!!C€ refers to the predicted price of a used Ford F-150. 


This prediction is illustrated in Figure 3.7. 
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FIGURE 3.7 Using the regression line to predict price for a Ford F-150 with 
100,000 miles driven. 


Even though the value §=$21,967* = $21,967 is unlikely to be the 


actual price of a truck that has been driven 100,000 miles, it’s our best 


guess based on the linear model using x = miles driven. We can also think 


of §=$21,967" = $21,967 as the average price for a sample of trucks that 


have each been driven 100,000 miles. 

Can we predict the price of a Ford F-150 with 300,000 miles driven? 
We can certainly substitute 300,000 into the equation of the line. The 
prediction is 


price4=38,257-0.1629(300,000)=—$10,613 
price = 38,257 — 0.1629(300,000) = —$10,613 
The model predicts that we would need to pay $10,613 just to have 
someone take the truck off our hands! 


A negative price doesn’t make much sense in this context. Look again 
at Figure 3.7. A truck with 300,000 miles driven is far outside the set of x 
values for our data. We can’t say whether the relationship between miles 
driven and price remains linear at such extreme values. Predicting the price 
for a truck with 300,000 miles driven is an extrapolation of the 
relationship beyond what the data show. 


DEFINITION Extrapolation 


Extrapolation is the use of a regression line for prediction far outside 
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the interval of x values used to obtain the line. Such predictions are 
often not accurate. 


Few relationships are linear for all values of the explanatory variable. 
og Don’t make predictions using values of x that are much larger or 


much smaller than those that actually appear in your data. 


EXAMPLE | How much candy can you grab? » 


Prediction 


a, | 


Josh Tabor 


PROBLEM: The scatterplot below shows the hand span (in cm) and 
number of Starburst™ candies grabbed by each student when Mr. 
Tyson’s class did the “Candy grab” activity. The regression line y= 


~29.8+2.83x"— ~29-8 + 2-832 has been added to the scatterplot. 
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Number of Starburst candies 
Nm 
wa 


17 18 19 20 21 22 23 24 25 
Hand span (cm) 


a. Andres has a hand span of 22 cm. Predict the number of Starburst™ 
candies he can grab. 


b. Mr. Tyson’s young daughter McKayla has a hand span of 12 cm. 
Predict the number of Starburst candies she can grab. 


c. How confident are you in each of these predictions? Explain. 
SOLUTION: 


a. 9=-29.842.83(22)" — —29-8 + 2.83(22) 


$=32.46 Starhist ean a 32.46 Starburst candies 


Don’t worry that the predicted number of Starburst candies isn’t an 
integer. Think of 32.46 as the average number of Starburst candies 


that a group of students, each with a hand span of 22 cm, could grab. 


eee — — 29-8 + 2.83(12) 


$= —4 16 Starburst Can a —4.16 Starburst candies 


c. The prediction for Andres is believable because x = 22 is within the 
interval of x-values used to create the model. However, the 
prediction for McKayla is not trustworthy because x = 12 is far 
outside of the x-values used to create the regression line. The linear 
form may not extend to hand spans this small. 
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FOR PRACTICE, TRY EXERCISE 37 


Residuals 


In most cases, no line will pass exactly through all the points in a 
scatterplot. Because we use the line to predict y from x, the prediction 
errors we make are errors in y, the vertical direction in the scatterplot. 


Figure 3.8 shows a scatterplot of the Ford F-150 data with a regression 
line added. The prediction errors are marked as bold vertical segments in 
the graph. These vertical deviations represent “leftover” variation in the 
response variable after fitting the regression line. For that reason, they are 
called residuals. 


45,000 + 
40,000 fs | | 
35,000 it 
ZF 30,000 - i. oe | Regression line 
a “plz ¥ = 38,257 — 0.1629x 
8 25,000 - a 
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Miles driven 


FIGURE 3.8 Scatterplot of the Ford F-150 data with a regression line added. A 
good regression line should make the residuals (shown as bold vertical segments) 
as small as possible. 


DEFINITION Residual 


A residual is the difference between the actual value of y and the 
value of y predicted by the regression line. That is, 
residual=actual y—predicted y=y—y" 
residual = actual y — predicted y 


a“ 


= Ye 


In Figure 3.8 above, the highlighted data point represents a Ford F-150 
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that had 70,583 miles driven and a price of $21,994. The regression line 
predicts a price of 

price’ = 38,257—-0.1629 (70,583) = $26,759 

price = 38,257 — 0.1629 (70,583) = $26,759 


for this truck, but its actual price was $21,994. This truck’s residual is 


residual=actual y—predicted y=y—y/=21,994—-26,759=—$4765 
residual = actualy — predicted y 
= y-y 
21,994 — 26,759 = —$4765 


The actual price of this truck is $4765 less than the cost predicted by 
the regression line with x = miles driven. Why is the actual price less than 
predicted? There are many possible reasons. Perhaps the truck needs body 
work, has mechanical issues, or has been in an accident. 


EXAMPLE |Can you grab more than expected? » 


Calculating and interpreting a residual 


Josh Tabor 


PROBLEM: Here again is the scatterplot showing the hand span (in 
cm) and number of Starburst™ candies grabbed by each student in Mr. 
Tyson’s class. The regression line is y\=-29.8+2.83xY = —29.8 + 2.832 
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Number of Starburst candies 


Hand span (cm) 


Find and interpret the residual for Andres, who has a hand span of 22 
cm and grabbed 36 Starburst candies. 


SOLUTION: 


y\=~29.8+2,83(22)=32.46Y = —29.8 + 2.83(22) = 32.46 Starburst candies 
Residual = 36 — 32.46 = 3.54Residual = 36 — 32.46 = 3.54 Starburst 
candies 


Andres grabbed 3.54 more Starburst candies than the number predicted 
by the regression line with x = hand span. 


Residual=actual y — predicted y=y — y/ 
Residual = actualy — predicted y 


y-y¥ 


FOR PRACTICE, TRY EXERCISE 39 


CHECK YOUR UNDERSTANDING 


Some data were collected on the weight of a male white laboratory 
rat for the first 25 weeks after its birth. A scatterplot of y = weight (in 
grams) and x = time since birth (in weeks) shows a fairly strong, 
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positive linear relationship. The regression equation y\=100+40x 
y = 100 + 402 models the data fairly well. 


1. Predict the rat’s weight at 16 weeks old. 
2. Calculate and interpret the residual if the rat weighed 700 grams at 16 
weeks old. 


3. Should you use this line to predict the rat’s weight at 2 years old? Use 
the equation to make the prediction and discuss your confidence in 
the result. (There are 454 grams in a pound.) 


Interpreting a Regression Line 


A regression line is a model for the data, much like the density curves of 

Chapter 2. The y intercept and slope of the regression line describe what 
this model tells us about the relationship between the response variable y 
and the explanatory variable x. 


DEFINITION jy intercept, Slope 


In the regression equation y\=b0+b1x:¥ = 60 +X : 


e bg is the y intercept, the predicted value of y when x = 0 


e 5, is the slope, the amount by which the predicted value of y 


changes when x increases by 1 unit 


You are probably accustomed to the form y = mx + b for the equation 
of a line. However, statisticians prefer the form y\=b0+b1x¥ = bo + x 
because it extends easily to settings where multiple explanatory variables 
are used to predict the value of a response variable. Either form can be 
used, but we will stick with y\=b0+b1xY = bo + 61@, In both cases, the 
slope is the coefficient of x. 


Many graphing calculators use the form yA=atbxY = a+ b2 to avoid using 


subscripts. The slope is always the coefficient of x, no matter which form is used. 
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Let’s return to the Ford F-150 data. The equation of the regression line 
for these data is y\=38,257-0.1629xY = 38, 257 — 0.1629, | where x = 
miles driven and y = price. The slope b1= -0.1629%1= — 9.1629 tells us 
that the predicted price of a used Ford F-150 goes down by $0.1629 (16.29 
cents) for each additional mile that the truck has been driven. The y 
intercept bO= 38,2570 = 38,257 is the predicted price (in dollars) of a used 
Ford F-150 that has been driven 0 miles. 


AP® EXAM TIP 


When asked to interpret the slope or y intercept, it is very important to 
include the word predicted (or equivalent) in your response. Otherwise, 
it might appear that you believe the regression equation provides actual 
values of y. 


The slope of a regression line is an important numerical description of 
the relationship between the two variables. Although we need the value of 
the y intercept to draw the line, it is statistically meaningful only when the 
explanatory variable can actually take values close to 0, as in the Ford F- 
150 data. In other cases, using the regression line to make a prediction for 
x = 0 is an extrapolation. 


EXAMPLE |Grabbing more candy >» 
Interpreting the slope and y intercept 


PROBLEM: The scatterplot shows the hand span (in cm) and number 
of Starburst™ candies grabbed by each student in Mr. Tyson’s class, 
along with the regression line y\=-29.8+2.83xY = —29.8 + 2.832, 


a. Interpret the slope of the regression line. 


b. Does the value of the y intercept have meaning in this context? If so, 
interpret the y intercept. If not, explain why. 
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Number of Starburst candies 
Nm 
wa 
1 


17 18 19 20 21 22 23 24 25 
Hand span (cm) 


SOLUTION: 


a. The predicted number of Starburst candies grabbed goes up by 2.83 
for each increase of 1 cm in hand span. 


Remember that the slope describes how the predicted value of y 


changes, not the actual value of y. 


b. The y intercept does not have meaning in this case, as it is impossible 
to have a hand span of 0 cm. 


Predicting the number of Starburst candies when x = 0 is an 


extrapolation—and results in an unrealistic prediction of —29.8. 


FOR PRACTICE, TRY EXERCISE 41 


For the Ford F-150 data, the slope b1= —0.1629%1= ~— 9.1629 is very 
close to 0. This does not mean that change in miles driven has little effect 
on price. The size of the slope depends on the units in which we measure 
the two variables. In this setting, the slope is the predicted change in price 
(in dollars) when the distance driven increases by 1 mile. There are 100 
cents in a dollar. If we measured price in cents instead of dollars, the slope 
would be 100 times steeper, b1= —16.29%1= ~ 16.29. You can’t say how 
strong a relationship is by looking at the slope of the regression line. 
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CHECK YOUR UNDERSTANDING 


Some data were collected on the weight of a male white laboratory 
rat for the first 25 weeks after its birth. A scatterplot of y = weight (in 
grams) and x = time since birth (in weeks) shows a fairly strong, 
positive linear relationship. The regression equation y/ = 100 + 40x 
y = 100 + 402 models the data fairly well. 

1. Interpret the slope of the regression line. 


2. Does the value of the y intercept have meaning in this context? If so, 
interpret the y intercept. If not, explain why. 


The Least-Squares Regression Line 


There are many different lines we could use to model the association in a 
particular scatterplot. A good regression line makes the residuals as small 


as possible. 


Duncan Selby/Alamy 


In the F-150 example, the regression line we used is 
y = 38,257 — 0.1629x¥ = 38,257 — 0.16292, How does this line make 


the residuals “as small as possible”? Maybe this line minimizes the sum of 
the residuals. If we add the prediction errors for all 16 trucks, the positive 


and negative residuals cancel out, as shown in Figure 3.9(a). That’s the 
same issue we faced when we tried to measure deviation around the mean 


in Chapter 1. We’ll solve the current problem in much the same way—by 
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squaring the residuals. 
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FIGURE 3.9 Scatterplots of the Ford F-150 data with the regression line added. 
(a) The residuals will add to approximately 0 when using a good regression line. 
(b) A good regression line should make the sum of squared residuals as small as 
possible. 


A good regression line will have a sum of residuals near 0. But the 
regression line we prefer is the one that minimizes the sum of the squared 
residuals. That’s what the line shown in Figure 3.9(b) does for the Ford F- 
150 data, which is why we call it the least-squares regression line. No 
other regression line would give a smaller sum of squared residuals. 


DEFINITION Least-squares regression line 


The least-squares regression line is the line that makes the sum of 
the squared residuals as small as possible. 


Your calculator or statistical software will give the equation of the 
least-squares line from data that you enter. Then you can concentrate on 
understanding and using the regression line. 


AP® EXAM TIP 


When displaying the equation of a least-squares regression line, the 
calculator will report the slope and intercept with much more precision 
than we need. There is no firm rule for how many decimal places to 
show for answers on the AP® Statistics exam. Our advice: decide how 
much to round based on the context of the problem you are working on. 
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9. Technology Corner |CALCULATING LEAST- 
SQUARES REGRESSION LINES (>) 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


Let’s use the Ford F-150 data to show how to find the equation of the 
least-squares regression line on the TI-83/84. Here are the data again: 


Miles driven 70,583 129,484 29,932 29,953 24,495 75,678 8359 4447 
Price ($) 21,994 9500 29,875 41,995 41,995 28,986 31,891 37,991 
Miles driven 34,077 58,023 44,447 68,474 144,162 140,776 29,397 131,385 
Price ($) 34,995 29,988 22,896 33,961 16,883 20,897 27,495 13,997 


1. Enter the miles driven data into L1 and the price data into L2. 


2. To determine the least-squares regression line, press choose 
CALC and then LinReg(a+bx). Note: The TI-83/84 uses the form 
y\=at bx¥ =@ + bz instead of y* = b0 + bix¥ = 50 4 bx. but 
they are equivalent. 


e OS 2.55 or later: In the dialog box, enter the following: Xlist:L1, 
Ylist:L2, FreqList (leave blank), Store RegEQ (leave blank), and 
choose Calculate. 


e Older OS: Finish the command to read LinReg(a+bx) L1,L2 and 


press. 


ORMAL FLOAT AUTO REAL RADIAN HP Fl 
| 


y=atbx 
a=38257.13507 
b=~. 1629185531 
r2=, 664247901 
r=~.8150140496 


Note: If r2 and r do not appear on the TI-83/84 screen, do this one-time 
series of keystrokes: 
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° OS 2.55 or later: Press and set STAT DIAGNOSTICS to 
ON. Then redo Step 2 to calculate the least-squares line. The r2 and r 


values should now appear. 


e Older OS: Press [o| (CATALOG), scroll down to 
DiagnosticOn, and press [ENTER|. Press again to execute 
the command. The screen should say “Done.” Then redo Step 2 to 
calculate the least-squares line. The r2 and r values should now 


appear. 


To graph the least-squares regression line on the scatterplot: 


1. Set up a scatterplot (see Technology Corner 8 on page 159). 


2. Press and enter the equation of the least-squares regression line 
in Y1. 


3. Press |ZOOM| and choose ZoomStat to see the scatterplot with the 
least-squares regression line. 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


Note: When you calculate the equation of the least-squares regression 
line, you can have the calculator store the equation to Y1. When setting 
up the calculation, enter Y1 for the StoreRegEg prompt blank (OS 2.55 
or later) or use the following command (older OS): LinReg(a+bx) 
L1,L2,Y1. Y1 is found by pressing and selecting Y-VARS, then 
Function, then Y1. 


Determining if a Linear Model Is Appropriate: Residual 
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Plots 


One of the first principles of data analysis is to look for an overall pattern 
and for striking departures from the pattern. A regression line describes the 
overall pattern of a linear relationship between an explanatory variable and 
a response variable. We see departures from this pattern by looking at a 


residual plot. 


DEFINITION Residual plot 


A residual plot is a scatterplot that displays the residuals on the 
vertical axis and the explanatory variable on the horizontal axis. 


Some software packages prefer to plot the residuals against the predicted values y 
instead of against the values of the explanatory variable. The basic shape of the two 


plots is the same because y is linearly related to x. 


Residual plots help us assess whether or not a linear model is 
appropriate. In Figure 3.10(a), the scatterplot shows the relationship 
between the average income (gross domestic product per person, in 
dollars) and fertility rate (number of children per woman) in 187 countries, 
along with the least-squares regression line. The residual plot in Figure 
3.10(b) shows the average income for each country and the corresponding 
residual. 
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FIGURE 3.10 The (a) scatterplot and (b) residual plot for the relationship between 
fertility rate and average income for a sample of countries. 


The least-squares regression line clearly doesn’t fit this association 
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very well! For most countries with average incomes under $5000, the 
actual fertility rates are greater than predicted, resulting in positive 
residuals. For countries with average incomes between $5000 and $60,000, 
the actual fertility rates tend to be smaller than predicted, resulting in 
negative residuals. Countries with average incomes above $60,000 all have 
fertility rates greater than predicted, again resulting in positive residuals. 
This U-shaped pattern in the residual plot indicates that the linear form of 
our model doesn’t match the form of the association. A curved model 
might be better in this case. 


In Figure 3.11(a), the scatterplot shows the Ford F-150 data, along with 
the least-squares regression line. The corresponding residual plot is shown 


in Figure 3.11(b). 


45,000 4 10,000 4 
40,000 4 
35,000 = ——~ —* . 5000 4 
30,000 4 


25,000 = 


Price ($) 
Residual 


20.000 4 
15,000 4 ag -50004 * 


10,000 = 


5000 5 T T T T T 1 1 — —10,000 4 - - - - - - - 
0 40,000 80,000 120,000 160,000 0 40,000 80,000 120,000 160,000 
(a) Miles driven (b) Miles driven 


FIGURE 3.11 The (a) scatterplot and (b) residual plot for the relationship between 
price and miles driven for Ford F-150s. 


Looking at the scatterplot, the line seems to be a good fit for this 
relationship. You can “see” that the line is appropriate by the lack of a 
leftover curved pattern in the residual plot. In fact, the residuals look 
randomly scattered around the residual = 0 line. 


HOW TO INTERPRET A RESIDUAL PLOT 


To determine whether the regression model is appropriate, look at the 
residual plot. 


e If there is no leftover curved pattern in the residual plot, the 
regression model is appropriate. 


e If there is a leftover curved pattern in the residual plot, consider using 
a regression model with a different form. 
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EXAMPLE | Pricing diamonds (>) 


Interpreting a residual plot 


JGI/Getty Images 


PROBLEM: Is a linear model appropriate to describe the relationship 
between the weight (in carats) and price (in dollars) of round, clear, 
internally flawless diamonds with excellent cuts? We calculated a least- 
squares regression line using x = weight and y = price and made the 
corresponding residual plot shown.22 Use the residual plot to determine 
if the linear model is appropriate. 


Residual ($) 


—50,000 


—100,000 


Weight (carats) 


SOLUTION: 


The linear model relating price to carat weight is not appropriate 
because there is a U-shaped pattern left over in the residual plot. 


FOR PRACTICE, TRY EXERCISE 47 
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Think About It 


WHY DO WE LOOK FOR PATTERNS IN RESIDUAL PLOTS? 
The word residual comes from the Latin word residuum, meaning “left 
over.” When we calculate a residual, we are calculating what is left over 
after subtracting the predicted value from the actual value: 


residual = actual y — predicted yresidual = actual y — predicted y 


Likewise, when we look at the form of a residual plot, we are looking at 
the form that is left over after subtracting the form of the model from 
the form of the association: 

form of residual plot = form of association — form of model 

form of residual plot = form of association form of model 
When there is a leftover form in the residual plot, the form of the 
association and form of the model are not the same. However, if the 
form of the association and form of the model are the same, the residual 
plot should have no form, other than random scatter. 


10. Technology Corner | MAKING RESIDUAL PLOTS 
Ne 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 

Let’s continue the analysis of the Ford F-150 miles driven and price 
data from Technology Corner 9 (page 184). You should have already 
made a scatterplot, calculated the equation of the least-squares 
regression line, and graphed the line on the scatterplot. Now, we want 
to calculate residuals and make a residual plot. Fortunately, your 
calculator has already done most of the work. Each time the calculator 
computes a regression line, it computes the residuals and stores them in 
a list named RESID. 


1. Set up a scatterplot in the statistics plots menu. 


° Press (STAT PLOT). 


423 


e Press to |1| go into Plot1. 


e Adjust the settings as shown. The RESID list is found in the List 
menu by pressing ISTAT|. Note: You have to calculate the 
equation of the least-squares regression line using the calculator 
before making a residual plot. Otherwise, the RESID list will 
include the residuals from a different least-squares regression line. 

NORMAL FLOAT AUTO REAL RADIAN HP fn 


PRESS (<] OR [>] TO SELECT AN OPTION 
Plot2 Plot3 


Off 
Type: (Bm L* dh oe Hh [| 
Xlist:Li 


Ylist:RESID 
Mark :F] + - 


Color Ese 


2. Use ZoomStat to let the calculator choose an appropriate window. 


e Press |ZOOM| and choose 9: ZoomStat. 


Note: If you want to see the values of the residuals, you can have the 
calculator put them in L3 (or any list). In the list editor, highlight the 
heading of L3, choose the RESID list from the LIST menu, and press 


[ENTER|. 


CHECK YOUR UNDERSTANDING 
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( 


y In Exercises 3 and 7, we asked you to make and describe a 
scatterplot for the hiker data shown in the table. 


Body weight (Ib) 120 187 109 103 131 165 158 116 
Backpack weight (Ib) De ) 2 Wl OR Fe OE 


1. Calculate the equation of the least-squares regression line. 
2. Make a residual plot for the linear model in Question 1. 


3. What does the residual plot indicate about the appropriateness of the 
linear model? Explain your answer. 


How Well the Line Fits the Data: The Role of s and r’ in 
Regression 


We use a residual plot to determine if a least-squares regression line is an 
appropriate model for the relationship between two variables. Once we 
determine that a least-squares regression line is appropriate, it makes sense 
to ask a follow-up question: How well does the line work? That is, if we 
use the least-squares regression line to make predictions, how good will 
these predictions be? 


THE STANDARD DEVIATION OF THE RESIDUALS We already 
know that a residual measures how far an actual y value is from its 
corresponding predicted value y. Earlier in this section, we calculated the 
residual for the Ford F-150 with 70,583 miles driven and price $21,994. 
As shown in Figure 3.12, the residual was -$4765, meaning that the actual 
price was $4765 less than we predicted. 
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FIGURE 3.12 Scatterplot of the Ford F-150 data with a regression line added. 
Residuals for each truck are shown with vertical line segments. 


To assess how well the line fits all the data, we need to consider the 
residuals for each of the trucks, not just one. Here are the residuals for all 
16 trucks: 


—-4765 —-7664 -3506 8617 7728 3057 -5004 458 
2289 1183 -8121 6858 2110 5572 = -5973 —2857 


Using these residuals, we can estimate the “typical” prediction error 
when using the least-squares regression line. To do this, we calculate the 
standard deviation of the residuals s. 


DEFINITION Standard deviation of residuals s 


The standard deviation of the residuals s measures the size of a 
typical residual. That is, s measures the typical distance between the 
actual y values and the predicted y values. 


To calculate s, use the following formula: 


s=sum of squared residualsn—2=) (yi-y/i)2n-2 


en ~ \2 
/sum of squared residuals __, > (¥j;—-9;) 
= \ n—2 — n—2 


For the Ford F-150 data, the standard deviation of the residuals is 
s=(—-4765)2+(—7664)2+ : - -+(—2857)216-2=461,264,13614=$5740 
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| (4765)? +(—7664)?+-.-+(—2857)? [461,264,136 
a | 16-2 ~ Vi 74 
= $5740 


Interpretation: The actual price of a Ford F-150 is typically about $5740 
away from the price predicted by the least-squares regression line with x = 
miles driven. If we look at the residual plot in Figure 3.11, this seems like 
a reasonable value. Although some of the residuals are close to 0, others 
are close to $10,000 or —$10,000. 


Think About It 


DOES THE FORMULA FOR s LOOK SLIGHTLY FAMILIAR? It 
should. In Chapter 1, we defined the standard deviation of a set of 
quantitative data as 


> (x;-%)? 


n—l 


= sz = . 
Sx=)' (xi-x )2n-1 


We interpreted the resulting value as the “typical” distance of the 
data points from the mean. In the case of two-variable data, we’re 
interested in the typical (vertical) distance of the data points from the 
regression line. We find this value in much the same way: first add up 
the squared deviations, then average them (again, in a funny way), and 
take the square root to get back to the original units of measurement. 


THE COEFFICIENT OF DETERMINATION r? There is another 
numerical quantity that tells us how well the least-squares line predicts 
values of the response variable y. It is r2, the coefficient of determination. 
Some computer packages call it “R-sq.” You may have noticed this value 
in some of the output that we showed earlier. Although it’s true that r2 is 
equal to the square of the correlation r, there is much more to this story. 


DEFINITION The coefficient of determination r2 


Z 


The coefficient of determination r~ measures the percent reduction 
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in the sum of squared residuals when using the least-squares 
regression line to make predictions, rather than the mean value of y. 
In other words, r2 measures the percent of the variability in the 
response variable that is accounted for by the least-squares regression 
line. 


Holly Albrecht 


Suppose we wanted to predict the price of a particular used Ford F- 
150, but we didn’t know how many miles it had been driven. Our best 
guess would be the average cost of a used Ford F-150, y = $27,834 
¥ = $27,834. Of course, this prediction is unlikely to be very good, as the 
prices vary quite a bit from the mean (sy=$9570°9 — $9570) If we knew 
how many miles the truck had been driven, we could use the least-squares 
regression line to make a better prediction. How much better are 
predictions that use the least-squares regression line with x = miles driven, 
rather than predictions that use only the average price? The answer is r2 


The scatterplot in Figure 3.13(a) shows the squared residuals along 
with the sum of squared residuals (approximately 1,374,000,000) when 
using the average price as the predicted value. The scatterplot in Figure 
3.13(b) shows the squared residuals along with the sum of squared 
residuals (approximately 461,300,000) when using the least-squares 
regression line with x = miles driven to predict the price. Notice that the 
squares in part (b) are quite a bit smaller. 


. 


428 


Price ($1000s) 
N 
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Miles driven (1000s) Miles driven (1000s) 
Price = 27,834 Price = 38,257 — 0.1629 Miles driven; 2 = 0.66 
(a) Sum of squares = 1,374,000,000 (b) Sum of squares = 461,300,000 


FIGURE 3.13 (a) The sum of squared residuals is about 1,374,000,000 if we use 
the mean price as our prediction for all 16 trucks. (b) The sum of squared residuals 
from the least-squares regression line is about 461,300,000. 


To find r2, calculate the percent reduction in the sum of squared 
residuals: 
r2=1,374,000,000—461,300,0001,374,000,000=912,700,0001,374,000,000= 
9 _ 1,374,000,000—461,300,000 = 912,700,000 _ 0.66 

i 1,374,000,000 ~ 1,374,000,000 


The sum of squared residuals has been reduced by 66%. 


Interpretation: About 66% of the variability in the price of a Ford F-150 is 
accounted for by the least-squares regression line with x = miles driven. 
The remaining 34% is due to other factors, including age, color, and 
condition. 

If all the points fall directly on the least-squares line, the sum of 
squared residuals is 0 and r2 = 1. Then all the variation in y is accounted 
for by the linear relationship with x. In the worst-case scenario, the least- 
squares line does no better at predicting y than y=y ¥ = Y does. Then the 
two sums of squared residuals are the same and r2 = 0. 


It’s fairly remarkable that the coefficient of determination r? is actually 


the square of the correlation. This fact provides an important connection 
between correlation and regression. When you see a linear association, 
square the correlation to get a better feel for how well the least-squares 
line fits the data. 
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Think About It 


WHAT?’S THE RELATIONSHIP BETWEEN s AND r2? Both s and 
? are calculated from the sum of squared residuals. They also both 
measure how well the line fits the data. The standard deviation of the 
residuals reports the size of a typical prediction error, in the same units 
as the response variable. In the truck example, s=$5740s = $5740, The 


r 


value of r2, however, does not have units and is usually expressed as a 
percentage between 0% and 100%, such as 12=66%r* = 66%. Because 
these values assess how well the line fits the data in different ways, we 
recommend you follow the example of most statistical software and 
report both. 


Knowing how to interpret s and r2 is much more important than 
knowing how to calculate them. Consequently, we typically let technology 
do the calculations. 


EXAMPLE | Grabbing candy, again » 


Interpreting s and r2 


Kylie McManis 


PROBLEM: The scatterplot shows the hand span (in centimeters) and 
number of Starburst™ candies grabbed by each student in Mr. Tyson’s 
class, along with the regression line y\=-29.8+2.83xY = —29.8 + 2.832, 


For this model, technology gives s = 4.03 and r2 = 0.697. 
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a. Interpret the value of s. 
b. Interpret the value of r?. 


40 4 ie 


35 - e ee. 


Number of Starburst candies 
mR 
\ 


17 18 19 2 2 2 «23 2 « 25 
Hand span (cm) 


SOLUTION: 
a. The actual number of Starburst!™ candies grabbed is typically about 


4.03 away from the number predicted by the least-squares regression 
line with x = hand span. 


b. About 69.7% of the variability in number of Starburst candies 
grabbed is accounted for by the least-squares regression line with x = 
hand span. 


FOR PRACTICE, TRY EXERCISE 55 


Interpreting Computer Regression Output 


Figure 3.14 displays the basic regression output for the Ford F-150 data 
from two statistical software packages: Minitab and JMP. Other software 
produces very similar output. Each output records the slope and y intercept 
of the least-squares line. The software also provides information that we 
don’t yet need, although we will use much of it later. Be sure that you can 
locate the slope, the y intercept, and the values of s (called root mean 
square error in JMP) and r¢ on both computer outputs. Once you 
understand the statistical ideas, you can read and work with almost any 
software output. 
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Minitab 


Slope y intercept 


Predictor coet / SE Coef 
Constant (38257) 2446 15.64 0.000 


0.03096 -5.26 0.000 


Miles Driven 


r2 


S = 5740.13 


Standard deviation of the residuals 


Summary of Fit 

RSquare 

RSquare Adj 

Root Mean Square Error 
Mean of Response 
Observations (or Sum Wgts) 


Parameter Estimates 


Term Estimate 
Intercept 4428257.135) 
Miles Driven / 
/ 
y intercept 


JMP 


- ad 
0.664248 Standard 
0.640266 - deviation 
of the residuals 
27833.69 
16 


Std Error t Ratio Prob>|t| 


2445.813 15.64 <.0001 
0.030956 -5.26 0.0001 
Slope 


FIGURE 3.14 Least-squares regression results for the Ford F-150 data from 
Minitab and JMP statistical software. Other software produces similar output. 
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EXAMPLE | Using feet to predict height (>) 


Interpreting regression output 


PROBLEM: A random sample of 15 high school students was selected 
from the U.S. Census At School database. The foot length (in 
centimeters) and height (in centimeters) of each student in the sample 
were recorded. Here are a scatterplot with the least-squares regression 


line added, a residual plot, and some computer output: 


195 
190 
185 
180 
175 
170 


Height (cm) 


165 


155 


—— 
24 26 28 30 


Foot length (cm) 
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Residual (cm) 
+ 


T T T T tk 
22 24 26 28 30 


Foot length (cm) 


Predictor Coef SE Coef Ab P 

Constant 103.4100 19.5000 5.30 0.000 
Foot length 2.7469 0.7833 ei fos. 0.004 
S = 7.95126 R-Sq = 48.6% R-Sq(adj) = 44.7% 


a. Is a line an appropriate model to use for these data? Explain how you 
know. 


b. Find the correlation. 


c. What is the equation of the least-squares regression line that 
describes the relationship between foot length and height? Define any 
variables that you use. 


d. By about how much do the actual heights typically vary from the 
values predicted by the least-squares regression line with x = foot 
length? 


SOLUTION: 


a. Because the scatterplot shows a linear association and the residual 
plot has no obvious leftover curved patterns, a line is an appropriate 
model to use for these data. 


b. r=0.486=0.697" = V 0.486 = 0.697 


The correlation r is the square root of r*, where r? is a value between 


0 and 1. Because the square root function on your calculator will 
always give a positive result, make sure to consider whether the 
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correlation is positive or negative. If the slope is negative, so is the 
correlation. 


c. height = 103.41 + 2.7469 (foot length) 
height = 103.41 + 2.7469 (foot length) 


We could also write the equation as y\=103.41+2.7469x 
y = 103.41 + 2.74692 where § = predicted height (cm) and x is foot 


length (cm). 


d. s = 7.95, so the actual heights typically vary by about 7.95 cm from 
the values predicted by the regression line with x = foot length. 


FOR PRACTICE, TRY EXERCISE 59 


CHECK YOUR UNDERSTANDING 


( 


y In Section 3.1, you read about the Old Faithful geyser in 
Yellowstone National Park. The computer output shows the results of a 
regression of y = interval of time until the next eruption (in minutes) and 
x = duration of the most recent eruption (in minutes) for each eruption 
of Old Faithful in a particular month. 


Summary of Fit 


RSquare 0.853725 
RSquare Adj 0.853165 
Root Mean Square Error 6.493357 
Mean of Response 77.543730 
Observations (or Sum 263.000000 
Wgts) 


Parameter Estimates 
Term Estimate Std Error t Ratio Prob>|t| 
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Intercept 33.347442 1.201081 27.76 <.0001* 
Duration 13.285406 0.340393 39.03 <.0001* 


1. What is the equation of the least-squares regression line that describes 
the relationship between interval and duration? Define any variables 
that you use. 


2. Interpret the slope of the least-squares regression line. 
3. Identify and interpret the standard deviation of the residuals. 


4. What percent of the variability in interval is accounted for by the 
least-squares regression line with x = duration? 


Regression to the Mean 


Using technology is often the most convenient way to find the equation of 
a least-squares regression line. It is also possible to calculate the equation 
of the least-squares regression line using only the means and standard 
deviations of the two variables and their correlation. Exploring this method 
will highlight an important relationship between the correlation and the 
slope of a least-squares regression line—and reveal why we include the 
word regression in the expression least-squares regression line. 


HOW TO CALCULATE THE LEAST-SQUARES 
REGRESSION LINE USING SUMMARY STATISTICS 


We have data on an explanatory variable x and a response variable y for 
n individuals. From the data, calculate the means x £ and y Y andthe 
standard deviations s, and sy of the two variables and their correlation 


r. The least-squares regression line is the line y\=b0+b1xY = 40 + 62 


with slope 


and y intercept 
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The formula for the slope reminds us that the distinction between 
explanatory and response variables is important in regression. Least- 
Squares regression makes the distances of the data points from the line 
small only in the y direction. If we reverse the roles of the two variables, 
the values of s, and sy will reverse in the slope formula, resulting in a 
different least-squares regression line. This is not true for correlation: 
switching x and y does not affect the value of r. 

The formula for the y intercept comes from the fact that the least- 
Squares regression line always passes through the point (xy )(@ y), 
Once we know the slope (b,) and that the line goes through the point 
(xy vy), we can use algebra to solve for the y intercept. Substituting 
(xy Y) into the equation y\=b0+b1x¥Y = 40 + 61# produces the 
equation y =b0+bix ¥ = 40 + 61%, Solving this equation for bo gives the 
equation shown in the definition box, bO=y —b1x 0 = ¥ ~ 51%, To see 
how these formulas work in practice, let’s look at an example. 


EXAMPLE |More about feet and height (») 


Calculating the least-squares regression line 


panpote/Getty Images 


PROBLEM: In the preceding example, we used data from a random 
sample of 15 high school students to investigate the relationship 
between foot length (in centimeters) and height (in centimeters). The 
mean and standard deviation of the foot lengths are x =24.76% = 24.76 
and s, = 2.71. The mean and standard deviation of the heights are 

y =171.439 = 171.43 and Sy = 10.69. The correlation between foot 


length and height is r = 0.697. Find the equation of the least-squares 
regression line for predicting height from foot length. 
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SOLUTION: 


10.69 
b, = 0.697 = 2.75 
b1=0.69710.692.71=2.75 2.71 


Be gee) —taea’o — 171.43 — 2.75 (24.76) = 103.34 


Sy 
r 


b1=rsysx Sz 
b0=y —b1x 6 = 9 — hz 


The equation of the least-squares regression line is y\=103.34+2.75x 
j = 103.34 + 2.752. 


FOR PRACTICE, TRY EXERCISE 63 


There is a close connection between the correlation and the slope of the 
least-squares regression line. The slope is 


bl=rsysx=r: sysx Sx 8x 


This equation says that along the regression line, a change of 1 
standard deviation in x corresponds to a change of r standard deviations in 
y. When the variables are perfectly correlated (r = 1 or r = —1), the change 
in the predicted response yy is the same (in standard deviation units) as 
the change in x. For example, if r = 1 and x is 2 standard deviations above 
x #, then the corresponding value of y¥ will be 2 standard deviations 
above y ¥. 

However, if the variables are not perfectly correlated (—1 < r < 1), the 
change in y/Y is less than the change in x, when measured in standard 
deviation units. To illustrate this property, let’s return to the foot length 
and height data from the preceding example. 

Figure 3.15 shows the scatterplot of height versus foot length and the 
regression line y\=103.34+2.75xY = 103.34 + 2.752, We have added four 
more lines to the graph: a vertical line at the mean foot length x Z, a 
vertical line at x +sx” + x (1 standard deviation above the mean foot 
length), a horizontal line at the mean height y ¥, and a horizontal line at 
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y tsyY ~ Sy (1 standard deviation above the mean height). 


[ y = mean height | § = 103.34 + 2.75x] 
195 - 
0 sy 
7 i 
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FIGURE 3.15 Scatterplot showing the relationship between foot length and height 


for a sample of students, along with lines showing the means of x and y and the 
values 1 standard deviation above each mean. 


When a student’s foot length is 1 standard deviation above the mean 
foot length x 2, the predicted height y/¥ is above the mean height y ¥— 
but not an entire standard deviation above the mean. How far above the 
mean is the value of y\Y? 

From the graph, we can see that 


changeiny ?? 
b; = slope =————— = — 

b1i=slope=change in ychange in x=??sx changeing sz 
From earlier, we know that 


r-sy 


1 
b1=r-sysx Sz 


Setting these two equations equal to each other, we have 


2?) oT Sy 


??Sx=rsysxSa $x 
Thus, y\¥ must be r sy above the mean y ¥. 


In other words, for an increase of 1 standard deviation in the value of 
the explanatory variable x, the least-squares regression line predicts an 
increase of only r standard deviations in the response variable y. When the 


438 


correlation isn’t r = 1 or r = —1, the predicted value of y is closer to its 
mean y ¥ than the value of x is to its mean x ©. This is called regression 
to the mean, because the values of y “regress” to their mean. 

Sir Francis Galton (1822-1911) is often credited with discovering the 
idea of regression to the mean. He looked at data on the heights of children 
versus the heights of their parents. He found that taller-than-average 
parents tended to have children who were taller than average but not quite 
as tall as their parents. Likewise, shorter-than-average parents tended to 
have children who were shorter than average but not quite as short as their 
parents. Galton used the symbol r for the correlation because of its 
important relationship to regression. 


Correlation and Regression Wisdom 


Correlation and regression are powerful tools for describing the 
relationship between two variables. When you use these tools, you should 
be aware of their limitations. 


CORRELATION AND REGRESSION LINES DESCRIBE ONLY 
LINEAR RELATIONSHIPS You can calculate the correlation and the 
least-squares line for any relationship between two quantitative variables, 
but the results are useful only if the scatterplot shows a linear pattern. 
Always plot your data first! 

The following four scatterplots show very different relationships. 
Which one do you think shows the greatest correlation? 
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Answer: All four have the same correlation, r = 0.816. Furthermore, 
the least-squares regression line for each relationship is exactly the same, 
yh=3+0.5xY = 3+ 0.52. These four data sets, developed by statistician 
Frank Anscombe, illustrate the importance of graphing data before doing 
calculations.24 
CORRELATION AND LEAST-SQUARES REGRESSION LINES 
ARE NOT RESISTANT You already know that the correlation r is not 
resistant. One unusual point in a scatterplot can greatly change the value of 
r. Is the least-squares line resistant? The following activity will help you 
answer this question. 


ACTIVITY Investigating properties of the least-squares 


prPle> 


aa > 
regression line ¢ 


In this activity, you will use the Correlation and Regression applet to 
explore some properties of the least-squares regression line. 
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10 


Correlation coefficient: 


0.4000 


® Add data points eee 


@ Draw your own line a 
@ Show least-squares line 


@ Show mean X & Y lines 


0 Show residuals 


1. Launch the applet at highschool.bfwpub.com/tps6e. 

2. Click on the graphing area to add 10 points in the lower-left corner so 
that the correlation is about r = 0.40. Also, check the boxes to show 
the “Least-Squares Line” and the “Mean X & Y” lines as in the 
screen shot. Notice that the least-squares regression line goes though 
the point (xy) y). 

3. If you were to add a point on the least-squares regression line at the 
right edge of the graphing area, what do you think would happen to 
the least-squares regression line? Add the point to see if you were 
correct. 


4. Click on the point you just added, and drag it up and down along the 
right edge of the graphing area. What happens to the least-squares 
regression line? 


5. Now, move this point so that it is on the vertical x line. Drag the 
point up and down on the x = line. What happens to the least-squares 
regression line? Do outliers in the vertical direction have as much 
influence on the least-squares regression line as outliers in the 
horizontal direction? 


6. Briefly summarize how outliers influence the least-squares regression 
line. 


Least-squares regression lines make the sum of the squared residuals as 
small as possible, so outliers can be very influential in regression 
calculations. Depending on where they are, outliers can greatly affect the 
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equation of the least-squares regression line, along with other summary 
statistics such as r, r2, and s. The best way to investigate the influence of 
outliers is to do regression calculations with and without the outlier and 
see how much the results differ. 

Does the age at which a child begins to talk predict a later score on a 
test of mental ability? A study of the development of young children 
recorded the age in months at which each of 21 children spoke their first 
word and their Gesell Adaptive Score, the result of an aptitude test taken 
much later.22 A scatterplot of the data appears in Figure 3.16, along with a 
residual plot, and computer output. Two outliers, child 18 and child 19, are 
indicated on each plot. 


140 ~ * = Two children dal Child 19 ¢ = Two children 
Child 19 
120 - . 20 7 
5 1004 a § 10-4 ous 
. Pay ° = bd 
3 ° : — be E a J 
oe e =~ ° & 
Oo 80-4 0 + 
a. ae 40-4 : : Child 18 
Child 18 ° 
40 4 1 1 1 T T -20 + T T 1 T 1 
0 10 20 30 40 50 0 10 20 30 40 50 
Age at first word (months) Age at first word (months) 


FIGURE 3.16 (a) Scatterplot of Gesell Adaptive Scores versus the age at first 
word for 21 children, along with the least-squares regression line. (b) Residual plot 
for the linear model. Child 18 and Child 19 are outliers. Each purple point in the 
graphs stands for two individuals. 


Clover No.7 Photography/Getty Images 
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Child 19 has a very large residual because this point lies far from the 
regression line. However, Child 18’s point is close to the line and has a 
small residual. How do these two outliers affect the regression? Figure 
3.17 shows the results of removing each of these points on the equation of 
the least-squares regression line, the standard deviation of the residuals, 
and r2. 


140 > 


* = Two children With all 19 points: 
Child 19 py =109.9 — 1.127x 
: s = 11.02 and r? = 0.410 


120 4 


Without Child 18: 


[ Without Child 18 y =105.6 — 0.779x 


ss s = 11.11 and r? = 0.112 


60 + Without Child 19 _— 
Child 18 


40 -+— T | T T T 
0 10 20 30 40 50 
Age at first word (months) 


Gesell score 


Without Child 19: 
y =109.3 — 1.193x 
s = 8.63 and r2 = 0.572 


FIGURE 3.17 Three least-squares regression lines of Gesell score on age at first 
word. The green line is calculated from all the data. The dark blue line is calculated 
leaving out only Child 18. The red line is calculated leaving out only Child 19. 


You can see that removing the point for Child 18 moves the line quite 
a bit. Because of Child 18’s extreme position on the age (x) scale, 
removing this point makes the slope closer to 0 and the y intercept smaller. 
Removing Child 18 also increases the standard deviation of the residuals 
because its small residual was making the typical distance from the 
regression line smaller. Finally, removing Child 18 also decreases r2 (and 
makes the correlation closer to 0) because the linear association is weaker 
without this point. 

Child 19’s Gesell score was far above the least-squares regression line, 
but this child’s age (17 months) is very close to x =14.4% = 14.4 months. 
Thus, removing Child 19 has very little effect on the least-squares 
regression line. The line shifts down slightly from the original regression 
line, but not by much. Child 19 has a bigger influence on the standard 
deviation of the residuals: without Child 19’s big residual, the size of the 
typical residual goes from s = 11.02 to s = 8.63. Likewise, without Child 
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19, the strength of the linear association increases and r2 goes from 0.410 
to 0.572. 


Think About It 


WHAT SHOULD WE DO WITH OUTLIERS? The strong influence 
of Child 18 makes the original regression of Gesell score on age at first 
word misleading. The original data have r2 = 0.41. That is, the least- 
squares line with x = age at which a child begins to talk accounts for 
41% of the variability in Gesell score. This relationship is strong 
enough to be interesting to parents. If we leave out Child 18, r2 drops to 
only 11%. The apparent strength of the association was largely due to a 
single influential observation. 

What should the child development researcher do? She must decide 
whether Child 18 is so slow to speak that this individual should not be 
allowed to influence the analysis. If she excludes Child 18, much of the 
evidence for a connection between the age at which a child begins to 
talk and later ability score vanishes. If she keeps Child 18, she needs 
data on other children who were also slow to begin talking, so the 
analysis no longer depends as heavily on just one child. 


EXAMPLE | Dodging the pattern? (>) 


Outliers and influential points 


Robert J Daveant/Shutterstock.com 


PROBLEM: The scatterplot shows the payroll (in millions of dollars) 
and number of wins for Major League Baseball teams in 2016, along 
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with the least-squares regression line. The points highlighted in red 
represent the Los Angeles Dodgers (far right) and the Cleveland Indians 
(upper left). 


a. Describe what influence the point representing the Los Angeles 
Dodgers has on the equation of the least-squares regression line. 
Explain your reasoning. 

b. Describe what influence the point representing the Cleveland Indians 
has on the standard deviation of the residuals and r2. Explain your 


reasoning. 
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SOLUTION: 


a. Because the point for the Los Angeles Dodgers is on the right and 
below the least-squares regression line, it is making the slope of the 
line closer to 0 and the y intercept greater. If the Dodgers’ point was 
removed, the line would be steeper. 


b. Because the point for the Cleveland Indians has a large residual, it is 
making the standard deviation of the residuals greater and the value 
of r2 smaller. 


FOR PRACTICE, TRY EXERCISE 67 
ASSOCIATION DOES NOT IMPLY CAUSATION When we study 
the relationship between two variables, we often hope to show that 


changes in the explanatory variable cause changes in the response 
variable. oOo A strong association between two variables is not enough 
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to draw conclusions about cause and effect. Sometimes an observed 
association really does reflect cause and effect. A household that heats 
with natural gas uses more gas in colder months because cold weather 
requires burning more gas to stay warm. In other cases, an association is 
explained by other variables, and the conclusion that x causes y is not 
valid. 

A study once found that people with two cars live longer than people 
who own only one car.2° Owning three cars is even better, and so on. 
There is a substantial positive association between number of cars x and 
length of life y. Can we lengthen our lives by buying more cars? No. The 
study used number of cars as a quick indicator of wealth. Well-off people 
tend to have more cars. They also tend to live longer, probably because 
they are better educated, take better care of themselves, and get better 
medical care. The cars have nothing to do with it. There is no cause-and- 
effect link between number of cars and length of life. 


Remember: It only makes sense to talk about the correlation between two quantitative 


variables. If one or both variables are categorical, you should refer to the association 
between the two variables. To be safe, use the more general term association when 


describing the relationship between any two variables. 


Section 3.2 Summary 


e A regression line is a line that describes how a response variable y 
changes as an explanatory variable x changes. You can use a regression 
line to predict the value of y for any value of x by substituting this x 
value into the equation of the line. 

e The slope b, of a regression line y\=b0+b1x¥ = 0 + 61% describes how 
the predicted value of y changes for each increase of 1 unit of x. 

e The y intercept bo of a regression line y\=b0+b1x¥ = 40 + 612 js the 
predicted value of y when the explanatory variable x equals 0. This 
prediction is of no statistical use unless x can actually take values near 0. 


e Avoid extrapolation, using a regression line to make predictions using 
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values of the explanatory variable outside the values of the data from 
which the line was calculated. 


The most common method of fitting a line to a scatterplot is least 
squares. The least-squares regression line is the line that minimizes the 
sum of the squares of the vertical distances of the observed points from 
the line. 


You can examine the fit of a regression line by studying the residuals, 
which are the differences between the actual values of y and predicted 
values of y. Be on the lookout for curved patterns in the residual plot, 
which indicate that a linear model may not be appropriate. 


The standard deviation of the residuals s measures the typical size of 
a residual when using the regression line. 


The coefficient of determination r2 is the percent of the variation in the 
response variable that is accounted for by the least-squares regression 
line using a particular explanatory variable. 


The least-squares regression line of y on x is the line with slope 
Sy 
bh =r— » _ 
b1=rsysx 8x and intercept bO=y —b1x % = ¥ ~ 517, This line 
always passes through the point (xy )(@ y), 


Correlation and regression must be used with caution. Plot the data to be 
sure that the relationship is roughly linear and to detect outliers. Also 
look for influential observations, individual points that substantially 
change the correlation or the regression line. Outliers in x are often 
influential. 


Most of all, be careful not to conclude that there is a cause-and-effect 
relationship between two variables just because they are strongly 
associated. 


3.2 Technology Corners (> 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 
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9. Calculating least-squares regression lines Page 184 
10. Making residual plots Page 187 


Section 3.2 Exercises 


37. pg 178 |) Predicting wins Earlier we investigated the relationship 
between x = payroll (in millions of dollars) and y = number of wins 
for Major League Baseball teams in 2016. Here is a scatterplot of the 
data, along with the regression line y\=60.7+0.139x 
y = 60.7 + 0.1392. 
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a. Predict the number of wins for a team that spends $200 million on 
payroll. 


b. Predict the number of wins for a team that spends $400 million on 
payroll. 


c. How confident are you in each of these predictions? Explain your 
reasoning. 


38. How much gas? Joan is concerned about the amount of energy she 
uses to heat her home. The scatterplot shows the relationship between 
X = mean temperature in a particular month and y = mean amount of 
natural gas used per day (in cubic feet) in that month, along with the 
regression line yA=1425-19.87x¥ = 1425 — 19.872, 


a. Predict the mean amount of natural gas Joan will use per day ina 
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39. 


40. 


41. 


42. 


43. 


month with a mean temperature of 30°F. 


900 


600 


Gas consumed (cubic ft) 


30 35 40 45 50 55 60 
Temperature (°F) 


b. Predict the mean amount of natural gas Joan will use per day ina 
month with a mean temperature of 65°F. 


c. How confident are you in each of these predictions? Explain your 
reasoning. 


pg 180 |») Residual wins Refer to Exercise 37. The Chicago Cubs 


won the World Series in 2016. They had 103 wins and spent $182 
million on payroll. Calculate and interpret the residual for the Cubs. 


Residual gas Refer to Exercise 38. During March, the average 
temperature was 46.4°F and Joan used an average of 490 cubic feet of 
gas per day. Calculate and interpret the residual for this month. 


pg 182 |») More wins? Refer to Exercise 37. 


a. Interpret the slope of the regression line. 


b. Does the value of the y intercept have meaning in this context? If 
so, interpret the y intercept. If not, explain why. 


Less gas? Refer to Exercise 38. 
a. Interpret the slope of the regression line. 


b. Does the value of the y intercept have meaning in this context? If 
so, interpret the y intercept. If not, explain why. 


Long strides The scatterplot shows the relationship between x = 
height of a student (in inches) and y = number of steps required to 
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walk the length of a school hallway, along with the regression line 
yA=113.6-0.921xY = 113.6 — 0.9212, 
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a. Calculate and interpret the residual for Kiana, who is 67 inches tall 
and took 49 steps to walk the hallway. 


b. Matthew is 10 inches taller than Samantha. About how many fewer 
steps do you expect Matthew to take compared to Samantha? 


44. Crickets chirping The scatterplot shows the relationship between x = 
temperature in degrees Fahrenheit and y = chirps per minute for the 
striped ground cricket, along with the regression line yA= 
~0.31+0.212x¥ = —0.31 + 0.2122 27 


20 + ° 


Chirps per minute 


Temperature (°F) 


a. Calculate and interpret the residual for the cricket who chirped 20 
times per minute when the temperature was 88.6°F. 


b. About how many additional chirps per minute do you expect a 
cricket to make if the temperature increases by 10°F? 
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45. More Olympic athletes In Exercises 5 and 11, you described the 
relationship between height (in inches) and weight (in pounds) for 
Olympic track and field athletes. The scatterplot shows this 
relationship, along with two regression lines. The regression line for 
the shotput, hammer throw, and discus throw athletes (blue squares) 
is yA=-115+5.13xY = —115 + 5.132, The regression line for the 
remaining athletes (black dots) is y\=-297+6.41xY = —297 + 6.412, 
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a. How do the regression lines compare? 


b. How much more do you expect a 72-inch discus thrower to weigh 
than a 72-inch sprinter? 


46. More Starbucks In Exercises 6 and 12, you described the 
relationship between fat (in grams) and the number of calories in 
products sold at Starbucks. The scatterplot shows this relationship, 
along with two regression lines. The regression line for the food 
products (blue squares) is y\=170+11.8xY = 170+ 11.82, The 


regression line for the drink products (black dots) is yA=88+24.5x 
y = 88 + 24.52. 
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Calories 


Fat (g) 


a. How do the regression lines compare? 


b. How many more calories do you expect to find in a food item with 
5 grams of fat compared to a drink item with 5 grams of fat? 


47. pg 186 |») Infant weights in Nahya A study of nutrition in 


developing countries collected data from the Egyptian village of 
Nahya. Researchers recorded the mean weight (in kilograms) for 170 
infants in Nahya each month during their first year of life. A hasty 
user of statistics enters the data into software and computes the least- 
squares line without looking at the scatterplot first. The result is 


weight\=4.88+0.267Weight = 4.88 + 0.267 (age). Use the residual 
plot to determine if this linear model is appropriate. 
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48. Driving speed and fuel consumption Exercise 9 (page 171) gives 
data on the fuel consumption y of a car at various speeds x. Fuel 
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consumption is measured in liters of gasoline per 100 kilometers 
driven, and speed is measured in kilometers per hour. A statistical 
software package gives the least-squares regression line yA=11.058— 
0.01466x¥ = 11.058- 0.014662 Use the residual plot to determine if 
this linear model is appropriate. 


Residual 
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Speed (km/h) 


49. Actual weight Refer to Exercise 47. Use the equation of the least- 


50. 


ol. 


Squares regression line and the residual plot to estimate the actual 
mean weight of the infants when they were 1 month old. 


Actual consumption Refer to Exercise 48. Use the equation of the 
least-squares regression line and the residual plot to estimate the 
actual fuel consumption of the car when driving 20 kilometers per 
hour. 


Movie candy Is there a relationship between the amount of sugar (in 
grams) and the number of calories in movie-theater candy? Here are 
the data from a sample of 12 types of candy: 


Name Sugar (g) Calories 
Butterfinger Minis 45 450 
Junior Mints 107 570 
M&M’S® 62 480 
Milk Duds 44 370 
Peanut M&M’S® 79 790 
Raisinets 60 420 
Reese’s Pieces 61 980 
Skittles 87 450 
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Sour Patch Kids 92 490 


SweeTarts 136 680 
Twizzlers 59 460 
Whoppers 48 350 


a. Sketch a scatterplot of the data using sugar as the explanatory 
variable. 


b. Use technology to calculate the equation of the least-squares 
regression line for predicting the number of calories based on the 
amount of sugar. Add the line to the scatterplot from part (a). 


c. Explain why the line calculated in part (b) is called the “least- 
squares” regression line. 


52. Long jumps Here are the 40-yard-dash times (in seconds) and long- 
jump distances (in inches) for a small class of 12 students: 


Dash time (sec) 5.41 5.05 7.01 7.17 6.73 5.68 
Long-jump distance (in.) 171 =184 90 65 78 =130 
Dash time (sec) 5.78 6.31 644 650 680 7.25 
Long-jump distance (in.) 173. 143 92 139 120 = 110 


a. Sketch a scatterplot of the data using dash time as the explanatory 
variable. 


b. Use technology to calculate the equation of the least-squares 
regression line for predicting the long-jump distance based on the 
dash time. Add the line to the scatterplot from part (a). 


c. Explain why the line calculated in part (b) is called the “least- 
squares” regression line. 


53. More candy Refer to Exercise 51. Use technology to create a residual 
plot. Sketch the residual plot and explain what information it 
provides. 


54. More long jumps Refer to Exercise 52. Use technology to create a 
residual plot. Sketch the residual plot and explain what information it 
provides. 
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56. 


o7. 


pg 191 |») Longer strides In Exercise 43, we summarized the 
relationship between x = height of a student (in inches) and y = 
number of steps required to walk the length of a school hallway, with 
the regression line y\=113.6—-0.921x¥ = 113.6 — 0.9212 For this 
model, technology gives s = 3.50 and r2 = 0.399. 


a. Interpret the value of s. 


b. Interpret the value of r2. 


Crickets keep chirping In Exercise 44, we summarized the 
relationship between x = temperature in degrees Fahrenheit and y = 
chirps per minute for the striped ground cricket, with the regression 
line yA=-0.31+0.212x¥ =~ 0.31 + 0.2122, Foy this model, technology 
gives s = 0.97 and r2 = 0.697. 


a. Interpret the value of s. 


b. Interpret the value of r2. 


Olympic figure skating For many people, the women’s figure 
skating competition is the highlight of the Olympic Winter Games. 
Scores in the short program x and scores in the free skate y were 
recorded for each of the 24 skaters who competed in both rounds 
during the 2010 Winter Olympics in Vancouver, Canada.22 Here is a 
scatterplot with least-squares regression line y\=—16.2+2.07x 

y = —16.2 + 2.072. For this model, s = 10.2 and r2 = 0.736. 


Free skate score 


a a Fa a i 
45 50 55 60 65 70 75 80 
Short program score 


a. Calculate and interpret the residual for the 2010 gold medal winner 
Yu-Na Kim, who scored 78.50 in the short program and 150.06 in 
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the free skate. 


b. Interpret the slope of the least-squares regression line. 


c. Interpret the value of s. 


d. Interpret the value of r2. 


58. Age and height A random sample of 195 students was selected from 


59. 


the United Kingdom using the Census At School data selector. The 
age x (in years) and height y (in centimeters) were recorded for each 
student. Here is a scatterplot with the least-squares regression line 
y\=106.1+4.21x¥ = 106.1 + 4.212 For this model, s = 8.61 and r2 = 
0.274. 
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a. Calculate and interpret the residual for the student who was 141 
cm tall at age 10. 


b. Interpret the slope of the least-squares regression line. 


c. Interpret the value of s. 
d. Interpret the value of r2. 


pg 192 |») More mess? When Mentos are dropped into a newly 


opened bottle of Diet Coke, carbon dioxide is released from the Diet 
Coke very rapidly, causing the Diet Coke to be expelled from the 
bottle. To see if using more Mentos causes more Diet Coke to be 
expelled, Brittany and Allie used twenty-four 2-cup bottles of Diet 
Coke and randomly assigned each bottle to receive either 2, 3, 4, or 5 
Mentos. After waiting for the fizzing to stop, they measured the 
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amount expelled (in cups) by subtracting the amount remaining from 
the original amount in the bottle.22 Here is some computer output 
from a regression of y = amount expelled on x = number of Mentos: 


1.45 4 : 
Z 140-4 ; 
g 1.35 4 
a] a % © 
2 130-4 
Es 1.25 4 ’ . ° . 
= 1.20 4 ‘ 
. a 
= * ® 
@ 110-4 
1.05 4 : 
T T T T 
2 3 4 5 
Number of Mentos 
0.10 4 . ; 
= 0.05 4 
a 
& ; ° 
= 0.00- 
= om 
3 -0.05 4 ‘ 
-0.10 5 ; ° 
T T T T 
2 3 4 5 
Number of Mentos 
Term Coef SE Coef T-Value P-Value 
Constant 1.0021 0.0451 2271 0.000 
Mentos 0.0708 0.0123 aT 0.000 
S = 0.06724 R-Sq = 60.21% R-Sq(adj) = 58.40% 


a. Is a line an appropriate model to use for these data? Explain how 
you know. 


b. Find the correlation. 


c. What is the equation of the least-squares regression line? Define 
any variables that you use. 


d. Interpret the values of s and r2. 
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60. Less mess? Kerry and Danielle wanted to investigate whether tapping 
on a can of soda would reduce the amount of soda expelled after the 
can has been shaken. For their experiment, they vigorously shook 40 
cans of soda and randomly assigned each can to be tapped for 0 
seconds, 4 seconds, 8 seconds, or 12 seconds. After waiting for the 
fizzing to stop, they measured the amount expelled (in milliliters) by 
subtracting the amount remaining from the original amount in the 
can.22 Here is some computer output from a regression of y = amount 
expelled on x = tapping time: 


1104 © 
a, o 
Zio 4 ° i 
3 b 
= 
a ~ e 
i] £ 
- 3 
3 80+ 2 . 
5 i 
70 + 3 
° 
T T T T T T T 
0 2 4 6 8 10 12 


Tapping time (sec) 


15 * 


z 

r- 

s 

S - 

0 2 4 6 8 10 12 
Tapping time (sec) 

Term Coef SE Coef T-Value P-Value 
Constant 106.360 1.320 80.34 0.000 
Tapping time —2.635 O77 —14.90 0.000 
S 55.00347 R-Sq 585.38% R-Sq(adj) 584.99% 


a. Is aline an appropriate model to use for these data? Explain how 
you know. 


b. Find the correlation. 


c. What is the equation of the least-squares regression line? Define 
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any variables that you use. 


d. Interpret the values of s and r2. 


61. Temperature and wind The average temperature (in degrees 
Fahrenheit) and average wind speed (in miles per hour) were recorded 
for 365 consecutive days at Chicago’s O’Hare International Airport. 
Here is computer output for a regression of y = average wind speed 
on x = average temperature: 


Summary of Fit 


RSquare 0.047874 

RSquare Adj 0.045251 

Root Mean Square Error 3.655950 

Mean of Response 9.826027 

Observations (or Sum 365 

Wégts) 
Parameter Estimates 
Term Estimate Std Error t Ratio Prob> |t| 
Intercept 11.897762 0.521320 22.82 <.0001* 
Avg temp —0.041077 0.009615 4,27 <.0001* 


a. Calculate and interpret the residual for the day where the average 
temperature was 42°F and the average wind speed was 2.2 mph. 


b. Interpret the slope. 


c. By about how much do the actual average wind speeds typically 
vary from the values predicted by the least-squares regression line 
with x = average temperature? 


d. What percent of the variability in average wind speed is accounted 
for by the least-squares regression line with x = average 
temperature? 


62. Beetles and beavers Do beavers benefit beetles? Researchers laid out 
23 circular plots, each 4 meters in diameter, in an area where beavers 
were cutting down cottonwood trees. In each plot, they counted the 
number of stumps from trees cut by beavers and the number of 
clusters of beetle larvae. Ecologists believe that the new sprouts from 
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stumps are more tender than other cottonwood growth, so beetles 


prefer them. If so, more stumps should produce more beetle larvae.24 
Here is computer output for a regression of y = number of beetle 
larvae on x = number of stumps: 

Summary of Fit 

RSquare 0.839144 

RSquare Adj 0.831484 

Root Mean Square Error 6.419386 

Mean of Response 25.086960 

Observations (or Sum 23 

Wegts) 
Parameter Estimates 
Term Estimate StdError tRatio Prob>|t| 
Intercept —1.286104 2.853182 —0.45 0.6568 
Number of stumps 11.893733 1.136343 10.47 <.0001* 
a. Calculate and interpret the residual for the plot that had 2 stumps 

and 30 beetle larvae. 

b. Interpret the slope. 
c. By about how much do the actual number of larvae typically vary 


from the values predicted by the least-squares regression line with 
xX = number of stumps? 


What percent of the variability in number of larvae is accounted 
for by the least-squares regression line with x = number of stumps? 


pg 195 |») Husbands and wives The mean height of married 


American women in their early 20s is 64.5 inches and the standard 
deviation is 2.5 inches. The mean height of married men the same age 
is 68.5 inches with standard deviation 2.7 inches. The correlation 
between the heights of husbands and wives is about r = 0.5. 


a. 


Find the equation of the least-squares regression line for predicting 
a husband’s height from his wife’s height for married couples in 
their early 20s. 


. Suppose that the height of a randomly selected wife was 1 standard 
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64. 


65. 


66. 


deviation below average. Predict the height of her husband without 
using the least-squares line. 


The stock market Some people think that the behavior of the stock 
market in January predicts its behavior for the rest of the year. Take 
the explanatory variable x to be the percent change in a stock market 
index in January and the response variable y to be the change in the 
index for the entire year. We expect a positive correlation between x 
and y because the change during January contributes to the full year’s 
change. Calculation from data for an 18-year period gives 


x =1.75% sx=5.36% y =9.07% sy=15.35% r=0.596 
== 1.75% s, = 5.36% y = 9.07% s, = 15.35% r = 0.596 


a. Find the equation of the least-squares line for predicting full-year 
change from January change. 


b. Suppose that the percent change in a particular January was 2 


standard deviations above average. Predict the percent change for 
the entire year without using the least-squares line. 


Will I bomb the final? We expect that students who do well on the 
midterm exam in a course will usually also do well on the final exam. 
Gary Smith of Pomona College looked at the exam scores of all 346 
students who took his statistics class over a 10-year period.22 Assume 
that both the midterm and final exam were scored out of 100 points. 


a. State the equation of the least-squares regression line if each 
student scored the same on the midterm and the final. 


b. The actual least-squares line for predicting final-exam score y from 


midterm-exam score X was y\=46.6+0.41xY = 46.6 + 0.412, 
Predict the score of a student who scored 50 on the midterm and a 
student who scored 100 on the midterm. 


c. Explain how your answers to part (b) illustrate regression to the 
mean. 


It’s still early We expect that a baseball player who has a high 
batting average in the first month of the season will also have a high 
batting average the rest of the season. Using 66 Major League 
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Baseball players from a recent season," a least-squares regression 
line was calculated to predict rest-of-season batting average y from 
first-month batting average x. Note: A player’s batting average is the 
proportion of times at bat that he gets a hit. A batting average over 
0.300 is considered very good in Major League Baseball. 


a. 


State the equation of the least-squares regression line if each player 
had the same batting average the rest of the season as he did in the 
first month of the season. 


. The actual equation of the least-squares regression line is 


y\=0.245+0.109xY = 0.245 + 0.1092 | Predict the rest-of-season 
batting average for a player who had a 0.200 batting average the 
first month of the season and for a player who had a 0.400 batting 
average the first month of the season. 


Explain how your answers to part (b) illustrate regression to the 


medn. 


pg 200 >» Who’s got hops? Haley, Jeff, and Nathan measured the 


height (in inches) and vertical jump (in inches) of 74 students at their 


school.*4 Here is a scatterplot of the data, along with the least-squares 
regression line. Jacob (highlighted in red) had a vertical jump of 
nearly 3 feet! 


Vertical jump (in.) 
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a. Describe the influence that Jacob’s point has on the equation of the 
least-squares regression line. 


b. Describe the influence that Jacob’s point has on the standard 
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deviation of the residuals and r. 


68. Stand mixers The scatterplot shows the weight (in pounds) and cost 
(in dollars) of 11 stand mixers.22 The mixer from Walmart 
(highlighted in red) was much lighter—and cheaper—than the other 
mixers. 


500 - 


Weight (Ib) 


a. Describe what influence the highlighted point has on the equation 
of the least-squares regression line. 


b. Describe what influence the highlighted point has on the standard 
deviation of the residuals and r2. 


69. Managing diabetes People with diabetes measure their fasting 
plasma glucose (FPG, measured in milligrams per milliliter) after 
fasting for at least 8 hours. Another measurement, made at regular 
medical checkups, is called HbA. This is roughly the percent of red 
blood cells that have a glucose molecule attached. It measures 
average exposure to glucose over a period of several months. The 
table gives data on both HbA and FPG for 18 diabetics five months 


after they had completed a diabetes education class.2© 
Subject HbA (%) FPG (mg/ml) 
1 6.1 141 
2 6.3 158 
3 6.4 112 
4 6.8 153 
fs) 7.0 134 
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6 7.1 95 
7 es. 96 
8 Td 78 
2 vps, 148 
10 oy 172 
11 9.4 200 
12 10.4 271 
13 10.6 103 
14 10.7 172 
15 10.7 359 
16 11.2 145 
17 13.7 147 
18 19.3 255 


a. Make a scatterplot with HbA as the explanatory variable. Describe 
what you see. 


b. Subject 18 is an outlier in the x direction. What effect do you think 
this subject has on the correlation? What effect do you think this 
subject has on the equation of the least-squares regression line? 
Calculate the correlation and equation of the least-squares 
regression line with and without this subject to confirm your 
answer. 


c. Subject 15 is an outlier in the y direction. What effect do you think 
this subject has on the correlation? What effect do you think this 
subject has on the equation of the least-squares regression line? 
Calculate the correlation and equation of the least-squares 
regression line with and without this subject to confirm your 
answer. 


70. Rushing for points What is the relationship between rushing yards 
and points scored in the National Football League? The table gives 
the number of rushing yards and the number of points scored for each 
of the 16 games played by the Jacksonville Jaguars in a recent 


season.2Z 


Game Rushing yards Points scored 
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1 163 16 
2 112 3 
a 128 10 
4 104 10 
fs) 96 20 
6 133 13 
7 132 12 
8 84 14 
a 141 17 
10 108 10 
11 105 13 
12 129 14 
13 116 41 
14 116 14 
15 113 17 
16 190 19 


a. Make a scatterplot with rushing yards as the explanatory variable. 
Describe what you see. 


b. The number of rushing yards in Game 16 is an outlier in the x 
direction. What effect do you think this game has on the 
correlation? On the equation of the least-squares regression line? 
Calculate the correlation and equation of the least-squares 
regression line with and without this game to confirm your 
answers. 


c. The number of points scored in Game 13 is an outlier in the y 
direction. What effect do you think this game has on the 
correlation? On the equation of the least-squares regression line? 
Calculate the correlation and equation of the least-squares 
regression line with and without this game to confirm your 
answers. 


Multiple Choice Select the best answer for Exercises 71—78. 


71. Which of the following is not a characteristic of the least-squares 
regression line? 
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72s 


7s 


a. The slope of the least-squares regression line is always between —1 
and 1. 


b. The least-squares regression line always goes through the point 
(xy )® y) 5 


c. The least-squares regression line minimizes the sum of squared 
residuals. 


d. The slope of the least-squares regression line will always have the 
same sign as the correlation. 


e. The least-squares regression line is not resistant to outliers. 


Each year, students in an elementary school take a standardized math 
test at the end of the school year. For a class of fourth-graders, the 
average score was 55.1 with a standard deviation of 12.3. In the third 
grade, these same students had an average score of 61.7 witha 
standard deviation of 14.0. The correlation between the two sets of 
scores is r = 0.95. Calculate the equation of the least-squares 
regression line for predicting a fourth-grade score from a third-grade 
score. 


a. yA=3.58+0.835x9 = 3.58 + 0.835a 

b. y\=15.69+0.835xY = 15.69 + 0.8352 
C. yA=2.19+1.08xY = 2.19+ 1.082 

d. yA=-11.5441.08xY = —11.54+ 1.082 
e. Cannot be calculated without the data. 


Using data from the LPGA tour, a regression analysis was performed 
using x = average driving distance and y = scoring average. Using the 
output from the regression analysis shown below, determine the 
equation of the least-squares regression line. 


Predictor Coef SE Coef T P 
Constant 87.974000 2.391000 36.78 0.000 
Driving Distance —0.060934 0.009536 —6.39 0.000 
S = 1.01216 R-Sq = 22.1% R-Sq(adj) = 21.6% 
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74. 


75. 


d. 


e. 


y\=87.97442.391xY = 87.974 + 2.3912 
y\=87.97441.01216xY = 87.974 + 1.012162 
y\=87.974—-0.060934xY = 87.974 — 0.060934z 
y\=-0.060934+1.01216xY = —0.060934 + 1.012162 


y\=-0,060934+87.947x9 = —0.060934 + 87.947x 


Exercises 74 to 78 refer to the following setting. Measurements on 
young children in Mumbai, India, found this least-squares line for 


predicting y = height (in cm) from x = arm span (in cm):28 


y\=6.4+0.93xY = 6.4 + 0.932 


By looking at the equation of the least-squares regression line, you 
can see that the correlation between height and arm span is 


a. 
b. 
c. 
d. 


e. 


greater than zero. 
less than zero. 
0.93. 

6.4. 


Can’t tell without seeing the data. 


In addition to the regression line, the report on the Mumbai 


measurements says that r2 = 0.95. This suggests that 


a. 


although arm span and height are correlated, arm span does not 
predict height very accurately. 


height increases by 0.95=0.97 cmv 9-95 = 0.97 cm for each 
additional centimeter of arm span. 


95% of the relationship between height and arm span is accounted 
for by the regression line. 


95% of the variation in height is accounted for by the regression 
line with x = arm span. 


95% of the height measurements are accounted for by the 
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regression line with x = arm span. 


76. One child in the Mumbai study had height 59 cm and arm span 60 
cm. This child’s residual is 


a. —3.2 cm. 
b.. =2.2 Gm. 
c. —1.3 cm. 
d. 3.2 cm. 


e. 62.2 cm. 


77. Suppose that a tall child with arm span 120 cm and height 118 cm 
was added to the sample used in this study. What effect will this 
addition have on the correlation and the slope of the least-squares 
regression line? 


a. Correlation will increase, slope will increase. 

b. Correlation will increase, slope will stay the same. 

c. Correlation will increase, slope will decrease. 

d. Correlation will stay the same, slope will stay the same. 


e. Correlation will stay the same, slope will increase. 


78. Suppose that the measurements of arm span and height were 
converted from centimeters to meters by dividing each measurement 
by 100. How will this conversion affect the values of r? and s? 


a. r will increase, s will increase. 

b. r2 will increase, s will stay the same. 

c. r2 will increase, s will decrease. 

d. r2 will stay the same, s will stay the same. 
e. r2 will stay the same, s will decrease. 


Recycle and Review 
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79. Fuel economy (2.2) In its recent Fuel Economy Guide, the 
Environmental Protection Agency (EPA) gives data on 1152 vehicles. 
There are a number of outliers, mainly vehicles with very poor gas 
mileage or hybrids with very good gas mileage. If we ignore the 
outliers, however, the combined city and highway gas mileage of the 
other 1120 or so vehicles is approximately Normal with mean 18.7 
miles per gallon (mpg) and standard deviation 4.3 mpg. 


a. The Chevrolet Malibu with a four-cylinder engine has a combined 
gas mileage of 25 mpg. What percent of the 1120 vehicles have 
worse gas mileage than the Malibu? 


b. How high must a vehicle’s gas mileage be in order to fall in the top 
10% of the 1120 vehicles? 


80. Marijuana and traffic accidents (1.1) Researchers in New Zealand 
interviewed 907 drivers at age 21. They had data on traffic accidents 
and they asked the drivers about marijuana use. Here are data on the 
numbers of accidents caused by these drivers at age 19, broken down 
by marijuana use at the same age:22 


Marijuana use per year 
Never 1210 times 11250 times 511 times 
Number of drivers 452 229 70 156 
Accidents caused 59 36 15 50 


a. Make a graph that displays the accident rate for each category of 
marijuana use. Is there evidence of an association between 
marijuana use and traffic accidents? Justify your answer. 


b. Explain why we can’t conclude that marijuana use causes 
accidents based on this study. 
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— FRAPPY! FREE RESPONSE 


AP® PROBLEM, 
YAY! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, concise 
response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you 
use, because you will be scored on the correctness of your methods 
as well as on the accuracy and completeness of your results and 
explanations. 


Two Statistics students went to a flower shop and randomly 
selected 12 carnations. When they got home, the students prepared 
12 identical vases with exactly the same amount of water in each 
vase. They put one tablespoon of sugar in 3 vases, two tablespoons of 
sugar in 3 vases, and three tablespoons of sugar in 3 vases. In the 
remaining 3 vases, they put no sugar. After the vases were prepared, 
the students randomly assigned 1 carnation to each vase and 
observed how many hours each flower continued to look fresh. A 
scatterplot of the data is shown below. 
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Freshness (h) 


Amount of sugar (tbsp) 


a. Briefly describe the association shown in the scatterplot. 


b. The equation of the least-squares regression line for these data is 
y\=180.8+15.8xi = 180.8 4 15.8cy = 180.8 + 15.82. Interpret the 


slope of the line in the context of the study. 


c. Calculate and interpret the residual for the flower that had 2 
tablespoons of sugar and looked fresh for 204 hours. 


d. Suppose that another group of students conducted a similar 
experiment using 12 flowers, but included different varieties in 
addition to carnations. Would you expect the value of r¢ for the 
second group’s data to be greater than, less than, or about the same 
as the value of r? for the first group’s data? Explain. 


After you finish, you can view two example solutions on the book’s 
website (highschool.bfwpub.com/tps6e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 
“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide you with a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own 


SCOre. 
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Chapter 3 Review 


Section 3.1: Scatterplots and Correlation 


In this section, you learned how to explore the relationship between two 
quantitative variables. As with distributions of a single variable, the first 
step is always to make a graph. A scatterplot is the appropriate type of 
graph to investigate relationships between two quantitative variables. To 
describe a scatterplot, be sure to discuss four characteristics: direction, 
form, strength, and unusual features. The direction of a relationship might 
be positive, negative, or neither. The form of a relationship can be linear or 
nonlinear. A relationship is strong if it closely follows a specific form. 
Finally, unusual features include outliers that clearly fall outside the 
pattern of the rest of the data and distinct clusters of points. 


The correlation r is a numerical summary for linear relationships that 
describes the direction and strength of the association. When r > 0, the 
assoCiation is positive, and when r < 0, the association is negative. The 
correlation will always take values between —1 and 1, with r = —-1 andr = 
1 indicating a perfectly linear relationship. Strong linear relationships have 
correlations near 1 or —1, while weak linear relationships have correlations 
near 0. It isn’t possible to determine the form of a relationship from only 
the correlation. Strong nonlinear relationships can have a correlation close 
to 1 ora correlation close to 0. You also learned that outliers can greatly 
affect the value of the correlation and that correlation does not imply 
causation. That is, we can’t assume that changes in one variable cause 
changes in the other variable, just because they have a correlation close to 
1or-1. 


Section 3.2: Least-Squares Regression 


In this section, you learned how to use least-squares regression lines as 
models for relationships between two quantitative variables that have a 
linear association. It is important to understand the difference between the 
actual data and the model used to describe the data. To emphasize that the 
model only provides predicted values, least-squares regression lines are 
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always expressed in terms of y#¥ instead of y. Likewise, when you are 
interpreting the slope of a least-squares regression line, describe the 
change in the predicted value of y. 


The difference between the actual value of y and the predicted value of 
y is called a residual. Residuals are the key to understanding almost 
everything in this section. To find the equation of the least-squares 
regression line, find the line that minimizes the sum of the squared 
residuals. To see if a linear model is appropriate, make a residual plot. If 
there is no leftover curved pattern in the residual plot, you know the model 
is appropriate. To assess how well a line fits the data, calculate the 
standard deviation of the residuals s to estimate the size of a typical 
prediction error. You can also calculate r2, which measures the percent of 
the variation in the y variable that is accounted for by the least-squares 
regression line. 

You also learned how to obtain the equation of a least-squares 
regression line from computer output and from summary statistics (the 
means and standard deviations of two variables and their correlation). As 
with the correlation, the equation of the least-squares regression line and 
the values of s and r2 can be greatly influenced by outliers, so be sure to 
plot the data and note any unusual features before making any calculations. 


What Did You Learn? 


Related Relevant 
Example Chapter 


on Review 
Learning Target Section Page(s) Exercise(s) 
Distinguish between explanatory and 3.1 154 R3.4 
response variables for quantitative data. 
Make a scatterplot to display the 3.1 155 R3.4 
relationship between two quantitative 
variables. 
Describe the direction, form, and strength cm 158 R31, 
of a relationship displayed in a scatterplot B32 


and identify unusual features. 
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Interpret the correlation. 


Understand the basic properties of 
correlation, including how the correlation 
is influenced by outliers. 


Distinguish correlation from causation. 


Make predictions using regression lines, 
keeping in mind the dangers of 
extrapolation. 


Calculate and interpret a residual. 


Interpret the slope and y intercept of a 
regression line. 


Determine the equation of a least-squares 
regression line using technology or 
computer output. 


Construct and interpret residual plots to 
assess whether a regression model is 
appropriate. 

Interpret the standard deviation of the 
residuals and ,2 and use these values to 
assess how well a least-squares regression 


line models the relationship between two 
variables. 


Describe how the least-squares regression 
line, standard deviation of the residuals, 
and 2 are influenced by outliers. 


Find the slope and y intercept of the least- 


squares regression line from the means and 


standard deviations of x and y and their 
correlation. 
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3.1 
3.1 


3.1 
3.2 


3.2 


3.2 


3.2 


3.2 


3.2 


3.2 


ones 


162 
165, 169 


165 
178 


180 


182 


192 


186 


191 


200 


195 


Chapter 3 Review Exercises 


These exercises are designed to help you review the important ideas and 
methods of the chapter. 


R3.1 |») Born to be old? Is there a relationship between the gestational 
period (time from conception to birth) of an animal and its average 
life span? The figure shows a scatterplot of the gestational period 
and average life span for 43 species of animals.42 
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Gestation (days) 
a. Describe the relationship shown in the scatterplot. 


b. Point A is the hippopotamus. What effect does this point have on 
the correlation, the equation of the least-squares regression line, 
and the standard deviation of the residuals? 


c. Point B is the Asian elephant. What effect does this point have on 
the correlation, the equation of the least-squares regression line, 
and the standard deviation of the residuals? 


R3.2 |) Penguins diving A study of king penguins looked for a 


relationship between how deep the penguins dive to seek food and 
how long they stay under water.*! For all but the shallowest dives, 
there is an association between x = depth (in meters) and y = dive 
duration (in minutes) that is different for each penguin. The study 
gives a scatterplot for one penguin titled “The Relation of Dive 
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Duration (y) to Depth (x).” The scatterplot shows an association that 
is positive, linear, and strong. 


a. Explain the meaning of the term positive association in this 
context. 


b. Explain the meaning of the term linear association in this 
context. 


c. Explain the meaning of the term strong association in this 
context. 


d. Suppose the researchers reversed the variables, using x = dive 
duration and y = depth. Would this change the correlation? The 
equation of the least-squares regression line? 


R3.3 |p) Stats teachers’ cars A random sample of AP® Statistics 


teachers was asked to report the age (in years) and mileage of their 
primary vehicles. Here are a scatterplot, a residual plot, and other 
computer output: 
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Age (years) 
Predictor Coef SE Coef c P 
Constant 3704 8268 0.45 0.662 
Age 12188 1492 8.17 0.000 
S = 20870.5 R-Sq = 83.7% R-Sq(adj) = 82.4% 


a. Is a linear model appropriate for these data? Explain how you 
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e. 


know this. 


What’s the correlation between car age and mileage? Interpret 
this value in context. 


Give the equation of the least-squares regression line for these 
data. Identify any variables you use. 


One teacher reported that her 6-year-old car had 65,000 miles on 
it. Find and interpret its residual. 


Interpret the values of s and r2. 


R3.4 |») Late bloomers? Japanese cherry trees tend to blossom early 


when spring weather is warm and later when spring weather is cool. 
Here are some data on the average March temperature (in degrees 
Celsius) and the day in April when the first cherry blossom appeared 


over a 24-year period:44 

Temperature (°C) 4.0 5.4 3.2 2.6 4.2 4.7 4.9 4.0 4.9 3.8 4.0 5.1 
Days in April to first 14 8 11 19 14 14 14 21 9 14 13 11 
blossom 

Temperature (°C) 4.3 1.5 3.7 3.8 4.5 4.1 6.1 6.2 5.1 5.0 4.6 4.0 
Days in April to first 13 28 17 19 10 17 3 3 11 6 9 11 
blossom 

a. Make a well-labeled scatterplot that’s suitable for predicting 


when the cherry trees will blossom from the temperature. Which 
variable did you choose as the explanatory variable? Explain 
your reasoning. 


Use technology to calculate the correlation and the equation of 
the least-squares regression line. Interpret the slope and y 
intercept of the line in this setting. 


Suppose that the average March temperature this year was 8.2°C. 
Would you be willing to use the equation in part (b) to predict the 
date of first blossom? Explain your reasoning. 


Calculate and interpret the residual for the year when the average 
March temperature was 4.5°C. 


Use technology to help construct a residual plot. Describe what 
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you see. 


R3.5 |) What’s my grade? In Professor Friedman’s economics course, 


the correlation between the students’ total scores prior to the final 
examination and their final exam scores is r = 0.6. The pre-exam 
totals for all students in the course have mean 280 and standard 
deviation 30. The final exam scores have mean 75 and standard 
deviation 8. Professor Friedman has lost Julie’s final exam but 
knows that her total before the exam was 300. He decides to predict 
her final exam score from her pre-exam total. 


a. Find the equation for the least-squares regression line Professor 
Friedman should use to make this prediction. 


b. Use the least-squares regression line to predict Julie’s final exam 
score. 


c. Explain the meaning of the phrase “least squares” in the context 
of this question. 


d. Julie doesn’t think this method accurately predicts how well she 
did on the final exam. Determine r2. Use this result to argue that 
her actual score could have been much higher (or much lower) 
than the predicted value. 


R3.6 |) Calculating achievement The principal of a high school read a 


study that reported a high correlation between the number of 
calculators owned by high school students and their math 
achievement. Based on this study, he decides to buy each student at 
his school two calculators, hoping to improve their math 
achievement. Explain the flaw in the principal’s reasoning. 
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Chapter 3 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T3.1 A school guidance counselor examines how many extracurricular 
activities students participate in and their grade point average. The 
guidance counselor says, “The evidence indicates that the 
correlation between the number of extracurricular activities a 
student participates in and his or her grade point average is close to 
0.” Which of the following is the most appropriate conclusion? 


a. Students involved in many extracurricular activities tend to be 
students with poor grades. 


b. Students with good grades tend to be students who are not 
involved in many extracurricular activities. 


c. Students involved in many extracurricular activities are just as 
likely to get good grades as bad grades. 


d. Students with good grades tend to be students who are involved 
in many extracurricular activities. 


e. No conclusion should be made based on the correlation without 
looking at a scatterplot of the data. 


T3.2 An AP® Statistics student designs an experiment to see whether 
today’s high school students are becoming too calculator- 
dependent. She prepares two quizzes, both of which contain 40 
questions that are best done using paper-and-pencil methods. A 
random sample of 30 students participates in the experiment. Each 
student takes both quizzes—one with a calculator and one without 
—in a random order. To analyze the data, the student constructs a 
scatterplot that displays a linear association between the number of 
correct answers with and without a calculator for the 30 students. 
A least-squares regression yields the equation 

calculator’ = —1.2 + 0.865 (pencil) r= 0.79 


calculator = — 1.2 + 0.865 (pencil) r= 0.79 
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calculator = — 1.2 + 0.865 (pencil) r= 0.79 


Which of the following statements is/are true? 


I. If the student had used Calculator as the explanatory variable, 
the correlation would remain the same. 


II. If the student had used Calculator as the explanatory variable, 
the slope of the least-squares line would remain the same. 


III. The standard deviation of the number of correct answers on 
the paper-and-pencil quizzes was smaller than the standard 
deviation on the calculator quizzes. 


I only 


ZT p 


II only 

c. If only 

I and HI only 
e. I, II, and III 


Pp 


Questions T3.3—T3.5 refer to the following setting. Scientists 
examined the activity level of 7 fish at different temperatures. Fish 
activity was rated on a scale of 0 (no activity) to 100 (maximal 
activity). The temperature was measured in degrees Celsius. A 
computer regression printout and a residual plot are provided. 
Notice that the horizontal axis on the residual plot is labeled 
“Fitted value,” which means the same thing as “predicted value.” 
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Temperature -3.2167 0.4533 -7.10 0.001 
S = 4.78505 R-Sq = 91.0% R-Sq(adj) = 89.2% 


T3.3 What is the correlation between temperature and fish activity? 
a. 0.95 
b. 0.91 
c. 0.45 
d. —0.91 
e. —0.95 


T3.4 What was the actual activity level rating for the fish at a 
temperature of 20°C? 


a. 87 
b. 84 
c. 81 
d. 66 
e. 3 


T3.5 Which of the following gives a correct interpretation of s in this 
setting? 


a. For every 1°C increase in temperature, fish activity is predicted 
to increase by 4.785 units. 


b. The typical distance of the temperature readings from their 
mean is about 4.785°C. 


c. The typical distance of the activity level ratings from the least- 
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squares line is about 4.785 units. 


d. The typical distance of the activity level readings from their 


mean is about 4.785 units. 


e. At atemperature of 0°C, this model predicts an activity level of 


4.785 units. 


T3.6 Which of the following statements is not true of the correlation r 
between the lengths (in inches) and weights (in pounds) of a 
sample of brook trout? 


a. r must take a value between —1 and 1. 


b. 


r is measured in inches. 


c. If longer trout tend to also be heavier, then r > 0. 


r would not change if we measured the lengths of the trout in 
centimeters instead of inches. 


r would not change if we measured the weights of the trout in 
kilograms instead of pounds. 


T3.7 When we standardize the values of a variable, the distribution of 
standardized values has mean 0 and standard deviation 1. Suppose 
we measure two variables X and Y on each of several subjects. We 
standardize both variables and then compute the least-squares 
regression line. Suppose the slope of the least-squares regression 
line is 20.44. We may conclude that 


a. the intercept will also be —0.44. 
b. 

C. 
d. 


e. 


the intercept will be 1.0. 
the correlation will be 1/—0.44. 
the correlation will be 1.0. 


the correlation will also be —0.44. 


T3.8 There is a linear relationship between the number of chirps made 
by the striped ground cricket and the air temperature. A least- 
squares fit of some data collected by a biologist gives the model 
yA=25.2+3.3xy = 25.2 4 3.304 = 25.2 4 3.32 where x is the number 
of chirps per minute and yi is the estimated temperature in 


482 


degrees Fahrenheit. What is the predicted increase in temperature 
for an increase of = chirps per minute? 


a. 3.3°F 
b. 16.5°F 
@: 25,2°F 
d. 28.5°F 
e. 41.7°F 


T3.9 The scatterplot shows the relationship between the number of 
people per television set and the number of people per physician 
for 40 countries, along with the least-squares regression line. In 
Ethiopia, there were 503 people per TV and 36,660 people per 
doctor. Which of the following is correct? 
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People per TV 
a. Increasing the number of TVs in a country will attract more 
doctors. 
b. The slope of the least-squares regression line is less than 1. 
c. The correlation is greater than 1. 


d. The point for Ethiopia is decreasing the slope of the least- 
squares regression line. 


e. Ethiopia has more people per doctor than expected, based on 
how many people it has per TV. 


T3.10 The scatterplot shows the lean body mass and metabolic rate for a 
sample of = adults. For each person, the lean body mass is the 
subject’s total weight in kilograms less any weight due to fat. The 
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metabolic rate is the number of calories burned in a 24-hour 
period. 
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Because a person with no lean body mass should burn no calories, 
it makes sense to model the relationship with a direct variation 
function in the form y = kx. Models were tried using different 
values of k (k = 25, k = 26, etc.) and the sum of squared residuals 
(SSR) was calculated for each value of k. Here is a scatterplot 
showing the relationship between SSR and k: 

500 4 
450 5 
400 = 
350 4 


300 = 


SSR (1000s) 


250 5 


200 4 


150 4 


According to the scatterplot, what is the ideal value of k to use for 
predicting metabolic rate? 


a. 24 
b. 25 
e. 29 
d. 31 
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e. 36 


Section IJ: Free Response Show all your work. Indicate clearly the 
methods you use, because you will be graded on the correctness of your 
methods as well as on the accuracy and completeness of your results and 
explanations. 


T3.11 Sarah’s parents are concerned that she seems short for her age. 
Their doctor has kept the following record of Sarah’s height: 


Age (months) 36 48 5Sl 54 57 ~~ 60 
Height (cm) 86 90 91 93 94 ~ 95 


a. Make a scatterplot of these data using age as the explanatory 
variable. Describe what you see. 


b. Using your calculator, find the equation of the least-squares 
regression line. 


c. Calculate and interpret the residual for the point when Sarah 
was 48 months old. 


d. Would you be confident using the equation from part (b) to 
predict Sarah’s height when she is 40 years old? Explain. 


T3.12 Drilling down beneath a lake in Alaska yields chemical evidence of 
past changes in climate. Biological silicon, left by the skeletons of 
single-celled creatures called diatoms, is a measure of the 
abundance of life in the lake. A rather complex variable based on 
the ratio of certain isotopes relative to ocean water gives an indirect 
measure of moisture, mostly from snow. As we drill down, we look 
further into the past. Here is a scatterplot of data from 2300 to 
12,000 years ago: 
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Silicon 


-21.8 —21.4 21.0 —20.6 20.2 -19.8 19.4 


Isotope 


a. Identify the unusual point in the scatterplot and estimate its x 
and y coordinates. 


b. Describe the effect this point has on 
i. the correlation. 
ii. the slope and y intercept of the least-squares line. 
iii. the standard deviation of the residuals. 


T3.13 Long-term records from the Serengeti National Park in Tanzania 
show interesting ecological relationships. When wildebeest are 
more abundant, they graze the grass more heavily, so there are 
fewer fires and more trees grow. Lions feed more successfully 
when there are more trees, so the lion population increases. 
Researchers collected data on one part of this cycle, wildebeest 
abundance (in thousands of animals) and the percent of the grass 
area burned in the same year. The results of a least-squares 


regression on the data are shown here. 
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Is a linear model appropriate for describing the relationship 
between wildebeest abundance and percent of grass area 
burned? Explain. 


Give the equation of the least-squares regression line. Be sure to 
define any variables you use. 


Interpret the slope. Does the value of the y intercept have 
meaning in this context? If so, interpret the y intercept. If not, 
explain why. 


Interpret the standard deviation of the residuals and r2. 
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Chapter 3 Project Investigating 
Relationships in 
Baseball 


What is a better predictor of the number of wins for a baseball team, the 
number of runs scored by the team or the number of runs they allow the 
other team to score? What variables can we use to predict the number of 
runs a team scores? To predict the number of runs it allows the other team 
to score? In this project, you will use technology to help answer these 
questions by exploring a large set of data from Major League Baseball. 


Part 1 


. Download the “MLB Team Data 2012-2016” Excel file from the book’s 
website, along with the “Glossary for MLB Team Data file,” which 
explains each of the variables included in the data set.“4 Import the data 
into the statistical software package you prefer. 


’. Create a scatterplot to investigate the relationship between runs scored 
per game (R/G) and wins (W). Then calculate the equation of the least- 
squares regression line, the standard deviation of the residuals, and r2. 
Note: R/G is in the section for hitting statistics and W is in the section 
for pitching statistics. 


3. Create a scatterplot to investigate the relationship between runs allowed 
per game (RA/G) and wins (W). Then calculate the equation of the 
least-squares regression line, the standard deviation of the residuals, and 
r2. Note: Both of these variables may be found in the section for 
pitching statistics. 

|. Compare the two associations. Is runs scored or runs allowed a better 


predictor of wins? Explain your reasoning. 


». Because the number of wins a team has is dependent on both how many 
runs they score and how many runs they allow, we can use a 
combination of both variables to predict the number of wins. Add a 
column in your data table for a new variable, run differential. Fill in the 
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values using the formula R/G — RA/G. 


)}. Create a scatterplot to investigate the relationship between run 
differential and wins. Then calculate the equation of the least-squares 
regression line, the standard deviation of the residuals, and r2, 


’. Is run differential a better predictor than the variable you chose in 
Question 4? Explain your reasoning. 


Part 2 


It is fairly clear that the number of games a team wins is dependent on 
both runs scored and runs allowed. But what variables help predict runs 
scored? Runs allowed? 


. Choose either runs scored (R) or runs allowed (RA) as the response 
variable you will try to model. 


’. Choose at least three different explanatory variables (or combinations of 
explanatory variables) that might help predict the response variable you 
chose in Question 1. Create a scatterplot using each explanatory 
variable. Then calculate the equation of the least-squares regression line, 
the standard deviation of the residuals, and r2 for each relationship. 


3. Which explanatory variable from Question 2 is the best? Explain your 
reasoning. 
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- Collecting Data 
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Cumulative AP® Practice Test 1 
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Introduction 


You can hardly go a day without hearing the results of a statistical study. 
Here are some examples: 


e The National Highway Traffic Safety Administration (NHTSA) reports 
that seat belt use in passenger vehicles increased from 88.5% in 2015 to 
90.1% in 2016.4 


e According to a survey, U.S. teens aged 13 to 18 use entertainment 
media (television, Internet, social media, listening to music, etc.) nearly 
9 hours a day, on average.2 


e A study suggests that lack of sleep increases the risk of catching a cold.2 


e For their final project, two AP® Statistics students showed that listening 
to music while studying decreased subjects’ performance on a memory 
task.4 


Can we trust these results? As you’ll learn in this chapter, the answer 
depends on how the data were produced. Let’s take a closer look at where 
the data came from in each of these studies. 


Each year, the NHTSA conducts an observational study of seat belt use 
in vehicles. The NHTSA sends trained observers to record the behavior of 
people in vehicles at randomly selected locations across the country. The 
idea of an observational study is simple: you can learn a lot just by 
watching or by asking a few questions, as in the survey of teens’ media 
habits. Common Sense Media conducted this survey using a random 
sample of 1399 U.S. 13- to 18-year-olds. Both of these studies use 
information from a sample to draw conclusions about some larger 
population. Section 4.1 examines the issues involved in sampling and 
surveys. 

In the sleep and catching a cold study, 153 volunteers answered 
questions about their sleep habits over a two-week period. Then 
researchers gave them a virus and waited to see who developed a cold. 
This was a complicated observational study. Compare this with the 
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experiment performed by the AP® Statistics students. They recruited 30 
students and divided them into two groups of 15 by drawing names from a 
hat. Students in one group tried to memorize a list of words while listening 
to music. Students in the other group tried to memorize the same list of 
words while sitting in silence. Section 4.2 focuses on designing 
experiments. 


In Section 4.3, we revisit two key ideas from Sections 4.1 and 4.2: 
drawing conclusions about a population based on a random sample and 
drawing conclusions about cause and effect based on a randomized 
experiment. In both cases, we will focus on the role of randomization in 
our analysis. 
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SECTION 4.1 Sampling and Surveys 


LEARNING TARGETS By the end of the section, you should be able to: 
Identify the population and sample in a statistical study. 


Identify voluntary response sampling and convenience sampling 
and explain how these sampling methods can lead to bias. 


Describe how to select a simple random sample with technology or 
a table of random digits. 


Describe how to select a sample using stratified random sampling 
and cluster sampling, distinguish stratified random sampling from 
cluster sampling, and give an advantage of each method. 


Explain how undercoverage, nonresponse, question wording, and 
other aspects of a sample survey can lead to bias. 


Suppose we want to find out what percent of young drivers in the United 
States text while driving. To answer the question, we will survey 16- to 
20-year-olds who live in the United States and drive. Ideally, we would 
ask them all by conducting a census. But contacting every driver in this 
age group wouldn’t be practical: it would take too much time and cost too 
much money. Instead, we pose the question to a sample chosen to 
represent the entire population of young drivers. 


DEFINITION Population, Census, Sample 


The population in a statistical study is the entire group of individuals 
we want information about. A census collects data from every 
individual in the population. 


A sample is a subset of individuals in the population from which we 


collect data. 


The distinction between population and sample is basic to statistics. To 
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make sense of any sample result, you must know what population the 
sample represents. 


EXAMPLE | Sampling monitors and voters » 


Populations and samples 


Cancan Chu/Getty Images 


PROBLEM: Identify the population and the sample in each of the 
following settings. 


a. The quality control manager at a factory that produces computer 
monitors selects 10 monitors from the production line each hour. The 
manager inspects each monitor for defects in construction and 
performance. 


b. Prior to an election, a news organization surveys 1000 registered 
voters to predict which candidate will be elected as president. 

SOLUTION: 

a. The population is all the monitors produced in this factory that hour. 
The sample is the 10 monitors selected from the production line. 


b. The population is all registered voters. The sample is the 1000 
registered voters surveyed. 


Because the sample came from 1 hour’s production at this factory, the 
population is the monitors produced that hour in this factory—not all 
monitors produced in the world or even all monitors produced by this 


factory. 
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FOR PRACTICE, TRY EXERCISE 1 


The Idea of a Sample Survey 


We often draw conclusions about a population based on a sample. Have 
you ever tasted a sample of ice cream and ordered a cone because the 
sample tastes good? Because ice cream is fairly uniform, the single taste 
represents the whole. Choosing a representative sample from a large and 
varied population (like all young U.S. drivers) is not so easy. The first step 
in planning a sample survey is to decide what population we want to 
describe. The second step is to decide what we want to measure. 


DEFINITION Sample survey 


A sample survey is a study that collects data from a sample that is 
chosen to represent a specific population. 


By our definition, the population in a sample survey can consist of 
people, animals, or things. Some people use the terms survey or sample 
survey to refer only to studies in which people are asked questions, like the 
news organization survey in the preceding example. We’ll avoid this more 
restrictive terminology. 


The final step in planning a sample survey is to decide how to choose a 
sample from the population. Here is an activity that illustrates the process 
of conducting a sample survey. 


ACTIVITY Who wrote the Federalist Papers? 


The Federalist Papers are a series of 85 essays supporting the 
ratification of the U.S. Constitution. At the time they were published, 
the identity of the authors was a secret known to only a few people. 
Over time, however, the authors were identified as Alexander Hamilton, 
James Madison, and John Jay. The authorship of 73 of the essays is 
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fairly certain, leaving 12 in dispute. However, thanks in some part to 
Statistical analysis, most scholars now believe that the 12 disputed 
essays were written by Madison alone or in collaboration with 
Hamilton.2 


% LLOBLNONDS 


Ha 


Bragin Alexey/Shutterstock.com 


There are several ways to use statistics to help determine the 
authorship of a disputed text. One method is to estimate the average 
word length in a disputed text and compare it to the average word 
lengths of works where the authorship is not in dispute. 


The following passage is the opening paragraph of Federalist Paper 


No. 51,2 one of the disputed essays. The theme of this essay is the 
separation of powers between the three branches of government. 


To what expedient, then, shall we finally resort, for maintaining in 
practice the necessary partition of power among the several 
departments, as laid down in the Constitution? The only answer 
that can be given is, that as all these exterior provisions are found 
to be inadequate, the defect must be supplied, by so contriving the 
interior structure of the government as that its several constituent 
parts may, by their mutual relations, be the means of keeping each 
other in their proper places. Without presuming to undertake a full 
development of this important idea, I will hazard a few general 
observations, which may perhaps place it in a clearer light, and 
enable us to form a more correct judgment of the principles and 
structure of the government planned by the convention. 


1. Choose 5 words from this passage. Count the number of letters in 
each of the words you selected, and find the average word length. 


2. Your teacher will draw and label a horizontal axis for a class dotplot. 
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Plot the average word length you obtained in Step 1 on the graph. 


3. Your teacher will show you how to use a random number generator to 
select a random sample of 5 words from the 130 words in the opening 
passage. Count the number of letters in each of the selected words, 
and find the average word length. 


4. Your teacher will draw and label another horizontal axis with the 
same scale for a comparative dotplot. Plot the average word length 
you obtained in Step 3 on the graph. 


5. How do the dotplots compare? Can you think of any reasons why 
they might be different? Discuss with your classmates. 


How to Sample Badly 


Suppose we want to know how long students at a large high school spent 
doing homework last week. We might go to the school library and ask the 
first 30 students we see about the amount of time they spend on their 
homework. This method is known as a convenience sampling. 


DEFINITION Convenience sampling 


Convenience sampling selects individuals from the population who 
are easy to reach. 


o@ Convenience sampling often produces unrepresentative data. 
Consider our sample of 30 students from the school library. It’s unlikely 
that this convenience sample accurately represents the homework habits of 
all students at the high school. In fact, if we were to repeat this sampling 
process day after day, we would almost always overestimate the average 
homework time in the population. Why? Because students who hang out in 
the library tend to be more studious. This is bias: using a method that 
favors some outcomes over others. 


DEFINITION Bias 


498 


The design of a statistical study shows bias if it is very likely to 
underestimate or very likely to overestimate the value you want to 
know. 


AP® EXAM TIP 


If you’re asked to describe how the design of a sample survey leads to 
bias, you’re expected to do two things: (1) describe how the members 
of the sample might respond differently from the rest of the population, 
and (2) explain how this difference would lead to an underestimate or 
Overestimate. Suppose you were asked to explain how using your 
Statistics class as a sample to estimate the proportion of all high school 
students who own a graphing calculator could result in bias. You might 
respond, “This is a convenience sample. It would probably include a 
much higher proportion of students with a graphing calculator than in 
the population at large because a graphing calculator is required for the 
Statistics class. So this method would probably lead to an overestimate 
of the actual population proportion.” 


oOo Bias is not just bad luck in one sample. It’s the result of a bad 


study design that will consistently miss the truth about the population in 
the same way. Convenience sampling will almost always result in bias. So 


will voluntary response sampling. 


DEFINITION Voluntary response sampling 


Voluntary response sampling allows people to choose to be in the 
sample by responding to a general invitation. 


The Internet brings voluntary response sampling to the computer nearest you. Visit 


www.misterpoll.com to become part of the sample in any of dozens of online polls. As 
the site says, “None of these polls are ‘scientific,’ but do represent the collective 
opinion of everyone who participates.” Unfortunately, such polls don’t tell you much 


about the views of any larger population. 
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Most Internet polls, along with call-in, text-in, and write-in polls, rely 
on voluntary response sampling. People who self-select to participate in 
such surveys are usually not representative of some larger population of 
interest. Voluntary response sampling attracts people who feel strongly 
about an issue, and who often share the same opinion. That leads to bias. 


EXAMPLE |Boaty McBoatface » 


Biased sampling methods 


NERC/SplashNews/Newscom 


PROBLEM: In 2016, Britain’s Natural Environment Research Council 
invited the public to name its new $300 million polar research ship. To 
vote on the name, people simply needed to visit a website and record 
their choice. Ignoring names suggested by the council, over 124,000 
people voted for “Boaty McBoatface,” which ended up having more 
than 3 times as many votes as the second-place finisher.2 


What type of sampling did the council use in their poll? Explain how 
bias in this sampling method could have affected the poll results. 


SOLUTION: 


The council used voluntary response sampling: people chose to go 
online and respond. The people who chose to be in the sample were 
probably less serious about science than the British population as a 
whole—and more likely to prefer a funny name. The proportion of 
people in the sample who prefer the name Boaty McBoatface is likely to 
be greater than the proportion of all British residents who would choose 
this name. 
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Remember to describe how the responses from the members of the 


sample might differ from the responses from the rest of the population 
and how this difference will affect the estimate. 


FOR PRACTICE, TRY EXERCISE 5 


CHECK YOUR UNDERSTANDING 


( 


y For each of the following situations, identify the sampling method 
used. Then explain how bias in the sampling method could affect the 


results. 

1. A farmer brings a juice company several crates of oranges each week. 
A company inspector looks at 10 oranges from the top of each crate 
before deciding whether to buy all the oranges. 


2. The ABC program Nightline once asked if the United Nations should 
continue to have its headquarters in the United States. Viewers were 
invited to call one telephone number to respond “Yes” and another to 
respond “No.” There was a charge for calling either number. More 
than 186,000 callers responded, and 67% said “No.” 


How to Sample Well: Random Sampling 


In convenience sampling, the researcher chooses easy-to-reach members 
of the population. In voluntary response sampling, people decide whether 
to join the sample. Both sampling methods suffer from bias due to 
personal choice. As you discovered in The Federalist Papers activity, a 
good way to avoid bias is to let chance choose the sample. That’s the idea 


of random sampling. 


DEFINITION Random sampling 
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Random sampling involves using a chance process to determine 
which members of a population are included in the sample. 


In everyday life, some people use the word random to mean “haphazard,” as in 


“That’s so random.” In statistics, random means “using chance.” Don’t say that a 
sample was chosen at random if a chance process wasn’t used to select the 


individuals. 


For example, to choose a random sample of 6 students from a class of 
30, start by writing each of the 30 names on a separate slip of paper, 
making sure the slips are all the same size. Then put the slips in a hat, mix 
them well, and pull out slips one at a time until you have identified 6 
different students. An alternative approach would be to give each member 
of the population a distinct number and to use the “hat method” with these 
numbers instead of people’s names. Note that this version would work just 
as well if the population consisted of animals or things. The resulting 
sample is called a simple random sample, or SRS for short. 


DEFINITION Simple random sample (SRS) 


A simple random sample (SRS) of size n is chosen in such a way 
that every group of n individuals in the population has an equal 
chance to be selected as the sample. 


An SRS gives every possible sample of the desired size an equal 
chance to be chosen. Picture drawing 20 slips of paper (the sample) from a 
hat containing 200 identical slips (the population). Any set of 20 slips has 
the same chance as any other set of 20 to be chosen. This also means that 
each individual has the same chance to be chosen in an SRS. However, 
giving each individual the same chance to be selected is not enough to 
guarantee that a sample is an SRS. Some other random sampling methods 
give each member of the population an equal chance to be selected, but not 
each possible sample. We’ll look at some of these methods later. 


HOW TO CHOOSE A SIMPLE RANDOM SAMPLE The hat method 
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won’t work well if the population is large. Imagine trying to take a simple 
random sample of 1000 registered voters in the United States using a hat! 
In practice, most people use random numbers generated by technology to 
choose samples. 


HOW TO CHOOSE AN SRS WITH TECHNOLOGY 


e Label. Give each individual in the population a distinct numerical 
label from 1 to N, where N is the number of individuals in the 
population. 


e Randomize. Use a random number generator to obtain n different 
integers from 1 to N, where n is the sample size. 


e Select. Choose the individuals that correspond to the randomly 
selected integers. 


When choosing an SRS, we make the selections without replacement. 
That is, once an individual is selected for a sample, that individual cannot 
be selected again. Many random number generators sample numbers with 
replacement, so it is important to explain that repeated numbers should be 
ignored when using technology to select an SRS. 


11. Technology Corner |CHOOSING AN SRS (@ 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


Let’s use a graphing calculator to select an SRS of 10 students from a 

population of students numbered 1 to 1750. 

1. Check that your calculator’s random number generator is working 
properly. 

e Press [MATH], then select PROB (PRB) and choose randInt(. 
Newer OS: In the dialogue box, enter these values: lower: 1, 
upper: 1750, n: 1, choose Paste, and press [ENTER\. 

Older OS: Complete the command randInt(1,1750) and press 
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[ENTER| 


e Compare your results with those of your classmates. If several 
students got the same number, you’|l need to seed your 
calculator’s random integer generator with different numbers 
before you proceed. Directions for doing this are given in the 
Teacher’s Edition. 


2. Randomly generate 10 distinct numbers from 1 to 1750 by pressing 
until you have chosen 10 different labels. 


NORMAL FLOAT AUTO REAL RADIAN CL Hl 


randInt(1,175@) 


nee | 
randiInt(1,1750) 
Lae deat eis ames oi oni ascidga 
randiInt(1,1750) 
SEALS ne en ee eee! 929. 
randInt(1,1750) 
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Note: If you have a TI-84 Plus CE, use the command 
RandIntNoRep(1,1750,10) to get 10 distinct integers from 1 to 1750. If 
you have a TI-84 with OS 2.55 or later, use the command 
RandIntNoRep(1,1750) to sort the numbers from 1 to 1750 in random 
order. The first 10 numbers listed give the labels of the chosen students. 


There are many random number generators available on the Internet, 
including those at www.random.org. You can also use the random number 
generator on your calculator. 

If you don’t have technology handy, you can use a table of random 
digits to choose an SRS. We have provided a table of random digits at the 
back of the book (Table D). Here is an excerpt: 


Table D Random digits 


LINE 

101 19223 95034 05756 28713 96409 12531 42544 82853 
102 73676 47150 99400 01927 27754 42648 82425 36290 
103 45467 71709 77558 00095 32863 29485 82226 90056 
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You can think of this table as the result of someone putting the digits 0 
to 9 in a hat, mixing, drawing one, replacing it, mixing again, drawing 
another, and so on. The digits have been arranged in groups of five within 
numbered rows to make the table easier to read. The groups and rows have 
no special meaning—Table D is just a long list of randomly chosen digits. 
As with technology, there are three steps in using Table D to choose a 
random sample. 


HOW TO CHOOSE AN SRS USING TABLE D 


e Label. Give each member of the population a distinct numerical label 
with the same number of digits. Use as few digits as possible. 


e Randomize. Read consecutive groups of digits of the appropriate 


length from left to right across a line in Table D. Ignore any group of 
digits that wasn’t used as a label or that duplicates a label already in 
the sample. Stop when you have chosen n different labels. 


e Select. Choose the individuals that correspond to the randomly 
selected integers. 


Always use the shortest labels that will cover your population. For 
instance, you can label up to 100 individuals with two digits: 01, 02, ..., 
99, 00. As standard practice, we recommend that you begin with label 1 
(or 01 or 001 or 0001, as needed). Reading groups of digits from the table 
gives all individuals the same chance to be chosen because all labels of the 
same length have the same chance to be found in the table. For example, 
any pair of digits in the table is equally likely to be any of the 100 possible 
labels 01, 02, ... , 99, 00. Here’s an example that shows how this process 
works. 


EXAMPLE | Attendance audit » 
Choosing an SRS with Table D 
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Holly Albrecht 


PROBLEM: Each year, the state Department of Education randomly 
selects three schools from each district and conducts a detailed audit of 
their attendance records. 


a. Describe how to use a table of random digits to select an SRS of three 
schools from this list of 19 schools. 


Amphitheater High School Keeling Elementary School 
Amphitheater Middle School La Cima Middle School 
Canyon del Oro High School Mesa Verde Elementary School 
Copper Creek Elementary School Nash Elementary School 
Coronado K-8 School Painted Sky Elementary School 
Cross Middle School Prince Elementary School 
Donaldson Elementary School Rio Vista Elementary School 
Harelson Elementary School Walker Elementary School 
Holaway Elementary School Wilson K-8 School 


Ironwood Ridge High School 


b. Use the random digits here to choose the sample. 
62081 64816 87374 09517 84534 06489 87201 97245 


SOLUTION: 


a. Label the schools from 01 to 19 in alphabetical order. Move along a 
line of random digits from left to right, reading two-digit numbers, 
until three different numbers from 01 to 19 have been selected 
(ignoring repeated numbers and the numbers 20-99, 00). Audit the 
three schools that correspond with the numbers selected. 
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Remember to include all three steps: 


e Label 
e Randomize 


e Select 


b. 62-skip, 08-select, 16-select, 48-skip, 16-repeat, 87-skip, 37-skip, 
40-skip, 95-skip, 17-select. 
The three schools are 08: Harelson Elementary School, 16: Prince 
Elementary School, and 17: Rio Vista Elementary School. 


FOR PRACTICE, TRY EXERCISE 11 


CHECK YOUR UNDERSTANDING 


‘ 


», A furniture maker buys hardwood in batches that each contain 1000 
pieces. The supplier is supposed to dry the wood before shipping (wood 
that isn’t dry won’t hold its size and shape). The furniture maker 
chooses five pieces of wood from each batch and tests their moisture 
content. If any piece exceeds 12% moisture content, the entire batch is 
sent back. Describe how to select a simple random sample of 5 pieces 
using each of the following. 


1. A random number generator 


2. A table of random digits 
Other Random Sampling Methods 
One of the most common alternatives to simple random sampling is called 


stratified random sampling. This method involves sampling from groups 
(strata) of similar individuals within the population separately. Then these 
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separate “subsamples” are combined to form the sample. 


DEFINITION Strata, Stratified random sample 


Strata are groups of individuals in a population who share 
characteristics thought to be associated with the variables being 
measured in a study. Stratified random sampling selects a sample 
by choosing an SRS from each stratum and combining the SRSs into 
one overall sample. 


The singular form of strata is stratum. 


Stratified random sampling works best when the individuals within 
each stratum are similar with respect to what is being measured and when 
there are large differences between strata. For example, in a study of sleep 
habits on school nights, the population of students in a large high school 
might be divided into freshman, sophomore, junior, and senior strata. After 
all, it is reasonable to think that freshmen have different sleep habits than 
seniors. The following activity illustrates the benefit of choosing 
appropriate strata. 


ACTIVITY Sampling sunflowers 


A British farmer grows sunflowers for making sunflower oil. Her field 
is arranged in a grid pattern, with 10 rows and 10 columns as shown in 
the figure. Irrigation ditches run along the top and bottom of the field. 
The farmer would like to estimate the number of healthy plants in the 
field so she can project how much money she’! make from selling 
them. It would take too much time to count the plants in all 100 squares, 
so she’Il accept an estimate based on a sample of 10 squares. 
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. Use Table D or technology to take a simple random sample of 10 grid 
squares. Record the location (e.g., B6) of each square you select. 


. This time, select a stratified random sample using the rows as strata. 
Use Table D or technology to randomly select one square from each 
horizontal row. Record the location of each square—for example, 
Row 1: G, Row 2: B, and so on. 


. Now, take a stratified random sample using the columns as strata. Use 
Table D or technology to randomly select one square from each 
vertical column. Record the location of each square—for example, 
Column A: 4, Column B: 1, and so on. 


. Your teacher will provide the actual number of healthy sunflowers in 
each grid square. Use that information to calculate your estimate of 
the mean number of healthy sunflowers per square in the entire field 
for each of your samples in Steps 1, 2, and 3. 


. Make comparative dotplots showing the mean number of healthy 
sunflowers obtained using the three different sampling methods for 
all members of the class. Describe any similarities and differences 
you see. 


The following dotplots show the mean number of healthy plants in 100 
samples using each of the three sampling methods in the activity: simple 
random sampling, stratified random sampling with rows of the field as 
strata, and stratified random sampling with columns of the field as strata. 
Notice that all three distributions are centered at about 102.5, the true 
mean number of healthy plants in all squares of the field. That makes sense 
because random sampling tends to yield accurate estimates of unknown 
population means. 


e ee 6 ee 
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One other detail stands out in the graphs: there is much less variability 
in the estimates when we use the rows as strata. The table provided by 
your teacher shows the actual number of healthy sunflowers in each grid 
square. Notice that the squares within each row contain a similar number 
of healthy plants but that there are big differences between rows. When we 
can choose strata that have similar responses (e.g., number of healthy 
plants) within strata but different responses between strata, stratified 
random samples give more precise estimates than simple random samples 
of the same size. 


Why didn’t using the columns as strata reduce the variability of the 
estimates in a similar way? Because the numbers of healthy plants vary a 
lot within each column and aren’t very different from other columns. 

Both simple random sampling and stratified random sampling are hard 
to use when populations are large and spread out over a wide area. In that 
situation, we’d prefer a method that selects groups (clusters) of 
individuals that are “near” one another. That’s the idea of cluster 


sampling. 
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DEFINITION Clusters, Cluster sampling 


A cluster is a group of individuals in the population that are located 
near each other. Cluster sampling selects a sample by randomly 
choosing clusters and including each member of the selected clusters 
in the sample. 


Cluster sampling is often used for practical reasons, like saving time 
and money. It works best when the clusters look just like the population 
but on a smaller scale. Imagine a large high school that assigns students to 
homerooms alphabetically by last name, in groups of 25. Administrators 
want to survey 200 randomly selected students about a proposed schedule 
change. It would be difficult to track down an SRS of 200 students, so the 
administration opts for a cluster sample of homerooms. The principal (who 
knows some statistics) selects an SRS of 8 homerooms and gives the 
survey to all 25 students in each homeroom. 


Be sure you understand the difference between strata and clusters. We 
want each stratum to contain similar individuals and for large differences 
to exist between strata. For a cluster sample, we’d like each cluster to look 
just like the population, but on a smaller scale. Unfortunately, cluster 
samples don’t offer the statistical advantage of better information about the 
population that stratified random samples do. Here’s an example that 
compares stratified random sampling and cluster sampling. 


EXAMPLE | Sampling at a school assembly (>) 
Other sampling methods 
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PROBLEM: The student council wants to conduct a survey about use 
of the school library during the first five minutes of an all-school 
assembly in the auditorium. There are 800 students present at the 
assembly. Here is a map of the auditorium. Note that students are seated 
by grade level and that the seats are numbered from 1 to 800. 


9th grade: Seats 601-800 10th grade: Seats 401-600 
11th grade: Seats 201-400 12th grade: Seats 1-200 


a. Describe how to obtain a sample of 80 students using stratified 
random sampling. Explain your choice of strata and why this method 
might be preferred to simple random sampling. 


b. Describe how to obtain a sample of 80 students using cluster 
sampling. Explain your choice of clusters and why this method might 
be preferred to simple random sampling. 


SOLUTION: 


a. Because students’ library use might be similar within grade levels 
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but different across grade levels, use the grade-level seating areas as 
strata. For the 9th grade, generate 20 different random integers from 
601 to 800 and give the survey to the students in those seats. Do the 
same for sophomores, juniors, and seniors using their corresponding 
seat numbers. Stratification by grade level should result in more 
precise estimates of student opinion than a simple random sample of 
the same size. 


b. Each column of seats from the stage to the back of the auditorium 
could be used as a cluster because it would be relatively easy to hand 
out the surveys to an entire column. Label the columns from 1 to 20 
starting at the left side of the stage, generate 2 different integers from 
1 to 20, and give the survey to the 80 students sitting in these two 
columns. Cluster sampling is much more efficient than finding 80 
seats scattered about the auditorium, as required by simple random 
sampling. 


Note that each cluster contains students from all four grade levels, so 
each should represent the population fairly well. Randomly selecting 4 


rows as clusters would also be easy, but this may over- or under- 
represent one grade level. 


FOR PRACTICE, TRY EXERCISE 21 


Most large-scale sample surveys use multistage sampling, which 
combines two or more sampling methods. For example, the U.S. Census 
Bureau carries out a monthly Current Population Survey (CPS) of about 
60,000 households. Researchers start by choosing a stratified random 
sample of neighborhoods in 756 of the 2007 geographical areas in the 
United States. Then they divide each neighborhood into clusters of four 
nearby households and select a cluster sample to interview. 

Analyzing data from sampling methods other than simple random 
sampling takes us beyond basic statistics. But the SRS is the building 
block of more elaborate methods, and the principles of analysis remain 
much the same for these other methods. 
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CHECK YOUR UNDERSTANDING 


(0 


y A factory runs 24 hours a day, producing wood pencils on three 8- 

hour shifts—day, evening, and overnight. In the last stage of 

manufacturing, the pencils are packaged in boxes of 10 pencils each. 

Each day a sample of 300 pencils is selected and inspected for quality. 

1. Describe how to select a stratified random sample of 300 pencils. 
Explain your choice of strata. 

2. Describe how to select a cluster sample of 300 pencils. Explain your 
choice of clusters. 

3. Explain a benefit of using a stratified random sample and a benefit of 
using a cluster sample in this context. 


Sample Surveys: What Else Can Go Wrong? 


As we have learned, the use of bad sampling methods (convenience or 
voluntary response) often leads to bias. Researchers can avoid these 
methods by using random sampling to choose their samples. Other 
problems in conducting sample surveys are more difficult to avoid. 
Sampling is sometimes done using a list of individuals in the 
population, called a sampling frame. Such lists are seldom accurate or 


complete. The result is undercoverage. 


DEFINITION Undercoverage 


Undercoverage occurs when some members of the population are 
less likely to be chosen or cannot be chosen in a sample. 


Most samples suffer from some degree of undercoverage. A sample 
survey of households, for example, will miss not only homeless people but 
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also prison inmates and students in dormitories. An opinion poll conducted 
by calling landline telephone numbers will miss households that have only 
cell phones as well as households without a phone. The results of sample 
surveys may not be accurate if the people who are undercovered differ 
from the rest of the population in ways that affect their responses. 


Even if every member of the population is equally likely to be selected 
for a sample, not all members of the population are equally likely to 
provide a response. Some people are never at home and cannot be reached 
by pollsters on the phone or in person. Other people see an unfamiliar 
phone number on their caller ID and never pick up the phone or quickly 
hang up when they don’t recognize the voice of the caller. These are 
examples of nonresponse, another major source of bias in surveys. 


DEFINITION Nonresponse 


Nonresponse occurs when an individual chosen for the sample can’t 
be contacted or refuses to participate. 


Nonresponse leads to bias when the individuals who can’t be contacted 
or refuse to participate would respond differently from those who do 
participate. Consider a telephone survey that asks people how many hours 
of television they watch per day. People who are selected but are out of the 
house won’t be able to respond. Because these people probably watch less 
television than the people who are at home when the phone call is made, 
the mean number of hours obtained in the sample is likely to be greater 
than the mean number of hours of TV watched in the population. 

How bad is nonresponse? According to polling guru Nate Silver, 
“Response rates to political polls are dismal. Even polls that make every 
effort to contact a representative sample of voters now get no more than 10 
percent to complete their surveys—down from about 35 percent in the 
1990s.”8 In contrast, the Census Bureau’s American Community Survey 
(ACS) has the lowest nonresponse rate of any poll we know: only about 
1% of the households in the sample refuse to respond. The overall 
nonresponse rate, including “never at home” and other causes, is just 
2.5%.2 
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oOo Some students misuse the term voluntary response to explain 


why certain individuals don’t respond in a sample survey. Their belief 
is that participation in the survey is optional (voluntary), so anyone can 
refuse to take part. What the students are describing is nonresponse. Think 
about it this way: nonresponse can occur only after a sample has been 
selected. In a voluntary response sample, every individual has opted to 
take part, so there won’t be any nonresponse. 


The wording of questions has an important influence on the answers 
given to a sample survey. Confusing or leading questions can introduce 
strong bias. Even a single word can make a difference. In a recent 
Quinnipiac University poll, half of the respondents were asked if they 
support “stronger gun laws” and the other half were asked if they support 
“stronger gun control laws.” In the first group, 52% of respondents 
supported stronger laws, but when the word control was added to the 


question, only 46% of respondents supported stronger laws.12 


The gender, age, ethnicity, or behavior of the interviewer can also 
affect people’s responses. People may lie about their age, income, or drug 
use. They may misremember how many hours they spent on the Internet 
last week. Or they might make up an answer to a question that they don’t 
understand. All these issues can lead to response bias. 


DEFINITION Response bias 


Response bias occurs when there is a systematic pattern of inaccurate 
answers to a survey question. 


EXAMPLE | Wash your hands! (>) 


Response bias 


PROBLEM: What percent of Americans wash their hands after using 
the bathroom? It depends on how you collect the data. In a telephone 
survey of 1006 U.S. adults, 96% said they always wash their hands after 
using a public restroom. An observational study of 6028 adults in public 
restrooms told a different story: only 85% of those observed washed 
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their hands after using the restroom. Explain why the results of the two 
studies are so different.14 


SOLUTION: 


When asked in person, many people may lie about always washing their 
hands because they want to appear to have good hygiene. When people 
are only observed and not asked directly, the percent who wash their 
hands will be smaller—and much closer to the truth. 


FOR PRACTICE, TRY EXERCISE 29 


Even the order in which questions are asked is important. For example, 
ask a sample of college students these two questions: 


e “How happy are you with your life in general?” (Answer on a scale of 1 
to 5.) 


e “How many dates did you have last month?” 


There is almost no association between responses to the two questions 
when asked in this order. It appears that dating has little to do with 
happiness. Reverse the order of the questions, however, and a much 
stronger association appears: college students who say they had more dates 
tend to give higher ratings of happiness about life. The lesson is clear: the 
order in which questions are asked can influence the results. 


CHECK YOUR UNDERSTANDING 


1. Each of the following is a possible source of bias in a sample survey. 
Name the type of bias that could result. 
a. The sample is chosen at random from a telephone directory. 


b. Some people cannot be contacted in five calls. 


c. Interviewers choose people walking by on the sidewalk to 


aL? 


interview. 


2. A survey paid for by makers of disposable diapers found that 84% of 
the sample opposed banning disposable diapers. Here is the actual 
question: 


It is estimated that disposable diapers account for less than 2% 
of the trash in today’s landfills. In contrast, beverage containers, 
third-class mail, and yard wastes are estimated to account for 
about 21% of the trash in landfills. Given this, in your opinion, 
would it be fair to ban disposable diapers?22 


Do you think the estimate of 84% is less than, greater than, or about 
equal to the percent of all people in the population who would oppose 
banning disposable diapers? Explain your reasoning. 


Section 4.1 Summary 


A census collects data from every individual in the population. 


A sample survey selects a sample from the population of all 
individuals about which we desire information. The goal of a sample 
survey is to draw conclusions about the population based on data from 
the sample. 


Convenience sampling chooses individuals who are easiest to reach. In 
voluntary response sampling, individuals choose to join the sample in 
response to an open invitation. Both these sampling methods usually 
lead to bias: they will be very likely to underestimate or very likely to 
overestimate the value you want to know. 


Random sampling uses a chance process to select a sample. 


A simple random sample (SRS) gives every possible sample of a given 
size the same chance to be chosen. Choose an SRS by labeling the 
members of the population and using a table of random digits or 
technology to select the sample. 


To use stratified random sampling, divide the population into groups 
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of individuals (strata) that are similar in some way that might affect 
their responses. Then choose a separate SRS from each stratum and 
combine these SRSs to form the sample. When strata are “similar within 
but different between,” stratified random samples tend to give more 
precise estimates of unknown population values than do simple random 
samples. 


e To use cluster sampling, divide the population into groups of 
individuals that are located near each other, called clusters. Randomly 
select some of these clusters. All the individuals in the chosen clusters 
are included in the sample. Ideally, clusters are “different within but 
similar between.” Cluster sampling saves time and money by collecting 
data from entire groups of individuals that are close together. 


e Random sampling helps avoid bias in choosing a sample. Bias can still 
occur in the sampling process due to undercoverage, which happens 
when some members of the population are less likely to be chosen or 
cannot be chosen for the sample. 


e Other serious problems in sample surveys can occur after the sample is 
chosen. The single biggest problem is nonresponse: when people can’t 
be contacted or refuse to answer. Untruthful answers by respondents, 
poorly worded questions, and other problems can lead to response bias. 


4.1 Technology Corners | » 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 
11. Choosing an SRS Page 226 


Section 4.1 Exercises 


1. pg 221 |») Sampling stuffed envelopes A large retailer prepares 


its customers’ monthly credit card bills using an automatic machine 
that folds the bills, stuffs them into envelopes, and seals the envelopes 
for mailing. Are the envelopes completely sealed? Inspectors choose 
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40 envelopes at random from the 1000 stuffed each hour for visual 
inspection. Identify the population and the sample. 


Student archaeologists An archaeological dig turns up large 
numbers of pottery shards, broken stone tools, and other artifacts. 
Students working on the project classify each artifact and assign a 
number to it. The counts in different categories are important for 
understanding the site, so the project director chooses 2% of the 
artifacts at random and checks the students’ work. Identify the 
population and the sample. 


Students as customers A high school’s student newspaper plans to 
survey local businesses about the importance of students as 
customers. From an alphabetical list of all local businesses, the 
newspaper staff chooses 150 businesses at random. Of these, 73 
return the questionnaire mailed by the staff. Identify the population 
and the sample. 


Customer satisfaction A department store mails a customer 
satisfaction survey to people who make credit card purchases at the 
store. This month, 45,000 people made credit card purchases. Surveys 
are mailed to 1000 of these people, chosen at random, and 137 people 
return the survey form. Identify the population and the sample. 


pg 224 |») Sleepless nights How much sleep do high school 


students get on a typical school night? A counselor designed a survey 
to find out. To make data collection easier, the counselor surveyed the 
first 100 students to arrive at school on a particular morning. These 
students reported an average of 7.2 hours of sleep on the previous 
night. 


a. What type of sample did the counselor obtain? 


b. Explain why this sampling method is biased. Is 7.2 hours probably 
greater than or less than the true average amount of sleep last night 
for all students at the school? Why? 


Online polls Parade magazine posed the following question: “Should 
drivers be banned from using all cell phones?” Readers were 
encouraged to vote online at parade.com. The subsequent issue of 
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10. 


11. 


Parade reported the results: 2407 (85%) said “Yes” and 410 (15%) 
said “No.” 


a. What type of sample did the Parade survey obtain? 


b. Explain why this sampling method is biased. Is 85% probably 
greater than or less than the true percent of all adults who believe 
that all cell phone use while driving should be banned? Why? 


Online reviews Many websites include customer reviews of 
products, restaurants, hotels, and so on. The manager of a hotel was 
upset to see that 26% of reviewers on a travel website gave the hotel 
“1 star’”—the lowest possible rating. Explain how bias in the 
sampling method could affect the estimate. 


Funding for fine arts The band director at a high school wants to 
estimate the percentage of parents who support a decrease in the 
budget for fine arts. Because many parents attend the school’s annual 
musical, the director surveys the first 30 parents who arrive at the 
show. Explain how bias in the sampling method could affect the 
estimate. 


Explain it to the congresswoman You are on the staff of a member 
of Congress who is considering a bill that would provide government- 
sponsored insurance for nursing-home care. You report that 1128 
letters have been received on the issue, of which 871 oppose the 
legislation. “I’m surprised that most of my constituents oppose the 
bill. I thought it would be quite popular,” says the congresswoman. 
Are you convinced that a majority of the voters oppose the bill? How 
would you explain the statistical issue to the congresswoman? 


Sampling mall shoppers You may have seen the mall interviewer, 
clipboard in hand, approaching people passing by. Explain why even 
a large sample of mall shoppers would not provide a trustworthy 
estimate of the current unemployment rate in the city where the mall 
is located. 


pg 228 |») Do you trust the Internet? You want to ask a sample 


of high school students the question “How much do you trust 
information about health that you find on the Internet—a great deal, 
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somewhat, not much, or not at all?” You try out this and other 
questions on a pilot group of 5 students chosen from your class. 


a. Explain how you would use a line of Table D to choose an SRS of 


5 students from the following list. 


b. Use line 107 to select the sample. Show how you use each of the 


digits. 

Anderson Drasin Kim 
Arroyo Eckstein Molina 
Batista Fernandez Morgan 
Bell Fullmer Murphy 
Burke Gandhi Nguyen 
Cabrera Garcia Palmiero 
Calloway Glaus Percival 
Delluci Helling Prince 
Deng Husain Puri 

De Ramos Johnson Richards 


Rider 
Rodriguez 
Samuels 
Shen 

Tse 
Velasco 
Wallace 
Washburn 
Zabidi 
Zhao 


12. Apartment living You are planning a report on apartment living ina 
college town. You decide to select three apartment complexes at 


random for in-depth interviews with residents. 


a. Explain how you would use a line of Table D to choose an SRS of 


3 complexes from the following list. 


b. Use line 117 to select the sample. Show how you use each of the 


digits. 

Ashley Oaks Country View 
Bay Pointe Country Villa 
Beau Jardin Crestview 
Bluffs Del-Lynn 
Brandon Place Fairington 
Briarwood Fairway Knolls 
Brownstone Fowler 
Burberry Franklin Park 
Cambridge Georgetown 
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Mayfair Village 
Nobb Hill 
Pemberly Courts 
Peppermill 
Pheasant Run 
Richfield 
Sagamore Ridge 
Salem Courthouse 
Village Manor 


13. 


14. 


15. 


Chauncey Village Greenacres Waterford Court 


Country Squire Lahr House Williamsburg 


Sampling the forest To gather data on a 1200-acre pine forest in 
Louisiana, the U.S. Forest Service laid a grid of 1410 equally spaced 
circular plots over a map of the forest. A ground survey visited a 
sample of 10% of the plots.42 


a. Explain how you would use a random number generator to choose 
an SRS of 141 plots. Your description should be clear enough for a 
classmate to carry out your plan. 


b. Use your method from part (a) to choose the first 3 plots. 


Sampling gravestones The local genealogical society in Coles 
County, Illinois, has compiled records on all 55,914 gravestones in 
cemeteries in the county for the years 1825 to 1985. Historians plan 
to use these records to learn about African Americans in Coles 
County’s history. They first choose an SRS of 395 records to check 


their accuracy by visiting the actual gravestones.44 


a. Explain how you would use a random number generator to choose 
the SRS. Your description should be clear enough for a classmate 
to carry out your plan. 


b. Use your method from part (a) to choose the first 3 gravestones. 


Dead trees On the west side of Rocky Mountain National Park, many 
mature pine trees are dying due to infestation by pine beetles. 
Scientists would like to use sampling to estimate the proportion of all 
pine trees in this area that have been infested. 


a. Explain why it wouldn’t be practical for scientists to obtain an SRS 
in this setting. 


b. A possible alternative would be to use every pine tree along the 
park’s main road as a sample. Why is this sampling method 
biased? 


c. Suppose that a more complicated random sampling plan is carried 
out, and that 35% of the pine trees in the sample are infested by the 
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pine beetle. Can scientists conclude that exactly 35% of all the 
pine trees on the west side of the park are infested? Why or why 
not? 


16. iPhones Suppose 1000 iPhones are produced at a factory today. 
Management would like to ensure that the phones’ display screens 
meet their quality standards before shipping them to retail stores. 
Because it takes about 10 minutes to inspect an individual phone’s 
display screen, managers decide to inspect a sample of 20 phones 
from the day’s production. 


a. Explain why it would be difficult for managers to inspect an SRS 
of 20 iPhones that are produced today. 


b. An eager employee suggests that it would be easy to inspect the 
last 20 iPhones that were produced today. Why isn’t this a good 
idea? 


c. Another employee recommends a different sampling method: 
Randomly choose one of the first 50 iPhones produced. Inspect 
that phone and every fiftieth iPhone produced afterward. (This 
method is known as systematic random sampling.) Explain 
carefully why this sampling method is not an SRS. 


17. No tipping The owner of a large restaurant is considering a new “no 
tipping” policy and wants to survey a sample of employees. The 
policy would add 20% to the cost of food and beverages and the 
additional revenue would be distributed equally among servers and 
kitchen staff. Describe how to select a stratified random sample of 
approximately 30 employees. Explain your choice of strata and why 
stratified random sampling might be preferred in this context. 


18. Parking on campus The director of student life at a university wants 
to estimate the proportion of undergraduate students who regularly 
park a car on campus. Describe how to select a stratified random 
sample of approximately 100 students. Explain your choice of strata 
and why stratified random sampling might be preferred in this 
context. 


19. SRS of engineers? A corporation employs 2000 male and 500 female 
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20. 


21. 


22. 


engineers. A stratified random sample of 200 male and 50 female 
engineers gives every individual in the population the same chance to 
be chosen for the sample. Is it an SRS? Explain your answer. 


SRS of students? At a party, there are 30 students over age 21 and 20 
students under age 21. You choose at random 3 of those over 21 and 
separately choose at random 2 of those under 21 to interview about 
their attitudes toward alcohol. You have given every student at the 
party the same chance to be interviewed. Is your sample an SRS? 
Explain your answer. 


pg 231 |») How is your room? A hotel has 30 floors with 40 


rooms per floor. The rooms on one side of the hotel face the water, 
while rooms on the other side face a golf course. There is an extra 
charge for the rooms with a water view. The hotel manager wants to 
select 120 rooms and survey the registered guest in each of the 
selected rooms about his or her overall satisfaction with the property. 


a. Describe how to obtain a sample of 120 rooms using stratified 


random sampling. Explain your choice of strata and why this 
method might be preferred to simple random sampling. 


b. Describe how to obtain a sample of 120 rooms using cluster 


sampling. Explain your choice of clusters and why this method 
might be preferred to simple random sampling. 


Go Blue! Michigan Stadium, also known as “The Big House,” seats 
over 100,000 fans for a football game. The University of Michigan 
Athletic Department wants to survey fans about concessions that are 
sold during games. Tickets are most expensive for seats on the 
sidelines. The cheapest seats are in the end zones (where one of the 
authors sat as a student). A map of the stadium is shown. 
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Visitor Sideline 


ANOZQGNG 


Michigan Sideline 


Wl Sideline al sl 
[™) Corner 
[3 Endzone 


a. Describe how to obtain a sample using stratified random sampling. 
Explain your choice of strata and why this method might be 
preferred to simple random sampling. 


b. Describe how to obtain a sample using cluster sampling. Explain 
your choice of clusters and why this method might be preferred to 
simple random sampling. 


High-speed Internet Laying fiber-optic cable is expensive. Cable 
companies want to make sure that if they extend their lines to less 
dense suburban or rural areas, there will be sufficient demand so the 
work will be cost-effective. They decide to conduct a survey to 
determine the proportion of households in a rural subdivision that 
would buy the service. They select a simple random sample of 5 
blocks in the subdivision and survey each family that lives on one of 
those blocks. 


a. What is the name for this kind of sampling method? 


b. Give a possible reason why the cable company chose this method. 


Timber! A lumber company wants to estimate the proportion of trees 
in a large forest that are ready to be cut down. They use an aerial map 
to divide the forest into 200 equal-sized rectangles. Then they choose 
a random sample of 20 rectangles and examine every tree that’s in 
one of those rectangles. 


926 


pase 


26. 
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a. What is the name for this kind of sampling method? 


b. Give a possible reason why the lumber company chose this 
method. 


Eating on campus The director of student life at a small college 
wants to know what percent of students eat regularly in the cafeteria. 
To find out, the director selects an SRS of 300 students who live in 
the dorms. Describe how undercoverage might lead to bias in this 
study. Explain the likely direction of the bias. 


Immigration reform A news organization wants to know what 
percent of U.S. residents support a “pathway to citizenship” for 
people who live in the United States illegally. The news organization 
randomly selects registered voters for the survey. Describe how 
undercoverage might lead to bias in this study. Explain the likely 
direction of the bias. 


Reporting weight loss A total of 300 people participated in a free 12- 
week weight-loss course at a community health clinic. After one year, 
administrators emailed each of the 300 participants to see how much 
weight they had lost since the end of the course. Only 56 participants 
responded to the survey. The mean weight loss for this sample was 
13.6 pounds. Describe how nonresponse might lead to bias in this 
study. Explain the likely direction of the bias. 


Nonresponse A survey of drivers began by randomly sampling from 
all listed residential telephone numbers in the United States. Of 
45,956 calls to these numbers, 5029 were completed. The goal of the 
survey was to estimate how far people drive, on average, per day.12 
Describe how nonresponse might lead to bias in this study. Explain 
the likely direction of the bias. 


pg 234 |») Running red lights An SRS of 880 drivers was asked: 


“Recalling the last ten traffic lights you drove through, how many of 
them were red when you entered the intersections?” Of the 880 
respondents, 171 admitted that at least one light had been red. A 
practical problem with this survey is that people may not give truthful 
answers. Explain the likely direction of the bias. 
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Seat belt use A study in El Paso, Texas, looked at seat belt use by 
drivers. Drivers were observed at randomly chosen convenience 
stores. After they left their cars, they were invited to answer questions 
that included questions about seat belt use. In all, 75% said they 
always used seat belts, yet only 61.5% were wearing seat belts when 
they pulled into the store parking lots.t© Explain why the two 
percentages are so different. 


Boys don’t cry? Two female statistics students asked a random 
sample of 60 high school boys if they have ever cried during a movie. 
Thirty of the boys were asked directly and the other 30 were asked 
anonymously by means of a “secret ballot.” When the responses were 
anonymous, 63% of the boys said “Yes,” whereas only 23% of the 
other group said “Yes.” Explain why the two percentages are so 
different. 


Weight? Wait what? Marcos asked a random sample of 50 mall 
shoppers for their weight. Twenty-five of the shoppers were asked 
directly and the other 25 were asked anonymously by means of a 
“secret ballot.” The mean reported weight was 13 pounds heavier for 


the anonymous group. Explain why the two means are so different.12 


Wording bias Comment on each of the following as a potential 
sample survey question. Is the question clear? Is it slanted toward a 
desired response? 


a. “Some cell phone users have developed brain cancer. Should all 
cell phones come with a warning label explaining the danger of 
using cell phones?” 


b. “Do you agree that a national system of health insurance should be 


favored because it would provide health insurance for everyone 
and would reduce administrative costs?” 


c. “In view of escalating environmental degradation and incipient 
resource depletion, would you favor economic incentives for 
recycling of resource—intensive consumer goods?” 


Checking for bias Comment on each of the following as a potential 
sample survey question. Is the question clear? Is it slanted toward a 
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desired response? 


a. Which of the following best represents your opinion on gun 
control? 


i. The government should confiscate our guns. 
ii. We have the right to keep and bear arms. 


b. A freeze in nuclear weapons should be favored because it would 
begin a much-needed process to stop everyone in the world from 
building nuclear weapons now and reduce the possibility of 
nuclear war in the future. Do you agree or disagree? 


Multiple Choice Select the best answer for Exercises 35—40. 


30. 


36. 


A popular website places opinion poll questions next to many of its 
news stories. Simply click your response to join the sample. One of 
the questions was “Do you plan to diet this year?” More than 30,000 
people responded, with 68% saying “Yes.” Which of the following is 
true? 


a. About 68% of Americans planned to diet. 


b. The poll used a convenience sample, so the results tell us little 
about the population of all adults. 


c. The poll uses voluntary response, so the results tell us little about 
the population of all adults. 


d. The sample is too small to draw any conclusion. 


e. None of these. 


To gather information about the validity of a new standardized test 
for 10th-grade students in a particular state, a random sample of 15 
high schools was selected from the state. The new test was 
administered to every 10th-grade student in the selected high schools. 
What kind of sample is this? 


a. A simple random sample 
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b. A stratified random sample 
c. A cluster sample 
d. A systematic random sample 


e. A voluntary response sample 


Your statistics class has 30 students. You want to ask an SRS of 5 
students from your class whether they use a mobile device for the 
online quizzes. You label the students 01, 02, ... , 30. You enter the 
table of random digits at this line: 

14459 26056 31424 80371 65103 62253 22490 61181 


Your SRS contains the students labeled 
a. 14, 45, 92, 60, 56. 
b. 14, 31, 03, 10, 22. 
c. 14, 03, 10, 22, 22. 
d. 14, 03, 10, 22, 06. 
e. 14, 03, 10, 22, 11. 


Suppose that 35% of the voters in a state are registered as 
Republicans, 40% as Democrats, and 25% as Independents. A 
newspaper wants to select a sample of 1000 registered voters to 
predict the outcome of the next election. If it randomly selects 350 
Republicans, randomly selects 400 Democrats, and randomly selects 
250 Independents, did this sampling procedure result in a simple 
random sample of registered voters from this state? 


a. Yes, because each registered voter had the same chance of being 
chosen. 


b. Yes, because random chance was involved. 


c. No, because not all registered voters had the same chance of being 
chosen. 


d. No, because a different number of registered voters was selected 
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40. 


from each party. 


e. No, because not all possible groups of 1000 registered voters had 
the same chance of being chosen. 


A local news agency conducted a survey about unemployment by 
randomly dialing phone numbers during the work day until it 
gathered responses from 1000 adults in its state. In the survey, 19% of 
those who responded said they were not currently employed. In 
reality, only 6% of the adults in the state were not currently employed 
at the time of the survey. Which of the following best explains the 
difference in the two percentages? 


a. The difference is due to sampling variability. We shouldn’t expect 
the results of a random sample to match the truth about the 
population every time. 


b. The difference is due to response bias. Adults who are employed 
are likely to lie and say that they are unemployed. 


c. The difference is due to undercoverage bias. The survey included 
only adults and did not include teenagers who are eligible to work. 


d. The difference is due to nonresponse bias. Adults who are 
employed are less likely to be available for the sample than adults 
who are unemployed. 


e. The difference is due to voluntary response. Adults are able to 
volunteer as a member of the sample. 


A simple random sample of 1200 adult Americans is selected, and 
each person is asked the following question: “In light of the huge 
national deficit, should the government at this time spend additional 
money to send humans to Mars?” Only 39% of those responding 
answered “Yes.” This survey 


a. is reasonably accurate because it used a large simple random 
sample. 


b. needs to be larger because only about 24 people were drawn from 
each state. 
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c. probably understates the percent of people who favor sending 
humans to Mars. 


d. is very inaccurate but neither understates nor overstates the percent 
of people who favor sending humans to Mars. Because simple 
random sampling was used, it is unbiased. 


e. probably overstates the percent of people who favor sending 
humans to Mars. 


Recycle and Review 


41. Don’t turn it over (3.2) How many points do turnovers cost teams in 
the NFL? The scatterplot shows the relationship between x = number 
of turnovers and y = number of points scored by teams in the NFL 
during 2015, along with the least-squares regression line 
y\=460.2-4.084x9 = 460.2 — 4.0842, 
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a. Interpret the slope of the regression line in context. 
b. For this regression line, s = 57.3. Interpret this value. 


c. Calculate and interpret the residual for the San Francisco 49ers, 
who had 17 turnovers and scored 238 points. 


d. How does the point for the 49ers affect the least-squares regression 
line and standard deviation of the residuals? Explain your answer. 


42. Internet charges (2.1) Some Internet service providers (ISPs) charge 
companies based on how much bandwidth they use in a month. One 
method that ISPs use to calculate bandwidth is to find the 95th 


932 


percentile of a company’s usage based on samples of hundreds of 5- 


minute intervals during a month. 
a. Explain what “95th percentile” means in this setting. 


b. Is it possible to determine the z-score for a usage total that is at the 
95th percentile? If so, find the z-score. If not, explain why not. 
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SECTION 4.2 Experiments 


LEARNING TARGETS. By the end of the section, you should be able to: 


Explain the concept of confounding and how it limits the ability to 
make cause-and-effect conclusions. 


Distinguish between an observational study and an experiment, and 
identify the explanatory and response variables in each type of 
study. 


Identify the experimental units and treatments in an experiment. 


Describe the placebo effect and the purpose of blinding in an 
experiment. 


Describe how to randomly assign treatments in an experiment using 
slips of paper, technology, or a table of random digits. 


Explain the purpose of comparison, random assignment, control, 
and replication in an experiment. 


Describe a completely randomized design for an experiment. 


Describe a randomized block design and a matched pairs design for 
an experiment and explain the purpose of blocking in an 
experiment. 


A sample survey aims to gather information about a population without 
disturbing the population in the process. Sample surveys are one kind of 
observational study. Other observational studies record the behavior of 
animals in the wild or track the medical history of volunteers to look for 
associations between variables such as type of diet, amount of exercise, 
and blood pressure. 


DEFINITION Observational study 


An observational study observes individuals and measures variables 
of interest but does not attempt to influence the responses. 
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Section 4.2 is about statistical designs for experiments, a very different 
way to produce data. 


Observational Studies Versus Experiments 


Is taking a vitamin D supplement good for you? Hundreds of observational 
studies have looked at the relationship between vitamin D concentration in 
a person’s blood and various health outcomes.28 In one observational 
study, researchers found that teenage girls with higher vitamin D intakes 
were less likely to suffer broken bones.!2 Other observational studies have 
shown that people with higher vitamin D concentration have less 
cardiovascular disease, better cognitive function, and less risk of diabetes 
than people with lower concentrations of vitamin D. 


y 


Tatiana Popova/Shutterstock.com 


In the observational studies involving vitamin D and diabetes, the 
explanatory variable is vitamin D concentration in the blood and the 
response variable is diabetes status—whether or not the person developed 
diabetes. 


DEFINITION Response variable, Explanatory variable 


A response variable measures an outcome of a study. An 
explanatory variable may help explain or predict changes in a 
response variable. 


Unfortunately, it is very difficult to show that taking vitamin D causes 
a lower risk of diabetes using an observational study. As shown in the 
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table, there are many possible differences between the group of people 
with high vitamin D concentration and the group of people with low 
vitamin D concentration. Any of these differences could be causing the 
difference in diabetes risk between the two groups of people. 


Variable Group 1 Group 2 

Vitamin D concentration High vitamin D Low vitamin D 

(explanatory) concentration concentration 

Quality of diet Better diet Worse diet 

Amount of exercise More exercise Less exercise 

Amount of vitamin More likely to take other Less likely to take other 

supplementation vitamins vitamins 

Diabetes status (response) Less likely to have More likely to have 
diabetes diabetes 


For example, it is possible that people who have healthier diets eat lots 
of foods that are high in vitamin D. Likewise, it is possible that people 
with healthier diets are less likely to develop diabetes. Vitamin D 
concentration may not have anything to do with diabetes status, even 
though there is an association between the two variables. In this case, we 
say there is confounding between vitamin D concentration and diet 
because we cannot tell which variable is causing the change in diabetes 
status. 


Some people call a variable that results in confounding, like diet in this case, a 


confounding variable. 


DEFINITION Confounding 


Confounding occurs when two variables are associated in such a way 
that their effects on a response variable cannot be distinguished from 
each other. 


AP® EXAM TIP 
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If you are asked to identify a possible confounding variable in a given 
setting, you are expected to explain how the variable you choose (1) is 
associated with the explanatory variable and (2) is associated with the 
response variable. 


Likewise, because sun exposure increases vitamin D concentration, it 
is possible that people who exercise a lot outside have higher 
concentrations of vitamin D. If people who exercise a lot are also less 
likely to get diabetes, then amount of exercise and vitamin D concentration 
are confounded—we can’t say which variable is the cause of the smaller 
diabetes risk. 


EXAMPLE |Smoking and ADHD (>) 


Confounding 


Ai/Getty Images 


PROBLEM: Ina study of more than 4700 children, researchers from 
Cincinnati Children’s Hospital Medical Center found that those children 
whose mothers smoked during pregnancy were more than twice as 
likely to develop ADHD as children whose mothers had not smoked.22 
Explain how confounding makes it unreasonable to conclude that a 
mother’s smoking during pregnancy causes an increase in the risk of 
ADHD in her children based on this study. 


SOLUTION: 


It is possible that the mothers who smoked during pregnancy were also 
more likely to have unhealthy diets. If people with unhealthy diets are 


937 


also more likely to have children with ADHD, then it could be that 
unhealthy diets caused the increase in ADHD risk, not smoking. 


Notice that the solution describes how diet might be associated with the 


explanatory variable (smoking status) and with the response variable 
(ADHD status). 


FOR PRACTICE, TRY EXERCISE 43 


Observational studies of the effect of an explanatory variable on a 
response variable often fail because of confounding between the 
explanatory variable and one or more other variables. In contrast to 
observational studies, experiments don’t just observe individuals or ask 
them questions. They actively impose some treatment to measure the 
response. Experiments can answer questions like “Does aspirin reduce the 
chance of a heart attack?” and “Can yoga help dogs live longer?” 


DEFINITION Experiment 


An experiment deliberately imposes some treatment on individuals 
to measure their responses. 


To determine if taking vitamin D actually causes a reduction in 
diabetes risk, researchers in Norway performed an experiment. The 
researchers randomly assigned 500 people with pre-diabetes to either take 
a high dose of vitamin D or to take a placebo—a pill that looked exactly 
like the vitamin D supplement but contained no active ingredient. After 5 
years, about 40% of the people in each group were diagnosed with 
diabetes.2= In other words, the association between vitamin D 
concentration and diabetes status disappeared when comparing two groups 
that were roughly the same to begin with. 


DEFINITION Placebo 


A placebo is a treatment that has no active ingredient, but is 
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otherwise like other treatments. 


The experiment in Norway avoided confounding by letting chance 
decide who took vitamin D and who didn’t. That way, people with 
healthier diets were split about evenly between the two groups. So were 
people who exercise a lot and people who take other vitamins. When our 
goal is to understand cause and effect, experiments are the only source of 
fully convincing data. For this reason, the distinction between 
observational study and experiment is one of the most important in 
Statistics. 


EXAMPLE | Facebook and financial incentives >» 


Observational studies and experiments 


os _ a? 

Rostislav_Sedlacek/Shutterstock.com 
PROBLEM: In each of the following settings, identify the explanatory 
and response variables. Then determine if each is an experiment or an 
observational study. Explain your reasoning. 


a. In a study conducted by researchers at the University of Texas, 
people were asked about their social media use and satisfaction with 
their marriage. Of the heavy social media users, 32% had thought 
seriously about leaving their spouse. Only 16% of non-social media 


users had thought seriously about leaving their spouse.22 


b. Ina diet study using 100 overweight volunteers, 50 volunteers were 
randomly assigned to receive weight-loss counseling, monthly weigh- 
ins, and a three-month gym pass. The other 50 volunteers were given 
financial incentives (earning $20 for losing 4 pounds in a month or 
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paying $20 otherwise) along with the counseling, weigh-ins, and gym 
pass. The group with the financial incentives lost 6.7 more pounds, 
on average." 


SOLUTION: 


a. Explanatory variable: Frequency of social media use. Response 
variable: Marital satisfaction. This is an observational study because 
people weren’t assigned to use social media or not. 


b. Explanatory variable: Whether or not financial incentives were 
given. Response variable: Amount of weight lost. This is an 
experiment because researchers gave some volunteers financial 
incentives and did not give financial incentives to the other 
volunteers. 


In part (a), the response variable is not the percent who thought about 
leaving their spouse. This percent is a summary of all the responses. 


Likewise, in part (b), the response variable is not the average weight 
loss. This average is a summary of all the responses. 


FOR PRACTICE, TRY EXERCISE 45 


In part (a) of the example, it would be incorrect to conclude that using 
social media causes marital dissatisfaction. It could be that other variables 
are confounded with social media use—or even that marital dissatisfaction 
is causing increased social media use. In part (b), it is reasonable to 
conclude that the financial incentives caused the increase in weight loss 
because this was a well-designed experiment. 


CHECK YOUR UNDERSTANDING 


R 
J 
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1. Does reducing screen brightness increase battery life in laptop 
computers? To find out, researchers obtained 30 new laptops of the 
same brand. They chose 15 of the computers at random and adjusted 
their screens to the brightest setting. The other 15 laptop screens were 
left at the default setting—moderate brightness. Researchers then 
measured how long each machine’s battery lasted. Was this an 
observational study or an experiment? Justify your answer. 


Questions 2-4 refer to the following setting. Does eating dinner with 
their families improve students’ academic performance? According to 
an ABC News article, “Teenagers who eat with their families at least 
five times a week are more likely to get better grades in school.”24 
This finding was based on a sample survey conducted by researchers 


at Columbia University. 


2. Was this an observational study or an experiment? Justify your 
answer. 


3. What are the explanatory and response variables? 


4. Explain clearly why such a study cannot establish a cause-and-effect 
relationship. Suggest a variable that may be confounded with whether 
families eat dinner together. 


The Language of Experiments 


An experiment is a statistical study in which we actually do something (a 
treatment) to people, animals, or objects (the experimental units or 
subjects) to observe the response. 


DEFINITION Treatment, Experimental unit, Subjects 


A specific condition applied to the individuals in an experiment is 
called a treatment. If an experiment has several explanatory 
variables, a treatment is a combination of specific values of these 
variables. An experimental unit is the object to which a treatment is 
randomly assigned. When the experimental units are human beings, 
they are often called subjects. 
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The best way to learn the language of experiments is to practice using 
it. 


EXAMPLE | How can we prevent malaria? >» 


Vocabulary of experiments 


PROBLEM: Malaria causes hundreds of thousands of deaths each year, 
with many of the victims being children. Will regularly screening 
children for the malaria parasite and treating those who test positive 
reduce the proportion of children who develop the disease? Researchers 
worked with children in 101 schools in Kenya, randomly assigning half 
of the schools to receive regular screenings and follow-up treatments 
and the remaining schools to receive no regular screening. Children at 
all 101 schools were tested for malaria at the end of the study.22 Identify 
the treatments and the experimental units in this experiment. 


SOLUTION: 


This experiment compares two treatments: (1) regular screenings and 
follow-up treatments and (2) no regular screening. The experimental 
units are 101 schools in Kenya. 


Note that the experimental units are the schools, not the students. The 


decision about who to screen was made school by school, not student by 


student. All students at the same school received the same treatment. 
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In the malaria experiment, there was one explanatory variable: 
screening status. In other experiments, there are multiple explanatory 
variables. Sometimes, these explanatory variables are called factors. In an 
experiment with multiple factors, the treatments are formed using the 
various levels of each of the factors. 
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DEFINITION Factor, Levels 


In an experiment, a factor is a variable that is manipulated and may 
cause a change in the response variable. The different values of a 
factor are called levels. 


Here’s an example of a multifactor experiment. 


EXAMPLE |The five-second rule >» 


Experiments with multiple explanatory variables 


Zaia Snively 


PROBLEM: Have you ever dropped a tasty piece of food on the 
ground, then quickly picked it up and eaten it? If so, you probably 
thought about the “five-second rule,” which states that a piece of food is 
safe to eat if it has been on the floor less than 5 seconds. The rule is 
based on the belief that bacteria need time to transfer from the floor to 
the food. But does it work? 

Researchers from Rutgers University put the five-second rule to the 
test. They used four different types of food: watermelon, bread, bread 
with butter, and gummy candy. They dropped the food onto four 
different surfaces: stainless steel, ceramic tile, wood, and carpet. And 
they waited for four different lengths of time: less than 1 second, 5 
seconds, 30 seconds, and 300 seconds. Finally, they used bacteria 
prepared two different ways: in a tryptic soy broth or peptone buffer. 
Once the bacteria were ready, the researchers spread them out on the 
different surfaces and started dropping food.2° 


a. List the factors in this experiment and the number of levels for each 
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factor. 


b. If the researchers used every possible combination to form the 
treatments, how many treatments were included in the experiment? 


c. List two of the treatments. 
SOLUTION: 


a. Type of food (4 levels), type of surface (4 levels), amount of time (4 
levels), method of bacterial preparation (2 levels) 


b. 4x 4x 4x2 = 128 different treatments 


c. Watermelon/stainless steel/less than 1 second/tryptic soy broth; 
gummy candy/wood/300 seconds/peptone buffer 
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What did the researchers discover? The wetter foods had greater 
bacterial transfer and food dropped on carpet had the least bacterial 
transfer. There was greater bacterial transfer the longer the food was on the 
surface, although there was some transfer that happened almost 
instantaneously. Overall, the researchers concluded that the type of food 
and type of surface were at least as important as the amount of time the 
food remained on the surface. 


This example shows how experiments allow us to study the combined 
effect of several factors. The interaction of several factors can produce 
effects that could not be predicted from looking at the effect of each factor 
alone. For example, although longer time was associated with more 
bacterial transfer in general, this relationship might not be true for very 
moist food. 


Designing Experiments: Comparison 


Experiments are the preferred method for examining the effect of one 
variable on another. By imposing the specific treatment of interest and 
controlling other influences, we can pin down cause and effect. Good 
designs are essential for effective experiments, just as they are for 
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sampling. To see why, let’s start with an example of a bad experimental 
design. 


Zaia Snively 


Does caffeine affect pulse rate? Many students regularly consume 
caffeine to help them stay alert. So it seems plausible that taking caffeine 
might increase an individual’s pulse rate. Is this true? One way to 
investigate this claim is to ask volunteers to measure their pulse rates, 
drink some cola with caffeine, measure their pulse rates again after 10 
minutes, and calculate the increase in pulse rate. 


This experiment has a very simple design. A group of subjects (the 
students) were exposed to a treatment (the cola with caffeine), and the 
outcome (change in pulse rate) was observed. Here is the design: 


Students — Cola with caffeine — Change in pulse rate 
Students — Cola with caffeine —» Change in pulse rate 


Unfortunately, even if the pulse rate of every student went up, we couldn’t 
attribute the increase to caffeine. Perhaps the excitement of being in an 
experiment made their pulse rates increase. Maybe it was the sugar in the 
cola and not the caffeine. Perhaps their teacher told them a funny joke 
during the 10-minute waiting period and made everyone laugh. In other 
words, there are many other variables that are potentially confounded with 
taking caffeine. 


Many laboratory experiments use a design like the one in the caffeine 
example: 
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Experimental units — Treatment — Measure response 
Experimental units — Treatment — Measure response 


In the lab environment, simple designs often work well. Field experiments 
and experiments with animals or people deal with more varied conditions. 
Outside the lab, badly designed experiments often yield worthless results 
because of confounding. 


The remedy for the confounding in the caffeine example is to do a 
comparative experiment with two groups: one group that receives caffeine 
and a control group that does not receive caffeine. 


DEFINITION Control group 


In an experiment, a control group is used to provide a baseline for 
comparing the effects of other treatments. Depending on the purpose 
of the experiment, a control group may be given an inactive treatment 
(placebo), an active treatment, or no treatment at all. 


In all other aspects, these groups should be treated exactly the same so 
that the only difference is the caffeine. That way, if there is convincing 
evidence of a difference in the average increase in pulse rates, we can 
safely conclude it was caused by the caffeine. This means that one group 
could get regular cola with caffeine, while the control group gets caffeine- 
free cola. Both groups would get the same amount of sugar, so sugar 
consumption would no longer be confounded with caffeine intake. 
Likewise, both groups would experience the same events during the 
experiment, so what happens during the experiment won’t be confounded 
with caffeine intake either. 


EXAMPLE | Preventing malaria (>) 


Control groups 
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Alexander Joe/Getty Images 


PROBLEM: In an earlier example, we described an experiment in 
which researchers randomly assigned 101 schools in Kenya to either 
receive regular screenings and follow-up treatments or to receive no 
regular screenings. Explain why it was necessary to include a control 
group of schools that didn’t receive regular screenings. 


SOLUTION: 


The purpose of the control group is to provide a baseline for comparing 
the effect of the regular screenings and follow-up treatments. Otherwise, 
researchers wouldn’t be able to determine if a decrease in malaria rates 
was due to the treatment or some other change that occurred during the 
experiment (like a drought that killed off mosquitos, slowing the spread 
of malaria). 
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A control group was essential in the malaria experiment to determine if 
screening was effective. However, not all experiments include a control 
group—as long as comparison takes place. In the experiment about the 
five-second rule, there were 128 different treatments being compared and 
no control group. A control group wasn’t essential in this experiment 
because researchers were interested in comparing different amounts of 
time on the floor, different types of food, and different types of surfaces. 


Designing Experiments: Blinding and the Placebo Effect 


In the caffeine experiment, we used comparison to help prevent 
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confounding. But even when there is comparison, confounding is still 
possible. If the subjects in the experiment know what type of soda they are 
receiving, the expectations of the two groups will be different. The 
knowledge that a subject is receiving caffeine may increase his or her 
pulse rate, apart from the caffeine itself. This is an example of the placebo 
effect. 


DEFINITION Placebo effect 


The placebo effect describes the fact that some subjects in an 
experiment will respond favorably to any treatment, even an inactive 
treatment. 


In one study, researchers zapped the wrists of 24 test subjects with a 
painful jolt of electricity. Then they rubbed a cream with no active 
medicine on subjects’ wrists and told them the cream should help soothe 
the pain. When researchers shocked them again, 8 subjects said they 
experienced significantly less pain.22 When the ailment is psychological, 
like depression, some experts think that the placebo effect accounts for 
about three-quarters of the effect of the most widely used drugs.28 


Because of the placebo effect, it is important that subjects don’t know 
what treatment they are receiving. It is also better if the people interacting 
with the subjects and measuring the response variable don’t know which 
subjects are receiving which treatment. When neither group knows who is 
receiving which treatment, the experiment is double-blind. Other 
experiments are single-blind. 


DEFINITION Double-blind, Single-blind 


In a double-blind experiment, neither the subjects nor those who 
interact with them and measure the response variable know which 
treatment a subject received. 


In a single-blind experiment, either the subjects don’t know which 
treatment they are receiving or the people who interact with them and 
measure the response variable don’t know which subjects are 
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receiving which treatment. 


The idea of a double-blind design is simple. Until the experiment ends 
and the results are in, only the study’s statistician knows for sure which 
treatment a subject is receiving. However, some experiments cannot be 
carried out in a double-blind manner. For example, if researchers are 
comparing the effects of exercise and dieting on weight loss, then subjects 
will know which treatment they are receiving. Such an experiment can still 
be single-blind if the individuals who are interacting with the subjects and 
measuring the response variable don’t know who is dieting and who is 
exercising. In other single-blind experiments, the subjects are unaware of 
which treatment they are receiving, but the people interacting with them 
and measuring the response variable do know. 


EXAMPLE |Do magnets repel pain? (») 
Blinding and the placebo effect 


Eric O’Connell/Getty Images 


PROBLEM: Early research showed that magnetic fields affected living 
tissue in humans. Some doctors have begun to use magnets to treat 
patients with chronic pain. Scientists wondered if this type of therapy 
really worked. They designed a double-blind experiment to find out. A 
total of 50 patients with chronic pain were recruited for the study. A 
doctor identified a painful site on each patient and asked him or her to 
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rate the pain on a scale from 0 (mild pain) to 10 (severe pain). Then the 
doctor selected a sealed envelope containing a magnet at random from a 
box with a mixture of active and inactive magnets. The chosen magnet 
was applied to the site of the pain for 45 minutes. After being treated, 
each patient was again asked to rate the level of pain from 0 to 10.22 


a. Explain what it means for this experiment to be double-blind. 


b. Why was it important for this experiment to be double-blind? 
SOLUTION: 


a. Neither the subjects nor the doctors applying the magnets and 
recording the pain ratings knew which subjects had the active 
magnets and which had the inactive magnets. 


b. If subjects knew they were receiving an active treatment, researchers 
wouldn’t know if any improvement was due to the magnets or to the 
expectation of getting better (the placebo effect). If the doctors knew 
which subjects received which treatments, they might treat one group 
of subjects differently from the other group. This would make it 
difficult to know if the magnets were the cause of any improvement. 
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CHECK YOUR UNDERSTANDING 


. 


y A new analysis is casting doubt on a claimed benefit of omega-3 
fish oil. For years, doctors have been recommending eating fish and 
taking fish oil supplements to prevent heart disease. But the new 
analysis reviewed 20 previous studies and showed that the effects of 
omega-3 aren’t as great as once suspected. One reason is that an early 
trial of omega-3 supplements was conducted as an open-label study.22 
In this type of study, both patients and researchers know who is 
receiving which treatment. 
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1. Describe a potential problem with an open-label study in this 
context. 
2. Describe how you can fix the problem identified in Question 1. 


Designing Experiments: Random Assignment 


Comparison alone isn’t enough to produce results we can trust. If the 
treatments are given to groups that differ greatly when the experiment 
begins, confounding will result. If we allow students to choose what type 
of cola they will drink in the caffeine experiment, students who consume 
caffeine on a regular basis might be more likely to choose the regular cola. 
Due to their caffeine tolerance, these students’ pulse rates might not 
increase as much as other students’ pulse rates. In this case, caffeine 
tolerance would be confounded with the amount of caffeine consumed, 
making it impossible to conclude cause and effect. 


To create roughly equivalent groups at the beginning of an experiment, 
we use random assignment to determine which experimental units get 
which treatment. 


DEFINITION Random assignment 


In an experiment, random assignment means that experimental units 
are assigned to treatments using a chance process. 


Let’s look at how random assignment can be used to improve the 
design of the caffeine experiment. 


EXAMPLE | Caffeine and pulse rates >» 


How random assignment works 


ool 


Zaia Snively 


PROBLEM: A total of 20 students have agreed to participate in an 
experiment comparing the effects of caffeinated cola and caffeine-free 
cola on pulse rates. Describe how you would randomly assign 10 
students to each of the two treatments: 


a. Using 20 identical slips of paper 
b. Using technology 
c. Using Table D 


SOLUTION: 


a. On 10 slips of paper, write the letter “A”; on the remaining 10 slips, 
write the letter “B.” Shuffle the slips of paper and hand out one slip 
of paper to each volunteer. Students who get an “A” slip receive the 
cola with caffeine and students who get a “B” slip receive the cola 
without caffeine. 


When describing a method of random assignment, don’t stop after 


creating the groups. Make sure to identify which group gets which 


treatment. 


b. Label each student with a different integer from 1 to 20. Then 
randomly generate 10 different integers from 1 to 20. The students 
with these labels receive the cola with caffeine. The remaining 10 
students receive the cola without caffeine. 


| When using a random number generator or a table of random digits to | 


p02 


assign treatments, make sure to account for the possibility of repeated 
numbers when describing your method. 


c. Label each student with a different integer from 01 to 20. Go toa 
line of Table D and read two-digit groups moving from left to right. 
The first 10 different labels between 01 and 20 identify the 10 
students who receive cola with caffeine. The remaining 10 students 
receive the caffeine-free cola. Ignore groups of digits from 21 to 00. 
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Random assignment should distribute the students who regularly 
consume caffeine in roughly equal numbers to each group. It should also 
balance out the students with high metabolism and those with larger body 
sizes in the caffeine and caffeine-free groups. Random assignment helps 
ensure that the effects of other variables (e.g., caffeine tolerance, 
metabolism, or body size) are spread evenly among the two groups. 


Designing Experiments: Control 


Although random assignment should create two groups of students that are 
roughly equivalent to begin with, we still have to ensure that the only 
consistent difference between the groups during the experiment is the type 
of cola they receive. We can control the effects of some variables by 
keeping them the same for both groups. For example, we should make 
both treatments contain the same amount of sugar. If one group got regular 
cola and the other group got caffeine-free diet cola, then the amount of 
sugar would be confounded with the amount of caffeine—we wouldn’t 
know if it was the sugar or the caffeine that was causing a change in pulse 
rates. 


DEFINITION Control 


In an experiment, control means keeping other variables constant for 
all experimental units. 
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We also want to control other variables to reduce the variability in the 
response variable. Suppose we let volunteers in both groups choose how 
much cola they want to drink. In that case, the changes in pulse rate would 
be more variable than if we made sure each subject drank the same amount 
of soda. Letting the amount of cola vary will make it harder to determine if 
caffeine is really having an effect on pulse rates. 


The dotplots on the left show the results of an experiment in which the 
amount of cola consumed was the same for all participating students. 
Because there is so little overlap in these graphs, it seems clear that 
caffeine increases pulse rates. The dotplots on the right show the results of 
an experiment in which the students were permitted to choose how much 
or how little cola they consumed. Notice that the centers of the 
distributions haven’t changed, but the distributions are much more 
variable. The increased overlap in the graphs makes the evidence 
supporting the effect of caffeine less convincing. 
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Change in (keeping amount of soda constant) pulse rate Change in (letting amount of soda vary) pulse rate 


After randomly assigning treatments and controlling other variables, 
the two groups should be about the same, except for the treatments. Then a 
difference in the average change in pulse rate must be due either to the 
treatments themselves—or to the random assignment. We can’t say that 
any difference between the average pulse rate changes for students in the 
two groups must be caused by the difference in caffeine. There would be 
some difference, even if both groups received the same type of cola, 
because the random assignment is unlikely to produce two groups that are 
exactly equivalent with respect to every variable that might affect pulse 
rate. 


Designing Experiments: Replication 


Would you trust an experiment with just one student in each group? No, 
because the results would depend too much on which student was assigned 
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to the caffeinated cola. However, if we randomly assign many subjects to 
each group, the effects of chance will balance out, and there will be little 
difference in the average responses in the two groups—unless the 

treatments themselves cause a difference. This is the idea of replication. 


DEFINITION Replication 


In an experiment, replication means using enough experimental units 
to distinguish a difference in the effects of the treatments from chance 
variation due to the random assignment. 


oo In statistics, replication means “use enough subjects.” In other 
fields, the term replication has a different meaning. In these fields, 
replication means conducting an experiment in one setting and then having 
other investigators conduct a similar experiment in a different setting. That 
is, replication means repeatability. 


Experiments: Putting It All Together 


The following box summarizes the four key principles of experimental 
design: comparison, random assignment, control, and replication. 


PRINCIPLES OF EXPERIMENTAL DESIGN 


The basic principles for designing experiments are as follows: 
1. Comparison. Use a design that compares two or more treatments. 


2. Random assignment. Use chance to assign experimental units to 
treatments. Doing so helps create roughly equivalent groups of 
experimental units by balancing the effects of other variables among 
the treatment groups. 


3. Control. Keep other variables the same for all groups, especially 
variables that are likely to affect the response variable. Control helps 
avoid confounding and reduces variability in the response variable. 


4. Replication. Use enough experimental units in each group so that 
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any differences in the effects of the treatments can be distinguished 
from chance differences between the groups. 


Let’s see how these principles were used in designing a famous 
medical experiment. 


EXAMPLE |The Physicians’ Health Study (») 


Principles of experimental design 


-_ 


SCIENCE PHOTO LIBRARY/AGE 
Fotostock 


PROBLEM: Does regularly taking aspirin help protect people against 
heart attacks? The Physicians’ Health Study was a medical experiment 
that helped answer this question. In fact, the Physicians’ Health Study 
looked at the effects of two drugs: aspirin and beta-carotene. 
Researchers wondered if beta-carotene would help prevent some forms 
of cancer. The subjects in this experiment were 21,996 male physicians. 
There were two explanatory variables (factors), each having two levels: 
aspirin (yes or no) and beta-carotene (yes or no). Combinations of the 
levels of these factors form the four treatments shown in the diagram. 
One-fourth of the subjects were assigned at random to each of these 
treatments. 


On odd-numbered days, the subjects took either a tablet that 
contained aspirin or a placebo that looked and tasted like the aspirin but 
had no active ingredient. On even-numbered days, they took either a 
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capsule containing beta-carotene or a placebo. There were several 
response variables—the study looked for heart attacks, several kinds of 
cancer, and other medical outcomes. After several years, 239 of the 
placebo group but only 139 of the aspirin group had suffered heart 
attacks. This difference is large enough to give good evidence that 
taking aspirin does reduce heart attacks.2# It did not appear, however, 
that beta-carotene had any effect on preventing cancer. 


Factor 2: Beta-carotene 


Yes No 
2 C | J . C 5 . 
Factor 1: Aspirin Beta-carotene Aspirin Placebo 
Aspirin i 
No} | s | s 
Placebo Beta-carotene Placebo Placebo 


a. Explain how this experiment used comparison. 


b. Explain the purpose of randomly assigning the physicians to the four 
treatments. 


c. Name two variables that were controlled in this experiment and why 
it was beneficial to control these variables. 


d. Explain how this experiment used replication. What is the purpose of 
replication in this context? 

SOLUTION: 

a. Researchers used a design that compared each of the active 
treatments to a placebo. 


b. Random assignment helped ensure that the four groups of physicians 
were roughly equivalent at the beginning of the experiment. 


c. The experiment used subjects of the same gender and same 
occupation. Using only male physicians helps to reduce the 
variability in the response variables. 


d. There were over 5000 subjects in each treatment group. This helped 
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ensure that the difference in heart attacks was due to the aspirin and 
not to chance variation in the random assignment. 


If women and people with other occupations were included, the results 


might be more variable, making it harder to determine the effects of 


aspirin and beta-carotene. However, using only male physicians means 
we don’t know how females or other males would respond to these 
treatments. 
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The Physicians’ Health Study shows how well-designed experiments 
can yield good evidence that differences in the treatments cause the 
differences we observe in the response. 


Reports in medical journals regularly begin with words like these from a study of a 


flu vaccine given as a nose spray: “This study was a randomized, double-blind, 
placebo-controlled trial. Participants were enrolled from 13 sites across the 
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continental United States between mid-September and mid-November.”“* Doctors 


are supposed to know what this means. Now you know, too. 


CHECK YOUR UNDERSTANDING 


‘. 


\ 
y Many utility companies have introduced programs to encourage 
energy conservation among their customers. An electric company 
considers placing small digital displays in households to show current 
electricity use and what the cost would be if this use continued for a 
month. Will the displays reduce electricity use? One cheaper approach 
is to give customers a chart and information about monitoring their 
electricity use from their outside meter. Would this method work almost 
as well? The company decides to conduct an experiment using 60 
households to compare these two approaches (display, chart) with a 
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group of customers who receive information about energy consumption 
but no help in monitoring electricity use. 


1. Explain why it was important to have a control group that didn’t get 
the display or the chart. 


2. Describe how to randomly assign the treatments to the 60 households. 


3. What is the purpose of randomly assigning treatments in this context? 


Completely Randomized Designs 


The diagram in Figure 4.1 presents the details of the caffeine experiment: 
random assignment, the sizes of the groups and which treatment they 
receive, and the response variable. This type of design is called a 


completely randomized design. 


Group 1 Treatment 1 


— x 
ae 10 students Caffeine ae 
20 volunteer Random Compare change 
— 


students assignment ue in pulse rates 
Pay, Group 2 Treatment 2 
er 


10 students No caffeine 
FIGURE 4.1 Outline of a completely randomized design to compare caffeine and 


no caffeine. 


DEFINITION Completely randomized design 


In a completely randomized design, the experimental units are 
assigned to the treatments completely by chance. 


Although there are good statistical reasons for using treatment groups 
that are about equal in size, the definition of a completely randomized 
design does not require that each treatment be assigned to an equal number 
of experimental units. It does specify that the assignment of treatments 
must occur completely at random. 


EXAMPLE | Chocolate milk and concussions (>) 


Completely randomized design 
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Ann Heath 


PROBLEM: “Concussion-Related Measures Improved in High School 
Football Players Who Drank New Chocolate Milk” announced a recent 


headline.22 In the study, researchers compared a group of concussed 
football players given a new type of chocolate milk with a group of 
concussed football players who received no treatment. 


a. Explain why it isn’t reasonable to conclude that the new type of 
chocolate milk is effective for treating high school football players 
with concussions based on this study. 


b. To test the effectiveness of the new type of chocolate milk, you 
recruit 50 high school football players who suffered a concussion in 
the previous 24 hours to participate in an experiment. Write a few 
sentences describing a completely randomized design for this 
experiment. 


SOLUTION: 


a. It is possible that the group who received the new type of chocolate 
milk improved because they knew they were being treated and 
expected to get better, not because of the new chocolate milk. 


b. Number the players from 1 to 50. Use a random number generator to 
produce 25 different integers from 1 to 50 and give the new type of 
chocolate milk to the players with these numbers. Give regular 
chocolate milk to the remaining 25 players. Compare the concussion- 
related measures for the two groups. 
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AP® EXAM TIP 


If you are asked to describe a completely randomized design, stay away 
from flipping coins. For example, suppose we ask each student in the 
caffeine experiment to toss a coin. If it’s heads, then the student will 
drink the cola with caffeine. If it’s tails, then the student will drink the 
caffeine-free cola. As long as all 20 students toss a coin, this is still a 
completely randomized design. Of course, the two groups are unlikely 
to contain exactly 10 students because it is unlikely that 20 coin tosses 
will result in a perfect 50-50 split between heads and tails. 

The problem arises if we try to force the two groups to have equal 
sizes. Suppose we continue to have students toss coins until one of the 
groups has 10 students and then place the remaining students in the 
other group. In this case, the last two students in line are very likely to 
end up in the same group. However, in a completely randomized 
design, the last two subjects should only have a 50% chance of ending 
up in the same group. 


Randomized Block Designs 


Completely randomized designs are the simplest statistical designs for 
experiments. They illustrate clearly the principles of comparison, random 
assignment, control, and replication. But just as with sampling, there are 
times when the simplest method doesn’t yield the most precise results. 
When a population consists of groups of individuals that are “similar 
within but different between,” a stratified random sample gives a better 
estimate than a simple random sample. This same logic applies in 
experiments. 

Suppose that a mobile phone company is considering two different 
keyboard designs (A and B) for its new smartphone. The company decides 
to perform an experiment to compare the two keyboards using a group of 
10 volunteers. The response variable is typing speed, measured in words 
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per minute. 


How should the company address the fact that four of the volunteers 
already use a smartphone, whereas the remaining six volunteers do not? 
They could use a completely randomized design and hope that the random 
assignment distributes the smartphone users and non-smartphone users 
about evenly between the group using keyboard A and the group using 
keyboard B. Even so, there might be a lot of variability in typing speed 
within both treatment groups because some members of each treatment 
group are more familiar with smartphones than the others. This additional 
variability might make it difficult to detect a difference in the effectiveness 
of the two keyboards. What should the researchers do? 


Zaia Snively 


Because the company knows that experience with smartphones will 
affect typing speed, they could start by separating the volunteers into two 
groups—one with experienced smartphone users and one with 
inexperienced smartphone users. Each of these groups of similar subjects 
is known as a block. Within each block, the company could then randomly 
assign half of the subjects to use keyboard A and the other half to use 
keyboard B. To control other variables, each subject should be given the 
Same passage to type while in a quiet room with no distractions. This 
randomized block design helps account for the variation in typing speed 
that is due to experience with smartphones. 


DEFINITION Block, Randomized block design 


A block is a group of experimental units that are known before the 
experiment to be similar in some way that is expected to affect the 
response to the treatments. 


In a randomized block design, the random assignment of 
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experimental units to treatments is carried out separately within each 
block. 


Figure 4.2 outlines the randomized block design for the smartphone 
experiment. The subjects are first separated into blocks based on their 
experience with smartphones. Then the two treatments are randomly 
assigned within each block. 


Assignment to blocks 
is not random 


Keyboard A 
=2 
Smartphone as i” Na. 
users te Random Compare 
os assignment typing speed 
n=4 
10 ff oe Keyboard B a he 


n=2 ‘ 
volunteers Combine 
Non-smartphone 


Keyboard A adjusted results 
a n=3 7 ug and compare 
Compare 
ei assignment typing speed 
n=6 
i Keyboard B pf 
n=3 


FIGURE 4.2 Outline of a randomized block design for the smartphone 
experiment. The blocks consist of volunteers who have used smartphones and 


Random 


volunteers who have not used smartphones. The treatments are keyboard A and 
keyboard B. 


Using a randomized block design allows us to account for the variation 
in the response that is due to the blocking variable of smartphone 
experience. This makes it easier to determine if one treatment is really 
more effective than the other. 


To see how blocking helps, let’s look at the results of the smartphone 
experiment. In the block of 4 smartphone users, 2 were randomly assigned 
to use keyboard A and the other 2 were assigned to use keyboard B. 
Likewise, in the block of 6 non-smartphone users, 3 were randomly 
assigned to use keyboard A and the other 3 were assigned to use keyboard 
B. Each of the 10 volunteers typed the same passage and the typing speed 
was recorded. Here are the results: 


~ T 
15 20 25 30 35 40 45 50 
Typing speed (wpm) 
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There is some evidence that keyboard A results in higher typing 
speeds, but the evidence isn’t that convincing. Enough overlap occurs in 
the two distributions that the differences might simply be due to the chance 
variation in the random assignment. 


If we compare the results for the two keyboards within each block, 
however, a different story emerges. Among the 4 smartphone users 
(indicated by the red squares), keyboard A was the clear winner. Likewise, 
among the 6 non- smartphone users (indicated by the black dots), keyboard 
A was also the clear winner. 


* Smartphone user ¢ Non-smartphone user 


15 20 25 30 35 40 45 50 
Typing speed (wpm) 

The overlap in the first set of dotplots was due almost entirely to the 
variation in smartphone experience—smartphone users were generally 
faster than non-smartphone users, regardless of which keyboard they used. 
In fact, the average typing speed for the smartphone users was 40, while 
the average typing speed for non-smartphone users was only 26, a 
difference of 14 words per minute. To account for the variation created by 
the difference in smartphone experience, let’s subtract 14 from each of the 
typing speeds in the block of smartphone users to “even the playing field.” 
Here are the results: 


» Smartphone user © Non-smartphone user 


15 20 25 30 35 40 45 50 
Adjusted typing speed (wpm) 

Because we accounted for the variation due to the difference in 
smartphone experience, the variation in each of the distributions has been 
reduced. There is now much less overlap between the two distributions, 
meaning that the evidence in favor of keyboard A is much more 
convincing. When blocks are formed wisely, it is easier to find convincing 
evidence that one treatment is more effective than another. 
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AP® EXAM TIP 


Don’t mix the language of experiments and the language of sample 
surveys or other observational studies. You will lose credit for saying 


things like “use a randomized block design to select the sample for this 
survey” or “this experiment suffers from nonresponse because some 
subjects dropped out during the study.” 


The idea of blocking is an important additional principle of 
experimental design. A wise experimenter will form blocks based on the 
most important unavoidable sources of variation among the experimental 
units. In other words, the experimenter will form blocks using the 
variables that are the best predictors of the response variable. Random 
assignment will then average out the effects of the remaining other 
variables and allow a fair comparison of the treatments. The moral of the 
story is: control what you can, block on what you can’t control, and 
randomize to create comparable groups. 


EXAMPLE | Should I use the popcorn button? » 


Blocking in an experiment 


Ann Health 


PROBLEM: A popcorn lover wants to determine if it is better to use 
the “popcorn button” on her microwave oven or use the amount of time 
recommended on the bag of popcorn. To measure how well each 
method works, she will count the number of unpopped kernels 
remaining after popping. To obtain the experimental units, she goes to 
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the store and buys 10 bags each of 4 different varieties of microwave 
popcorn (butter, cheese, natural, and kettle corn), for a total of 40 bags. 


a. Describe a randomized block design for this experiment. Justify your 
choice of blocks. 


b. Explain why a randomized block design might be preferable to a 
completely randomized design for this experiment. 


SOLUTION: 


a. Form blocks based on variety, because the number of unpopped 
kernels is likely to differ by variety. Randomly assign 5 bags of each 
variety to the popcorn button treatment and 5 to the timed treatment 
by placing all 10 bags of a particular variety in a large box. Shake the 
box, pick 5 bags without looking, and assign them to be popped 
using the popcorn button. The remaining 5 bags will be popped using 
the instructions on the bags. Repeat this process for the remaining 3 
varieties. After popping each of the 40 bags in random order, count 
the number of unpopped kernels in each bag and compare the results 
within each variety. Then combine the results from the 4 varieties 
after accounting for the difference in average response for each 
variety. 


It is important to pop the bags in random order so that changes over 
time (e.g., temperature, humidity) aren’t confounded with the 
explanatory variable. For example, if the 20 “popcorn button” bags 


are popped last when the room temperature is greater, we wouldn’t 
know if using the popcorn button or the warmer temperature was the 
cause of a difference in the number of unpopped kernels. 


b. A randomized block design accounts for the variability in the 
number of unpopped kernels created by the different varieties of 
popcorn (butter, cheese, natural, kettle). This makes it easier to 
determine if using the microwave button is more effective for 
reducing the number of unpopped kernels. 
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Another way to address the variability in unpopped kernels created by 
the different varieties is to use only one variety of popcorn in the 
experiment. Because variety of popcorn is no longer a variable, it will not 
be a source of variability. Of course, this means that the results of the 
experiment only apply to that one variety of popcorn—not ideal for the 
popcorn lover in the example! 


MATCHED PAIRS DESIGN A common type of randomized block 
design for comparing two treatments is a matched pairs design. The idea 
is to create blocks by matching pairs of similar experimental units. The 
random assignment of subjects to treatments is done within each matched 
pair. Just as with other forms of blocking, matching helps account for the 
variation due to the variable(s) used to form the pairs. 


DEFINITION Matched pairs design 


A matched pairs design is a common experimental design for 
comparing two treatments that uses blocks of size 2. In some matched 
pairs designs, two very similar experimental units are paired and the 
two treatments are randomly assigned within each pair. In others, 
each experimental unit receives both treatments in a random order. 


Suppose we want to investigate if listening to classical music while 
taking a math test affects performance. A total of 30 students in a math 
class volunteer to take part in the experiment. The difference in 
mathematical ability among the volunteers is likely to create additional 
variation in the test scores, making it harder to see the effect of classical 
music. To account for this variation, we could pair the students by their 
grade in the class—the two students with the highest grades are paired 
together, the two students with the next highest grades are paired together, 
and so on. Within each pair, one student is randomly assigned to take a 
math test while listening to classical music and the other member of the 
pair is assigned to take the math test in silence. 
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Sometimes, each “pair” in a matched pairs design consists of just one 
experimental unit that gets both treatments in random order. In the 
experiment about the effect of listening to classical music, we could have 
each student take a math test in both conditions. To decide the order, we 
might flip a coin for each student. If the coin lands on heads, the student 
takes a math test with classical music playing today and a similar math test 
without music playing tomorrow. If it lands on tails, the student does the 
Opposite—no music today and classical music tomorrow. 


Randomizing the order of treatments is important to avoid 
confounding. Suppose everyone did the classical music treatment on the 
first day and the no-music treatment on the second day, but the air 
conditioner wasn’t working on the second day. We wouldn’t know if any 
difference in mean test score was due to the difference in treatment or the 
difference in room temperature. 


EXAMPLE | Will an additive improve my mileage? > 
Matched pairs design 


PROBLEM: A consumer organization wants to know if using a certain 
fuel additive increases the fuel efficiency (in miles per gallon, or mpg) 
of cars. A total of 20 cars of different types are available for testing. 
Design an experiment that uses a matched pairs design to investigate 
this question. Explain your method of pairing. 


SOLUTION: 
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Give each car both treatments. It is reasonable to think that some cars 
are more fuel efficient than others, so using each car as its own “pair” 
accounts for the variation in fuel efficiency in the experimental units. 
For each car, randomly assign the order in which the treatments are 
assigned by flipping a coin. Heads indicates using the additive first and 
no additive second. Tails indicates using no additive first and then the 
additive second. For each car, record the fuel efficiency (mpg) after 
using each treatment. 
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In the preceding example, it is also possible to form pairs of two 
similar cars. For instance, we could pair together the two most fuel- 
efficient cars, the next two most fuel-efficient cars, and so on. This is less 
ideal, however, because there will still be some differences between the 
members of each pair that may cause additional variation in the results. 
Using the same car twice creates perfectly matched “pairs,” and it also 
doubles the number of pairs used in the experiment. Both these features 
make it easier to find convincing evidence that the gas additive is 
effective, if it really is effective. 


CHECK YOUR UNDERSTANDING 


. 


y Researchers would like to design an experiment to compare the 
effectiveness of three different advertisements for a new television 
series featuring the works of Jane Austen. There are 300 volunteers 
available for the experiment. 


1. Describe a completely randomized design to compare the 
effectiveness of the three advertisements. 


2. Describe a randomized block design for this experiment. Justify your 
choice of blocks. 


3. Why might a randomized block design be preferable in this context? 


969 


Section 4.2 Summary 


e Statistical studies often try to show that changing one variable (the 
explanatory variable) causes changes in another variable (the response 
variable). Variables are confounded when their effects on a response 
variable can’t be distinguished from each other. 


e We can produce data to answer specific questions using observational 
studies or experiments. An observational study gathers data on 
individuals as they are. Experiments actively do something to people, 
animals, or objects in order to measure their response. Experiments are 
the best way to show cause and effect. 


e In an experiment, we impose one or more treatments on a group of 
experimental units (sometimes called subjects if they are human). 
Each treatment is a combination of the levels of the explanatory 
variables (also called factors). 


e Some experiments give a placebo (fake treatment) to a control group. 
That helps prevent confounding due to the placebo effect, whereby 
some patients get better because they expect the treatment to work. 


e Many behavioral and medical experiments are double-blind. That is, 
neither the subjects nor those interacting with them and measuring their 
responses know who is receiving which treatment. If one group knows 
and the other doesn’t, then the experiment is single-blind. 


e The basic principles of experimental design are: 
« Comparison: Use a design that compares two or more treatments. 


: Random assignment: Use chance (slips of paper, a random number 
generator, a table of random digits) to assign experimental units to 
treatments. This helps create roughly equivalent groups before 
treatments are imposed. 


= Control: Keep other variables the same for all groups. Control helps 
avoid confounding and reduces the variation in responses, making it 
easier to decide if a treatment is effective. 


= Replication: Impose each treatment on enough experimental units so 
that the effects of the treatments can be distinguished from chance 
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differences between the groups. 


e Ina completely randomized design, the experimental units are 
assigned to the treatments completely by chance. 


e A randomized block design forms groups (blocks) of experimental 
units that are similar with respect to a variable that is expected to affect 
the response. Treatments are assigned at random within each block. 
Responses are then compared within each block and combined with the 
responses of other blocks after accounting for the differences between 
the blocks. When blocks are chosen wisely, it is easier to determine if 
one treatment is more effective than another. 


e A matched pairs design is a common form of randomized block design 
for comparing two treatments. In some matched pairs designs, each 
subject receives both treatments in a random order. In others, two very 
similar subjects are paired, and the two treatments are randomly 
assigned within each pair. 


Section 4.2 Exercises 


43. pg 243 |») Good for the gut? Is fish good for the gut? Researchers 


tracked 22,000 male physicians for 22 years. Those who reported 
eating seafood of any kind at least 5 times per week had a 40% lower 
risk of colon cancer than those who said they ate seafood less than 
once a week. Explain how confounding makes it unreasonable to 
conclude that eating seafood causes a reduction in the risk of colon 
cancer, based on this study.24 


44. Straight A’s now, healthy later A study by Pamela Herd of the 
University of Wisconsin—Madison found a link between high school 
grades and health. Analyzing data from the Wisconsin Longitudinal 
Study, which has tracked the lives of thousands of Wisconsin high 
school graduates from the class of 1957, Herd found that students 
with higher grade-point averages were more likely to say they were in 
excellent or very good health in their early 60s. Explain how 
confounding makes it unreasonable to conclude that people will live 
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healthier lives if they increase their GPA, based on this study.22 


pg 244 |») Snacking and TV Does the type of program people 


watch influence how much they eat? A total of 94 college students 
were randomly assigned to one of three treatments: watching 20 
minutes of a Hollywood action movie (The Island), watching the 
same 20-minute excerpt of the movie with no sound, and watching 20 
minutes of an interview program (Charlie Rose). While watching, 
participants were given snacks (M&M’S®, cookies, carrots, and 
grapes) and allowed to eat as much as they wanted. Subjects who 
watched the highly stimulating excerpt from The Island ate 65% more 
calories than subjects who watched Charlie Rose. Participants who 
watched the silent version of The Island ate 46% more calories than 
those who watched Charlie Rose.2® Identify the explanatory and 
response variables in this study. Then determine if it is an experiment 
or an observational study. Explain your reasoning. 


Learning biology with computers An educator wants to compare 
the effectiveness of computer software for teaching biology with that 
of a textbook presentation. She gives a biology pretest to each student 
in a group of high school juniors, then randomly divides them into 
two groups. One group uses the computer, and the other studies the 
text. At the end of the year, she tests all the students again and 
compares the increase in biology test scores in the two groups. 
Identify the explanatory and response variables in this study. Then 
determine if it is an experiment or an observational study. Explain 
your reasoning. 


Child care and aggression A study of child care enrolled 1364 
infants and followed them through their sixth year in school. Later, 
the researchers published an article in which they stated that “the 
more time children spent in child care from birth to age 4%, the more 
adults tended to rate them, both at age 4% and at kindergarten, as less 
likely to get along with others, as more assertive, as disobedient, and 


as aggressive.”22 


a. What are the explanatory and response variables? 


ole 


b. Is this an observational study or an experiment? Justify your 
answer. 


c. Does this study show that child care makes children more 
aggressive? Explain your reasoning. 


48. Chocolate and happy babies A University of Helsinki (Finland) 


49. 


50. 


study wanted to determine if chocolate consumption during 
pregnancy had an effect on infant temperament at age 6 months. 
Researchers began by asking 305 healthy pregnant women to report 
their chocolate consumption. Six months after birth, the researchers 
asked mothers to rate their infants’ temperament using the traits of 
smiling, laughter, and fear. The babies born to women who had been 
eating chocolate daily during pregnancy were found to be more active 
and “positively reactive”’—a measure that the investigators said 
encompasses traits like smiling and laughter.22 


a. What are the explanatory and response variables? 


b. Was this an observational study or an experiment? Justify your 
answer. 


c. Does this study show that eating chocolate regularly during 
pregnancy helps produce infants with good temperament? Explain 
your reasoning. 


pg 246 |») Growing in the shade The ability to grow in shade may 


help pine trees found in the dry forests of Arizona to resist drought. 
How well do these pines grow in shade? Investigators planted pine 
seedlings in a greenhouse in either full light, light reduced to 25% of 
normal by shade cloth, or light reduced to 5% of normal. At the end 
of the study, they dried the young trees and weighed them. Identify 
the experimental units and the treatments. 


Sealing your teeth Many children have their molars sealed to help 
prevent cavities. In an experiment, 120 children aged 6—8 were 
randomly assigned to a control group, a group in which sealant was 
applied and reapplied periodically for 36 months, and a group in 
which fluoride varnish was applied and reapplied periodically for 42 
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months. After 9 years, the percent of initially healthy molars with 
cavities was calculated for each group.22 Identify the experimental 
units and the treatments. 


pg 246 |p) Improving response rate How can we reduce the rate 


of refusals in telephone surveys? Most people who answer at all listen 
to the interviewer’s introductory remarks and then decide whether to 
continue. One study made telephone calls to randomly selected 
households to ask opinions about the next election. In some calls, the 
interviewer gave her name; in others, she identified the university she 
was representing; and in still others, she identified both herself and 
the university. For each type of call, the interviewer either did or did 
not offer to send a copy of the final survey results to the person 
interviewed. 


a. List the factors in this experiment and state how many levels each 
factor has. 


b. If the researchers used every possible combination to form the 


treatments, how many treatments were included in the experiment? 


c. List two of the treatments. 


Fabric science A maker of fabric for clothing is setting up a new line 
to “finish” the raw fabric. The line will use either metal rollers or 
natural-bristle rollers to raise the surface of the fabric; a dyeing-cycle 
time of either 30 or 40 minutes; and a temperature of either 150°C or 
175°C. Three specimens of fabric will be subjected to each treatment 
and scored for quality. 


a. List the factors in this experiment and state how many levels each 
factor has. 


b. If the researchers used every possible combination to form the 


treatments, how many treatments were included in the experiment? 


c. List two of the treatments. 


Want a snack? Can snacking on fruit rather than candy reduce later 
food consumption? Researchers randomly assigned 12 women to eat 
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either 65 calories of berries or 65 calories of candy. Two hours later, 
all 12 women were given an unlimited amount of pasta to eat. The 
researchers recorded the amount of pasta consumed by each subject. 
The women who ate the berries consumed 133 fewer calories, on 
average. Identify the explanatory and response variables, the 
experimental units, and the treatments. 


Pricey pizza? The cost of a meal might affect how customers 
evaluate and appreciate food. To investigate, researchers worked with 
an Italian all-you-can-eat buffet to perform an experiment. A total of 
139 subjects were randomly assigned to pay either $4 or $8 for the 
buffet and then asked to rate the quality of the pizza on a 9-point 
scale. Subjects who paid $8 rated the pizza 11% higher than those 
who paid only $4.42 Identify the explanatory and response variables, 
the experimental units, and the treatments. 


pg 248 |») Oils and inflammation The extracts of avocado and 


soybean oils have been shown to slow cell inflammation in test tubes. 
Will taking avocado and soybean unsaponifiables (called ASU) help 
relieve pain for subjects with joint stiffness due to arthritis? In an 
experiment, 345 men and women were randomly assigned to receive 
either 300 milligrams of ASU daily for three years or a placebo daily 
for three years. Explain why it was necessary to include a control 
group in this experiment. 


Supplements for testosterone As men age, their testosterone levels 
gradually decrease. This may cause a reduction in energy, an increase 
in fat, and other undesirable changes. Do testosterone supplements 
reverse some of these effects? A study in the Netherlands assigned 
237 men aged 60 to 80 with low or low-normal testosterone levels to 
either a testosterone supplement or a placebo.“ Explain why it was 
necessary to include a control group in this experiment. 


Cocoa and blood flow A study of blood flow involved 27 healthy 
people aged 18 to 72. Each subject consumed a cocoa beverage 
containing 900 milligrams of flavonols daily for 5 days. Using a 
finger cuff, blood flow was measured on the first and fifth days of the 
study. After 5 days, researchers measured what they called 
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“significant improvement” in blood flow and the function of the cells 
that line the blood vessels.42 What flaw in the design of this 
experiment makes it impossible to say if the cocoa really caused the 
improved blood flow? Explain your answer. 


Reducing unemployment Will cash bonuses speed the return to 
work of unemployed people? A state department of labor notes that 
last year 68% of people who filed claims for unemployment insurance 
found a new job within 15 weeks. As an experiment, this year the 
state offers $500 to people filing unemployment claims if they find a 
job within 15 weeks. The percent who do so increases to 77%. What 
flaw in the design of this experiment makes it impossible to say if the 
bonus really caused the increase? Explain your answer. 


pg 250 |») More oil and inflammation Refer to Exercise 55. 


Could blinding be used in this experiment? Explain your reasoning. 
Why is blinding an important consideration in this context? 


More testosterone Refer to Exercise 56. Could blinding be used in 
this experiment? Explain your reasoning. Why is blinding an 
important consideration in this context? 


Meditation for anxiety An experiment that claimed to show that 
meditation lowers anxiety proceeded as follows. The experimenter 
interviewed the subjects and rated their level of anxiety. Then the 
subjects were randomly assigned to two groups. The experimenter 
taught one group how to meditate and they meditated daily for a 
month. The other group was simply told to relax more. At the end of 
the month, the experimenter interviewed all the subjects again and 
rated their anxiety level. The meditation group now had less anxiety. 
Psychologists said that the results were suspect because the ratings 
were not blind. Explain what this means and how lack of blindness 
could affect the reported results. 


Side effects Even if an experiment is double-blind, the blinding might 
be compromised if side effects of the treatments differ. For example, 
suppose researchers at a skin-care company are comparing their new 
acne treatment against that of the leading competitor. Fifty subjects 
are assigned at random to each treatment, and the company’s 
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researchers will rate the improvement for each of the 100 subjects. 
The researchers aren’t told which subjects received which treatments, 
but they know that their new acne treatment causes a slight reddening 
of the skin. How might this knowledge compromise the blinding? 
Explain why this is an important consideration in the experiment. 


pg 251 |») Layoffs and “survivor guilt” Workers who survive a 


layoff of other employees at their location may suffer from “survivor 
guilt.” A study of survivor guilt and its effects used as subjects 120 
students who were offered an opportunity to earn extra course credit 
by doing proofreading. Each subject worked in the same cubicle as 
another student, who was an accomplice of the experimenters. At a 
break midway through the work, one of three things happened: 
Treatment 1: The accomplice was told to leave; it was explained that 
this was because she performed poorly. 

Treatment 2: It was explained that unforeseen circumstances meant 
there was only enough work for one person. By “chance,” the 
accomplice was chosen to be laid off. 

Treatment 3: Both students continued to work after the break. 

The subjects’ work performance after the break was compared with 
their performance before the break. Overall, subjects worked harder 
when told the other student’s dismissal was random.“4 Describe how 
you would randomly assign the subjects to the treatments 


a. using slips of paper. 
b. using technology. 


c. using Table D. 


Precise offers People often use round prices as first offers in a 
negotiation. But would a more precise number suggest that the offer 
was more reasoned and informed? In an experiment, 238 adults 
played the role of a person selling a used car. Each adult received one 
of three initial offers: $2000, $1865 (a precise under-offer), and 
$2135 (a precise over-offer). After hearing the initial offer, each 
subject made a counter-offer. The difference in the initial offer and 
counter-offer was the largest in the group that received the $2000 
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offer.42 Describe how the researchers could have randomly assigned 
the subjects to the treatments 


a. using slips of paper. 
b. using technology. 


c. using Table D. 


Stronger players A football coach hears that a new exercise program 
will increase upper-body strength better than lifting weights. He is 
eager to test this new program in the off-season with the players on 
his high school team. The coach decides to let his players choose 
which of the two treatments they will undergo for 3 weeks—exercise 
or weight lifting. He will use the number of push-ups a player can do 
at the end of the experiment as the response variable. Which principle 
of experimental design does the coach’s plan violate? Explain how 
this violation could lead to confounding. 


Killing weeds A biologist would like to determine which of two 
brands of weed killer, X or Y, is less likely to harm the plants in a 
garden at the university. Before spraying near the plants, the biologist 
decides to conduct an experiment using 24 individual plants. Which 
of the following two plans for randomly assigning the treatments 
should the biologist use? Why? 

Plan A: Choose the 12 healthiest-looking plants. Then flip a coin. If it 
lands heads, apply Brand X weed killer to these plants and Brand Y 
weed killer to the remaining 12 plants. If it lands tails, do the 
opposite. 

Plan B: Choose 12 of the 24 plants at random. Apply Brand X weed 
killer to those 12 plants and Brand Y weed killer to the remaining 12 
plants. 


pg 254 |») Boosting preemies Do blood-building drugs help brain 


development in babies born prematurely? Researchers randomly 
assigned 53 babies, born more than a month premature and weighing 
less than 3 pounds, to one of three groups. Babies either received 
injections of erythropoietin (EPO) three times a week, darbepoetin 
once a week for several weeks, or no treatment. Results? Babies who 
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got the medicines scored much better by age 4 on measures of 
intelligence, language, and memory than the babies who received no 
treatment.*° 


a. Explain how this experiment used comparison. 


b. Explain the purpose of randomly assigning the babies to the three 
treatments. 


c. Name two variables that were controlled in this experiment and 
why it was beneficial to control these variables. 


d. Explain how this experiment used replication. What is the purpose 
of replication in this context? 


The effects of day care Does day care help low-income children stay 
in school and hold good jobs later in life? The Carolina Abecedarian 
Project (the name suggests the ABCs) has followed a group of 111 
children for over 40 years. Back then, these individuals were all 
healthy but low-income black infants in Chapel Hill, North Carolina. 
All the infants received nutritional supplements and help from social 
workers. Half were also assigned at random to an intensive preschool 
program. Results? Children who were assigned to the preschool 
program had higher IQ’s, higher standardized test scores, and were 
less likely to repeat a grade in school.4Z 


a. Explain how this experiment used comparison. 


b. Explain the purpose of randomly assigning the infants to the two 
treatments. 


c. Name two variables that were controlled in this experiment and 
why it was beneficial to control these variables. 


d. Explain how this experiment used replication. What is the purpose 
of replication in this context? 
pg 256 |») Treating prostate disease A large study used records 


from Canada’s national health care system to compare the 
effectiveness of two ways to treat prostate disease. The two 
treatments are traditional surgery and a new method that does not 
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require surgery. The records described many patients whose doctors 
had chosen what method to use. The study found that patients treated 


by the new method were more likely to die within 8 years.48 


a. Further study of the data showed that this conclusion was wrong. 
The extra deaths among patients who were treated with the new 
method could be explained by other variables. What other 
variables might be confounded with a doctor’s choice of surgical 
or nonsurgical treatment? 


b. You have 300 prostate patients who are willing to serve as subjects 


in an experiment to compare the two methods. Write a few 
sentences describing a completely randomized design for this 
experiment. 


Diet soda and pregnancy A large study of 3000 Canadian children 
and their mothers found that the children of mothers who drank diet 
soda daily during pregnancy were twice as likely to be overweight at 
age 1 than children of mothers who avoided diet soda during 
pregnancy.42 


a. A newspaper article about this study had the headline “Diet soda, 
pregnancy: Mix may fuel childhood obesity.” This headline 
suggests that there is a cause- and-effect relationship between diet 
soda consumption during pregnancy and the weight of the children 
1 year after birth. However, this relationship could be explained by 
other variables. What other variables might be confounded with a 
mother’s consumption of diet soda during pregnancy? 


b. You have 300 pregnant mothers who are willing to serve as 


subjects in an experiment that compares three treatments during 
pregnancy: no diet soda, one diet soda per day, and two diet sodas 
per day. Write a few sentences describing a completely 
randomized design for this experiment. 


pg 259 |) A fruitful experiment A citrus farmer wants to know 


which of three fertilizers (A, B, and C) is most effective for 
increasing the number of oranges on his trees. He is willing to use 30 
mature trees of various sizes from his orchard in an experiment with a 
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randomized block design. 


a. Describe a randomized block design for this experiment. Justify 
your choice of blocks. 


b. Explain why a randomized block design might be preferable to a 


completely randomized design for this experiment. 


In the cornfield An agriculture researcher wants to compare the yield 
of 5 corn varieties: A, B, C, D, and E. The field in which the 
experiment will be carried out increases in fertility from north to 
south. The researcher therefore divides the field into 25 plots of equal 
size, arranged in 5 east—west rows of 5 plots each, as shown in the 
diagram. 


North 


a. Describe a randomized block design for this experiment. Justify 
your choice of blocks. 


b. Explain why a randomized block design might be preferable to a 


completely randomized design for this experiment. 


Doctors and nurses Nurse-practitioners are nurses with advanced 
qualifications who often act much like primary-care physicians. Are 
they as effective as doctors at treating patients with chronic 
conditions? An experiment was conducted with 1316 patients who 
had been diagnosed with asthma, diabetes, or high blood pressure. 
Within each condition, patients were randomly assigned to either a 
doctor or a nurse-practitioner. The response variables included 
measures of the patients’ health and of their satisfaction with their 
medical care after 6 months.22 


a. Which are the blocks in this experiment: the different diagnoses 
(asthma, diabetes, or high blood pressure) or the type of care 
(nurse or doctor)? Why? 
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b. Explain why a randomized block design is preferable to a 
completely randomized design in this context. 


c. Suppose the experiment used only diabetes patients, but there were 
still 1316 subjects willing to participate. What advantage would 
this offer? What disadvantage? 


74, Comparing cancer treatments The progress of a type of cancer 
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differs in women and men. Researchers want to design an experiment 
to compare three therapies for this cancer. They recruit 500 male and 
300 female patients who are willing to serve as subjects. 


a. Which are the blocks in this experiment: the three cancer therapies 
or the two sexes? Why? 


b. What are the advantages of a randomized block design over a 
completely randomized design using these 800 subjects? 


c. Suppose the researchers had 800 male and no female subjects 
available for the study. What advantage would this offer? What 
disadvantage? 


Aw, rats! A nutrition experimenter intends to compare the weight 
gain of newly weaned male rats fed Diet A with that of rats fed Diet 
B. To do this, she will feed each diet to 10 rats. She has available 10 
rats from one litter and 10 rats from a second litter. Rats in the first 
litter appear to be slightly healthier. 


a. If the 10 rats from Litter 1 were fed Diet A, then genetics and type 
of diet would be confounded. Explain this statement carefully. 


b. Describe how to randomly assign the rats to treatments using a 


randomized block design with litters as blocks. 


c. Use technology or Table D to carry out the random assignment. 


Comparing weight-loss treatments A total of 20 overweight 
females have agreed to participate in a study of the effectiveness of 
four weight-loss treatments: A, B, C, and D. The researcher first 
calculates how overweight each subject is by comparing the subject’s 
actual weight with her “ideal” weight. The subjects and their excess 
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weights in pounds are as follows: 


Bimbaum 35 Hernandez 25 Moses 25 Smith 29 
Brown 34 Jackson 33 Nevesky 39 Stall 33 
Brunk 30 Kendall 28 Obrach 30 Tran 35 
Cruz 34 Loren 32 Rodriguez 30 Wilansky 42 
Deng 24 Mann 28 Santiago 27 Williams 22 


The response variable is the weight lost after 8 weeks of treatment. 


a. If the 5 most overweight women were assigned Treatment A, the 
next 5 most overweight women were assigned Treatment B, and so 
on, then the amount overweight and type of treatment would be 
confounded. Explain this statement carefully. 


b. Describe how to randomly assign the women to treatments using a 


randomized block design. Use blocks of size 4 formed by the 
amount overweight. 


c. Use technology or Table D to carry out the random assignment. 


pg 260 |») SAT preparation A school counselor wants to compare 


the effectiveness of an online SAT preparation program with an in- 
person SAT preparation class. For an experiment, the counselor 
recruits 30 students who have already taken the SAT once. The 
response variable will be the improvement in SAT score. 


a. Design an experiment that uses a completely randomized design to 
investigate this question. 


b. Design an experiment that uses a matched pairs design to 


investigate this question. Explain your method of pairing. 


c. Which design do you prefer? Explain your answer. 


Valve surgery Medical researchers want to compare the success rate 
of anew non-invasive method for replacing heart valves using a 
cardiac catheter with traditional open-heart surgery. They have 40 
male patients, ranging in age from 55 to 75, who need valve 
replacement. One of several response variables will be the percentage 
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of blood that flows backward—in the wrong direction—through the 
valve on each heartbeat. 


a. Design an experiment that uses a completely randomized design to 
investigate this question. 


b. Design an experiment that uses a matched pairs design to 
investigate this question. Explain your method of pairing. 


c. Which design do you prefer? Explain your answer. 


Look, Ma, no hands! Does talking on a hands-free cell phone 
distract drivers? Researchers recruit 40 student subjects for an 
experiment to investigate this question. They have a driving simulator 
equipped with a hands-free phone for use in the study. Each subject 
will complete two sessions in the simulator: one while talking on the 
hands-free phone and the other while just driving. The order of the 
two sessions for each subject will be determined at random. The 
route, driving conditions, and traffic flow will be the same in both 
sessions. 


a. What type of design did the researchers use in their study? 


b. Explain why the researchers chose this design instead of a 
completely randomized design. 


c. Why is it important to randomly assign the order of the treatments? 


d. Explain how and why researchers controlled for other variables in 
this experiment. 


Chocolate gets my heart pumping Cardiologists at Athens Medical 
School in Greece wanted to test if chocolate affects blood vessel 
function. The researchers recruited 17 healthy young volunteers, who 
were each given a 3.5-ounce bar of dark chocolate, either bittersweet 
or fake chocolate. On another day, the volunteers received the other 
treatment. The order in which subjects received the bittersweet and 
fake chocolate was determined at random. The subjects had no 
chocolate outside the study, and investigators didn’t know if a subject 
had eaten the real or the fake chocolate. An ultrasound was taken of 
each volunteer’s upper arm to observe the functioning of the cells in 
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the walls of the main artery. The researchers found that blood vessel 
function was improved when the subjects ate bittersweet chocolate, 
and that there were no such changes when they ate the placebo (fake 


chocolate).24 


a. What type of design did the researchers use in their study? 


b. Explain why the researchers chose this design instead of a 
completely randomized design. 


c. Why is it important to randomly assign the order of the treatments 
for the subjects? 


d. Explain how and why researchers controlled for other variables in 
this experiment. 


Got deodorant? A group of students wants to perform an experiment 
to determine whether Brand A or Brand B deodorant lasts longer. One 
group member suggests the following design: Recruit 40 student 
volunteers—20 male and 20 female. Separate by gender, because 
male and female bodies might respond differently to deodorant. Give 
all the males Brand A deodorant and all the females Brand B. Have 
the principal judge how well the deodorant is still working at the end 
of the school day on a 0 to 10 scale. Then compare ratings for the two 
treatments. 


a. Identify any flaws you see in the proposed design for this 
experiment. 


b. Describe how you would design the experiment. Explain how your 
design addresses each of the problems you identified in part (a). 


Close shave Which of two brands (X or Y) of electric razor shaves 
closer? Researchers want to design and carry out an experiment to 
answer this question using 50 adult male volunteers. Here’s one idea: 
Have all 50 subjects shave the left sides of their faces with the Brand 
X razor and shave the right sides of their faces with the Brand Y 
razor. Then have each man decide which razor gave the closer shave 
and compile the results. 
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Identify any flaws you see in the proposed design for this 
experiment. 


. Describe how you would design the experiment. Explain how your 


design addresses each of the problems you identified in part (a). 


Multiple Choice Select the best answer for Exercises 83—90. 


83. Can a vegetarian or low-salt diet reduce blood pressure? Men with 
high blood pressure are assigned at random to one of four diets: (1) 
normal diet with unrestricted salt; (2) vegetarian with unrestricted 
salt; (3) normal with restricted salt; and (4) vegetarian with restricted 
salt. This experiment has 
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a. 
b. 
C. 
d. 


e. 


one factor, the type of diet. 

two factors, high blood pressure and type of diet. 

two factors, normal/vegetarian diet and unrestricted/restricted salt. 
three factors, men, high blood pressure, and type of diet. 


four factors, the four diets being compared. 


In the experiment of the preceding exercise, the subjects were 
randomly assigned to the different treatments. What is the most 
important reason for this random assignment? 


a. 


Random assignment eliminates the effects of other variables such 
as stress and body weight. 


. Random assignment is a good way to create groups of subjects that 


are roughly equivalent at the beginning of the experiment. 


Random assignment makes it possible to make a conclusion about 
all men. 


. Random assignment reduces the amount of variation in blood 


pressure. 


Random assignment prevents the placebo effect from ruining the 
results of the study. 
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To investigate if standing up while studying affects performance in an 
algebra class, a teacher assigns half of the 30 students in his class to 
stand up while studying and assigns the other half to not stand up 
while studying. To determine who receives which treatment, the 
teacher identifies the two students who did best on the last exam and 
randomly assigns one to stand and one to not stand. The teacher does 
the same for the next two highest-scoring students and continues in 
this manner until each student is assigned a treatment. Which of the 
following best describes this plan? 


a. This is an observational study. 

b. This is an experiment with blocking. 

c. This is a completely randomized experiment. 
d. This is a stratified random sample. 


e. This is a cluster sample. 


A gardener wants to try different combinations of fertilizer (none, 1 
cup, 2 cups) and mulch (none, wood chips, pine needles, plastic) to 
determine which combination produces the highest yield for a variety 
of green beans. He has 60 green-bean plants to use in the experiment. 
If he wants an equal number of plants to be assigned to each 
treatment, how many plants will be assigned to each treatment? 


a 1 
b. 3 
c. 4 
d. 5 


e. 12 


Corn variety 1 yielded 140 bushels per acre last year at a research 

farm. This year, corn variety 2, planted in the same location, yielded 
only 110 bushels per acre. Based on these results, is it reasonable to 
conclude that corn variety 1 is more productive than corn variety 2? 
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d. 


e. 


Yes, because 140 bushels per acre is greater than 110 bushels per 
acre. 


. Yes, because the study was done at a research farm. 


No, because there may be other differences between the two years 
besides the corn variety. 


No, because there was no use of a placebo in the experiment. 


No, because the experiment wasn’t double-blind. 


A report in a medical journal notes that the risk of developing 
Alzheimer’s disease among subjects who regularly opted to take the 
drug ibuprofen was about half the risk of those who did not. Is this 
good evidence that ibuprofen is effective in preventing Alzheimer’s 
disease? 


a. 


e. 


Yes, because the study was a randomized, comparative 
experiment. 


. No, because the effect of ibuprofen is confounded with the placebo 


effect. 


Yes, because the results were published in a reputable professional 
journal. 


. No, because this is an observational study. An experiment would 


be needed to confirm (or not confirm) the observed effect. 


Yes, because a 50% reduction can’t happen just by chance. 


A farmer is conducting an experiment to determine which variety of 
apple tree, Fuji or Gala, will produce more fruit in his orchard. The 
orchard is divided into 20 equally sized square plots. He has 10 trees 
of each variety and randomly assigns each tree to a separate plot in 
the orchard. What are the experimental unit(s) in this study? 


a. 
b. 


Cc. 


The trees 
The farmer 


The plots 
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d. The orchard 


e. The apples 


90. Two essential features of all statistically designed experiments are 
a. comparing several treatments; using the double-blind method. 


b. comparing several treatments; using chance to assign subjects to 
treatments. 


c. always having a placebo group; using the double-blind method. 
d. using a block design; using chance to assign subjects to treatments. 


e. using enough subjects; always having a control group. 


Recycle and Review 


91. Seed weights (2.2) Biological measurements on the same species 
often follow a Normal distribution quite closely. The weights of seeds 
of a variety of winged bean are approximately Normal with mean 525 
milligrams (mg) and standard deviation 110 mg. 


a. What percent of seeds weigh more than 500 mg? 


b. If we discard the lightest 10% of these seeds, what is the smallest 
weight among the remaining seeds? 
92. Comparing rainfall (1.3) The boxplots summarize the distributions 
of average monthly rainfall (in inches) for Tucson, Arizona, and 
Princeton, New Jersey.22 Compare these distributions. 
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SECTION 4.3 Using Studies Wisely 


LEARNING TARGETS By the end of the section, you should be able to: 


e Explain the concept of sampling variability when making an 
inference about a population and how sample size affects sampling 
variability. 


Explain the meaning of statistically significant in the context of an 
experiment and use simulation to determine if the results of an 


experiment are statistically significant. 


Identify when it is appropriate to make an inference about a 
population and when it is appropriate to make an inference about 
cause and effect. 


Evaluate if a statistical study has been carried out in an ethical 
manner.! 


Researchers who conduct statistical studies often want to draw 
conclusions that go beyond the data they produce. Here are two examples. 


e The U.S. Census Bureau carries out a monthly Current Population 
Survey of about 60,000 households. Their goal is to use data from these 
randomly selected households to estimate the percent of unemployed 
individuals in the population. 


e Scientists performed an experiment that randomly assigned 21 volunteer 
subjects to one of two treatments: sleep deprivation for one night or 
unrestricted sleep. The scientists hoped to show that sleep deprivation 
causes a decrease in performance two days later.22 


What conclusions can be drawn from a particular study? The answer 
depends on how the data were collected. 


Inference for Sampling 


When the members of a sample are selected at random from a population, 
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we can use the sample results to infer things about the population. That is, 
we can make inferences about the population from which the sample was 
randomly selected. Inference from convenience samples or voluntary 
response samples would be misleading because these methods of choosing 
a sample are biased. In these cases, we are almost certain that the sample 
does not fairly represent the population. 


Even when making an inference from a random sample, it would be 
surprising if the estimate from the sample was exactly equal to the truth 
about the population. For example, in a random sample of 1399 U.S. teens 
aged 13-18, 26% reported more than 8 hours of entertainment media use 
per day. Because of sampling variability, it would be surprising if exactly 
26% of all U.S. teens aged 13-18 reported more than 8 hours of 
entertainment media use per day. Why? Because different samples of 1399 
U.S. teens aged 13-18 will include different sets of people and produce 
different estimates. 


DEFINITION Sampling variability 


Sampling variability refers to the fact that different random samples 
of the same size from the same population produce different 
estimates. 


The following activity explores the idea of sampling variability. 


ACTIVITY Exploring sampling variability 


When making an inference about a population from a random sample, 
we shouldn’t expect the estimate to be exactly correct. But how much 
do sample results vary? Your teacher has prepared a large population of 
beads, where 30% have a certain color (e.g., red) so you can explore this 
question. 
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1. Ina moment, you will select a random sample of 20 beads. Do you 
expect that your sample will contain exactly 30% red beads? Explain 
your reasoning. 


2. Mix the beads thoroughly, select a random sample of 20 beads from 
the population, calculate the percent of red beads in the sample, and 
replace the beads in the population. 


3. After all students have selected a sample, make a class dotplot 
showing each student’s estimate for the percent of red beads. Where 
is the graph centered? How much does the percent of red beads vary? 


4. Imagine that you repeated Steps 2 and 3 with random samples of size 
100. How do you expect the dotplots would compare? If there’s time, 
select random samples of size 100 to confirm your answer. 


When Mrs. Storrs’s class of 40 students did the red bead activity, they 
produced the dotplot shown at left (top) for samples of size 20. The dotplot 
is centered around 30%, the true percent of red beads. This shouldn’t be 
surprising because random sampling helps avoid bias. Notice also that the 
estimates varied from 10% to 50% and that only 11 of the 40 estimates 
were equal to exactly 30%. 
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Percent of red beads (n = 20) 


10 20 30 40 50 
Percent of red beads (n = 100) 


To see the effect of increasing the sample size, we simulated 40 
random samples of size 100 from the same population and recorded the 
percent of red beads in each sample. Notice that the graph is still centered 
at 30%, but there is much less variability. 


SAMPLING VARIABILITY AND SAMPLE SIZE 


Larger random samples tend to produce estimates that are closer to the 
true population value than smaller random samples. In other words, 
estimates from larger samples are more precise. 


EXAMPLE | Weighing football players (») 


Inference for sampling 


ee 
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PROBLEM: How much do National Football League (NFL) players 
weigh, on average? In a random sample of 50 NFL players, the average 
weight is 244.4 pounds. 


a. Do you think that 244.4 pounds is the true average weight of all NFL 
players? Explain your answer. 


b. Which would be more likely to give an estimate close to the true 
average weight of all NFL players: a random sample of 50 players or 
a random sample of 100 players? Explain your answer. 


SOLUTION: 


a. No. Different samples of size 50 would produce different average 
weights. So it would be surprising if this estimate is equal to the true 
average weight of all NFL players. 


b. A random sample of 100 players, because estimates tend to be closer 
to the truth when the sample size is larger. 


FOR PRACTICE, TRY EXERCISE 93 


Estimates from random samples often come with a margin of error that 
allows us to create an interval of plausible values for the true population 
value. In the preceding example about NFL players, the margin of error for 
the estimate of 244.4 pounds is 14.2 pounds. Based on this margin of error, 
it wouldn’t be surprising if the true average weight for all NFL players was 
as small as 244.4 — 14.2 = 230.2 pounds or as large as 244.4 + 14.2 = 
258.6 pounds. 


You will learn how to calculate the margin of error in Chapter 8. For 
now, make sure to remember the effect of sampling variability when using 
data from a random sample to make an inference about a population. 


Inference for Experiments 
Well-designed experiments allow for inferences about cause and effect. 


But we should only conclude that changes in the explanatory variable 
cause changes in the response variable if the results of an experiment are 
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statistically significant. 


DEFINITION Statistically significant 


When the observed results of a study are too unusual to be explained 
by chance alone, the results are called statistically significant. 


Mr. Wilcox and his students decided to perform the caffeine 
experiment from the preceding section. In their experiment, 10 student 
volunteers were randomly assigned to drink cola with caffeine and the 
remaining 10 students were assigned to drink caffeine-free cola. The table 
and graph show the change in pulse rate for each student (Final pulse rate 
— Initial pulse rate), along with the mean change for each group. 


‘affoine as 
Caffeine 7° . e Change in pulse rate Mean 
. (Final pulse rate — Initial pulse rate) change 
Caffeine 8 3 5 14061 4 0 3.2 


No Caffeine 3 —2 4 -1 5 5 12 -1 4 = 20 


= 
z 
3 
I 
) 


No caffeine . 
2 0 2 4 6 
Change in pulse rate 


(Final — Initial) 

The dotplots provide some evidence that caffeine has an effect on 
pulse rates. The mean change for the 10 students who drank cola with 
caffeine was 3.2, which is 1.2 greater than for the group who drank 
caffeine-free cola. But are the results statistically significant? 


Recall that the purpose of random assignment in this experiment was 
to create two groups that were roughly equivalent at the beginning of the 
experiment. Subjects with high caffeine tolerance should be split up in 
about equal numbers, subjects with high metabolism should be split up in 
about equal numbers, and so on. 

Of course, the random assignment is unlikely to produce groups that 
are exactly equivalent. One group might get more “favorable” subjects just 
by chance. That is, the caffeine group might end up with a few extra 
subjects who were likely to have a pulse rate increase, just due to chance 
variation in the random assignment. 

There are two ways to explain why the mean change in pulse rate was 
1.2 greater for the caffeine group: 


.. Caffeine does not have an effect on pulse rates, and the difference of 1.2 
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happened because of chance variation in the random assignment. 


’. Caffeine increases pulse rates. 


If it is plausible to get a difference of 1.2 or more simply due to the 
chance variation in random assignment, the results of the experiment are 
not statistically significant. But if it is very unlikely to get a difference of 
1.2 or more by chance alone, we rule out Explanation 1 and say the results 
are statistically significant—and that caffeine increases pulse rates. 


How can we determine if a difference of 1.2 is statistically significant? 
You’ll find out in the following activity. 


ACTIVITY Analyzing the caffeine experiment 


In the experiment performed by Mr. Wilcox’s class, the mean change in 
pulse rate for the caffeine group was 1.2 greater than the mean change 
for the no-caffeine group. This provides some evidence that caffeine 
increases pulse rates. Is this evidence convincing? Or is it plausible that 
a difference of 1.2 would arise just due to chance variation in the 
random assignment? In this activity, we’ll investigate by seeing what 
differences typically occur just by chance, assuming caffeine doesn’t 
affect pulse rates. That is, we’ll assume that the change in pulse rate for 
a particular student would be the same regardless of what treatment he 
or she was assigned. 


Zaia Snively 


Change in pulse rate Mean 
(Final pulse rate — Initial pulse rate) change 


Caffeine 8 3 5 14061 4 0 32 
No Caffeine 3 2 4 -1 5 5 12 -1 4 2.0 


1. Gather 20 index cards to represent the 20 students in this experiment. 
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On each card, write one of the 20 outcomes listed in the table. For 
example, write “8” on the first card, “3” on the second card, and so 
on. 


2. Shuffle the cards and deal two piles of 10 cards each. This represents 
randomly assigning the 20 students to the two treatments, assuming 
that the treatment received doesn’t affect the change in pulse rate. 
The first pile of 10 cards represents the caffeine group, and the 
second pile of 10 cards represents the no-caffeine group. 


3. Find the mean change for each group and subtract the means 
(Caffeine — No caffeine). Note: It is possible to get a negative 
difference. 


4. Your teacher will draw and label an axis for a class dotplot. Plot the 
difference you got in Step 3 on the graph. 


5. In Mr. Wilcox’s class, the observed difference in means was 1.2. Is a 
difference of 1.2 statistically significant? Discuss with your 
classmates. 


We used technology to perform 100 trials of the simulation described 
in the activity. The dotplot in Figure 4.3 shows that getting a difference of 
1.2 isn’t that unusual. In 19 of the 100 trials, we obtained a difference of 
1.2 or more simply due to chance variation in the random assignment. 


In the 100 trials, 19 times 
the difference was 1.2 or 


3 2 
Simulated difference (Caffeine — No caffeine) 
in mean change in pulse rate 


FIGURE 4.3 Dotplot showing the differences in means that occurred in 100 
simulated random assignments, assuming that caffeine has no effect on pulse rates. 


Because the difference of 1.2 or greater is somewhat likely to occur by 
chance alone, the results of Mr. Wilcox’s class experiment aren’t 
statistically significant. Based on this experiment, there isn’t convincing 
evidence that caffeine increases pulse rates. 
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EXAMPLE | Distracted driving (») 


Inference for experiments 


Sean Locke Photography/Shutterstock.com 


PROBLEM: Is talking on a cell phone while driving more distracting 
than talking to a passenger? David Strayer and his colleagues at the 
University of Utah designed an experiment to help answer this question. 
They used 48 undergraduate students as subjects. The researchers 
randomly assigned half of the subjects to drive in a simulator while 
talking on a cell phone, and the other half to drive in the simulator while 
talking to a passenger. One response variable was whether or not the 
driver stopped at a rest area that was specified by researchers before the 
simulation started. The table shows the results.24 


a. Calculate the difference (Passenger — Cell phone) in the proportion of 
students who stopped at the rest area in the two groups. 
One hundred trials of a simulation were performed to see what 
differences in proportions would occur due only to chance variation 
in the random assignment, assuming that the type of distraction did 
not affect whether a subject stopped at the rest area. That is, 33 
“stoppers” and 15 “non-stoppers” were randomly assigned to two 
groups of 24. 


b. There are three dots at 0.29. Explain what these dots mean in this 
context. 


c. Use the results of the simulation to determine if the difference in 
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proportions from part (a) is statistically significant. Explain your 
reasoning. 


Treatment 
Cell phone | Passenger Total 
Stopped at rest area 33 
Response Didn’t stop 15 
Total 48 
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Simulated difference (Passenger — Cell phone) 
in proportion of students who stopped 


SOLUTION: 


a. Difference in proportions = 21/24 — 12/24 = 0.875 — 0.500 = 0.375 


b. When we assumed that the type of distraction doesn’t matter, there 
were three simulated random assignments where the difference in the 
proportion of students who stopped at the rest area was 0.29. 


c. Because a difference of 0.375 or greater never occurred in the 
simulation, the difference is statistically significant. It is extremely 
unlikely to get a difference this big simply due to chance variation in 
the random assignment. 


Because the difference is statistically significant, we can make a cause- 
and-effect conclusion: talking on a cell phone is more distracting than 


talking with a passenger—at least for subjects like those in the 


experiment. 
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FOR PRACTICE, TRY EXERCISE 99 


In the caffeine example, we said that a difference in means of 1.2 was 
not unusual because a difference that big or bigger occurred 19% of the 
time by chance alone. In the distracted drivers example, we said that a 
difference in proportions of 0.375 was unusual because a difference this 
big or bigger occurred 0% of the time by chance alone. So the boundary 
between “not unusual” and “unusual” must be somewhere between 0% and 
19%. For now, we recommend using a boundary of 5% so that differences 
that would occur less than 5% of the time by chance alone are considered 
Statistically significant. 


The Scope of Inference: Putting it All Together 


The type of conclusion that can be drawn from a study depends on how the 
data in the study were collected. 


In the example about average weight in the NFL, the players were 
randomly selected from all NFL players. As you learned in Section 4.1, 
random sampling helps to avoid bias and produces reliable estimates of the 
truth about the population. Because the mean weight in the sample of 
players was 244.4 pounds, our best guess for the mean weight in the 
population of all NFL players is 244.4 pounds. Even though our estimates 
are rarely exactly correct, when samples are selected at random, we can 
make an inference about the population. 


In the distracted driver experiment, subjects were randomly assigned to 
talk on a cell phone or talk to a passenger. As you learned in Section 4.2, 
random assignment helps ensure that the two groups of subjects are as 
alike as possible before the treatments are imposed. If the group assigned 
to talk with a passenger remembers to stop at the rest area more often than 
the group assigned to talk on a cell phone, and the difference is too large to 
be explained by chance variation in the random assignment, it must be due 
to the treatments. In that case, the researchers could safely conclude that 
talking on a cell phone is more distracting than talking to a passenger. That 
is, they can make an inference about cause and effect. However, because 
the experiment used volunteer subjects, the scientists can only apply this 
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conclusion to subjects like the ones in their experiment. 


Both random sampling and random assignment introduce chance variation into a 
statistical study. When performing inference, statisticians use the laws of probability 


to describe this chance variation. You’ll learn how this works later in the book. 


Let’s recap what we’ve learned about the scope of inference in a 
Statistical study. 


THE SCOPE OF INFERENCE 


e Random selection of individuals allows inference about the 
population from which the individuals were chosen. 


e Random assignment of individuals to groups allows inference about 
cause and effect. 


The following chart summarizes the possibilities.22 
Were individuals randomly assigned to groups? 
Yes No 


Inference about the Inference about the 
population: YES Inference population: YES Inference 


Were individuals about cause and effect: YES about cause and effect: NO 


randomly 


selected? Inference about the Inference about the 


No population: NO Inference —_ population: NO Inference 
about cause and effect: YES about cause and effect: NO 


Well-designed experiments randomly assign individuals to treatment 
groups. However, most experiments don’t select experimental units at 
random from the larger population. That limits such experiments to 
inference about cause and effect for individuals like those who received 
the treatments. Observational studies don’t randomly assign individuals to 
groups, which makes it challenging to make an inference about cause and 
effect. But an observational study that uses random sampling can make an 
inference about the population. 
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EXAMPLE | When will I ever use this stuff? (») 


The scope of inference 


PROBLEM: Researchers at the University of North Carolina were 
concerned about the increasing dropout rate in the state’s high schools, 
especially for low-income students. Surveys of recent dropouts revealed 
that many of these students had started to lose interest during middle 
school. They said they saw little connection between what they were 
studying in school and their future plans. To change this perception, 
researchers developed a program called CareerStart. The central idea of 
the program is that teachers show students how the topics they’re 
learning about can be applied to specific careers. 


To test the effectiveness of CareerStart, the researchers recruited 14 
middle schools in Forsyth County to participate in an experiment. Seven 
of the schools, determined at random, used CareerStart along with the 
district’s standard curriculum. The other 7 schools just followed the 
standard curriculum. Researchers followed both groups of students for 
several years, collecting data on students’ attendance, behavior, 
standardized test scores, level of engagement in school, and whether or 
not the students graduated from high school. 


Results: Students at schools that used CareerStart generally had 
significantly better attendance and fewer discipline problems, earned 
higher test scores, reported greater engagement in their classes, and 


were more likely to graduate.2© 


What conclusion can we draw from this study? Explain your reasoning. 


SOLUTION: 


Because treatments were randomly assigned and the results were 
significant, we can conclude that using the CareerStart curriculum 
caused better attendance, fewer discipline problems, higher test scores, 
greater engagement, and increased graduation rates. However, these 
results only apply to schools like those in the study because the schools 
were not randomly selected from any population. 


602 


With no random selection, the results of the study should be applied only 


to schools like those in the study. With random assignment, it is possible 
to make an inference about cause and effect. 


FOR PRACTICE, TRY EXERCISE 103 


CHECK YOUR UNDERSTANDING 


. 


y When an athlete suffers a sports-related concussion, does it help to 
remove the athlete from play immediately? Researchers recruited 95 
athletes seeking care for a sports-related concussion at a medical clinic 
and followed their progress during recovery. Researchers found 
Statistically significant evidence that athletes who were removed from 
play immediately recovered more quickly, on average, than athletes who 
continued to play.24 What conclusion can we draw from this study? 
Explain your answer. 


The Challenges of Establishing Causation 


A well-designed experiment can tell us that changes in the explanatory 
variable cause changes in the response variable. More precisely, it tells us 
that this happened for specific individuals in the specific environment of 
this specific experiment. In some cases, it isn’t practical or even ethical to 
do an experiment. Consider these important questions: 


e Does going to church regularly help people live longer? 


e Does smoking cause lung cancer? 


To answer these cause-and-effect questions, we just need to perform a 
randomized comparative experiment. Unfortunately, we can’t randomly 
assign people to attend church or to smoke cigarettes. The best data we 
have about these and many other cause-and-effect questions come from 
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observational studies. 


Doctors had long observed that most lung cancer patients were 
smokers. Comparison of smokers and similar nonsmokers showed a very 
strong association between smoking and death from lung cancer. Could the 
association be due to some other variable? Is there some genetic factor that 
makes people both more likely to become addicted to nicotine and to 
develop lung cancer? If so, then smoking and lung cancer would be 
strongly associated even if smoking had no direct effect on the lungs. Or 
maybe confounding is to blame. It might be that smokers live unhealthy 
lives in other ways (diet, alcohol, lack of exercise) and that some other 
habit confounded with smoking is a cause of lung cancer. Still, it is 
sometimes possible to build a strong case for causation in the absence of 
experiments. The evidence that smoking causes lung cancer is about as 
strong as nonexperimental evidence can be. 


archives/Getty Images 


There are several criteria for establishing causation when we can’t do 
an experiment: 


e The association is strong. The association between smoking and lung 
cancer is very strong. 


e The association is consistent. Many studies of different kinds of people 
in many countries link smoking to lung cancer. That reduces the chance 
that some other variable specific to one group or one study explains the 
association. 


e Larger values of the explanatory variable are associated with stronger 
responses. People who smoke more cigarettes per day or who smoke 
over a longer period get lung cancer more often. People who stop 


604 


smoking reduce their risk. 


e The alleged cause precedes the effect in time. Lung cancer develops 
after years of smoking. The number of men dying of lung cancer rose as 
smoking became more common, with a lag of about 30 years. Lung 
cancer kills more men than any other form of cancer. Lung cancer was 
rare among women until women began to smoke. Lung cancer in 
women rose along with smoking, again with a lag of about 30 years, and 
has passed breast cancer as the leading cause of cancer death among 
women. 


e The alleged cause is plausible. Experiments with animals show that tars 
from cigarette smoke do cause cancer. 


Medical authorities do not hesitate to say that smoking causes lung 
cancer. The U.S. Surgeon General states that cigarette smoking is “the 
largest avoidable cause of death and disability in the United States.”22 The 
evidence for causation is overwhelming—but it is not as strong as the 
evidence provided by well-designed experiments. Conducting an 
experiment in which some subjects were forced to smoke and others were 
not allowed to would be unethical. In cases like this, observational studies 
are our best source of reliable information. 


Data Ethics! 


There are some potential experiments that are clearly unethical. In other 
cases, the boundary between “ethical” and “unethical” isn’t as clear. 
Decide if you think each of the following studies is ethical or unethical: 


e A promising new drug has been developed for treating cancer in 
humans. Before giving the drug to human subjects, researchers want to 
administer the drug to animals to see if there are any potentially serious 
side effects. 


e Are companies discriminating against some individuals in the hiring 
process? To find out, researchers prepare several equivalent résumés for 
fictitious job applicants, with the only difference being the gender of the 
applicant. They send the fake résumés to companies advertising 
positions and keep track of the number of males and females who are 
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contacted for interviews. 


e Will people try to stop someone from driving drunk? A television news 
program hires an actor to play a drunk driver and uses a hidden camera 
to record the behavior of individuals who encounter the driver. 


The most complex issues of data ethics arise when we collect data 
from people. The ethical difficulties are more severe for experiments that 
impose some treatment on people than for sample surveys that simply 
gather information. Trials of new medical treatments, for example, can do 
harm as well as good to their subjects. 


Here are some basic standards of data ethics that must be obeyed by all 
studies that gather data from human subjects, both observational studies 
and experiments. The law requires that studies carried out or funded by the 
federal government obey these principles.22 But neither the law nor the 
consensus of experts is completely clear about the details of their 
application. 


BASIC DATA ETHICS 


e All planned studies must be reviewed in advance by an institutional 
review board charged with protecting the safety and well-being of 
the subjects. 


e All individuals who are subjects in a study must give their informed 
consent before data are collected. 


e All individual data must be kept confidential. Only statistical 
summaries for groups of subjects may be made public. 


Institutional Review Boards The purpose of an institutional review board 
is not to decide whether a proposed study will produce valuable 
information or if it is statistically sound. The board’s purpose is, in the 
words of one university’s board, “to protect the rights and welfare of 
human subjects (including patients) recruited to participate in research 
activities.” The board reviews the plan of the study and can require 
changes. It reviews the consent form to be sure that subjects are informed 
about the nature of the study and about any potential risks. Once research 
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begins, the board monitors its progress at least once a year. 


Informed Consent Both words in the phrase informed consent are 
important, and both can be controversial. Subjects must be informed in 
advance about the nature of a study and any risk of harm it may bring. In 
the case of a questionnaire, physical harm is not possible. But a survey on 
sensitive issues could result in emotional harm. The participants should be 
told what kinds of questions the survey will ask and roughly how much of 
their time it will take. Experimenters must tell subjects the nature and 
purpose of the study and outline possible risks. Subjects must then consent 
in writing. 


Confidentiality It is important to protect individuals’ privacy by keeping 
all data about them confidential. The report of an opinion poll may say 
what percent of the 1200 respondents believed that legal immigration 
should be reduced. It may not report what you said about this or any other 
issue. Confidentiality is not the same as anonymity. Anonymity means that 
individuals are anonymous—their names are not known even to the 
director of the study. Anonymity is rare in statistical studies. Even where 
anonymity is possible (mainly in surveys conducted by mail), it prevents 
any follow-up to improve nonresponse or inform individuals of results. 


Section 4.3 Summary 


e Sampling variability refers to the idea that different random samples of 
the same size from the same population produce different estimates. 
Reduce sampling variability by increasing the sample size. 


e When the observed results of a study are too unusual to be explained by 
chance alone, we say that the results are statistically significant. 


e Most studies aim to make inferences that go beyond the data produced. 


: Inference about a population requires that the individuals taking part 
in a study be randomly selected from the population. 


« A well-designed experiment that randomly assigns experimental units 
to treatments allows inference about cause and effect. 
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e Inthe absence of an experiment, good evidence of causation requires a 
strong association that appears consistently in many studies, a clear 
explanation for the alleged causal link, and careful examination of other 
variables. 


e Studies involving humans must be screened in advance by an 
institutional review board. All participants must give their informed 
consent before taking part. Any information about the individuals in the 
study must be kept confidential. 


Section 4.3 Exercises 


93. pg 271 pm) Tweet, tweet! What proportion of students at your 


school use Twitter? To find out, you survey a simple random sample 
of students from the school roster. 


a. Will your sample result be exactly the same as the true population 
proportion? Explain your answer. 


b. Which would be more likely to produce a sample result closer to 
the true population value: an SRS of 50 students or an SRS of 100 
students? Explain your answer. 


94. Far from home? A researcher wants to estimate the average distance 
that students at a large community college live from campus. To find 
out, she surveys a simple random sample of students from the 
registrar’s database. 


a. Will the researcher’s sample result be exactly the same as the true 
population mean? Explain your answer. 


b. Which would be more likely to produce a sample result closer to 
the true population value: an SRS of 100 students or an SRS of 50 
students? Explain your answer. 


95. Football on TV A Gallup poll conducted telephone interviews with a 
random sample of 1000 adults aged 18 and older. Of these, 37% said 
that football is their favorite sport to watch on television. The margin 
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of error for this estimate is 3.1 percentage points. 


a. Would you be surprised if a census revealed that 50% of adults in 
the population would say their favorite sport to watch on TV is 
football? Explain your answer. 


b. Explain how Gallup could decrease the margin of error. 


96. Car colors in Miami Using a webcam, a traffic analyst selected a 
random sample of 800 cars traveling on I-195 in Miami on a weekday 
morning. Among the 800 cars in the sample, 24% were white. The 
margin of error for this estimate is 3.0 percentage points. 


a. Would you be surprised if a census revealed that 26% of cars on I- 
195 in Miami on a weekday morning were white? Explain your 
answer. 


b. Explain how the traffic analyst could decrease the margin of error. 


97. Kissing the right way According to a newspaper article, “Most 
people are kissing the ‘right way.’” That is, according to a study, the 
majority of couples prefer to tilt their heads to the right when kissing. 
In the study, a researcher observed a random sample of 124 kissing 
couples and found that 83/124 (66.9%) of the couples tilted to the 
right.£2 To determine if these data provide convincing evidence that 
couples are more likely to tilt their heads to the right, 100 simulated 
SRSs were selected. 


Each dot in the graph shows the number of couples that tilt to the 
right in a simulated SRS of 124 couples, assuming that each couple 
has a 50% chance of tilting to the right. 
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Simulated number of couples who tilt to the right 


a. Explain how the graph illustrates the concept of sampling 
variability. 


b. Based on the data from the study and the results of the simulation, 
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is there convincing evidence that couples prefer to kiss the “right 
way’? Explain your answer. 


98. Weekend birthdays Over the years, the percentage of births that are 
planned caesarean sections has been rising. Because doctors can 
schedule these deliveries, there might be more children born during 
the week and fewer born on the weekend than if births were 
uniformly distributed throughout the week. To investigate, Mrs. 
McDonald and her class selected an SRS of 73 people born since 
1993. Of these people, 24 were born on Friday, Saturday, or Sunday. 


To determine if these data provide convincing evidence that fewer 
than 43% (3/7) of people born since 1993 were born on Friday, 
Saturday, or Sunday, 100 simulated SRSs were selected. Each dot in 
the graph shows the number of people that were born on Friday, 
Saturday, or Sunday in a simulated SRS of 73 people, assuming that 
each person had a 43% chance of being born on one of these three 
days. 


eecsecoeoesesoeseooaes 


N 
N 
Ne 
= 
nN 
ia) 
No 
oo 


30 32 34 36 38 40 


Simulated number who were born on 
Friday, Saturday, or Sunday 


a. Explain how the graph illustrates the concept of sampling 
variability. 


b. Based on the data from the study and the results of the simulation, 
is there convincing evidence that fewer than 43% of people born 
since 1993 were born on Friday, Saturday, or Sunday? Explain 
your answer. 


99. pg 274 |) I work out a lot Are people influenced by what others 


say? Michael conducted an experiment in front of a popular gym. As 
people entered, he asked them how many days they typically work 
out per week. As he asked the question, he showed the subjects one of 
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two clipboards, determined at random. Clipboard A had the question 
and many responses written down, where the majority of responses 
were 6 or 7 days per week. Clipboard B was the same, except most of 
the responses were 1 or 2 days per week. The mean response for the 
Clipboard A group was 4.68 and the mean response for the Clipboard 


B group was Ao a) 


a. Calculate the difference (Clipboard A — Clipboard B) in the mean 
number of days for the two groups. 
One hundred trials of a simulation were performed to see what 
differences in means would occur due only to chance variation in 
the random assignment, assuming that the responses on the 
clipboard don’t matter. The results are shown in the dotplot. 
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b. There is one dot at 0.72. Explain what this dot means in this 
context. 


c. Use the results of the simulation to determine if the difference in 
means from part (a) is statistically significant. Explain your 
reasoning. 


100. A louse-y situation A study published in the New England Journal 
of Medicine compared two medicines to treat head lice: an oral 
medication called ivermectin and a topical lotion containing 
malathion. Researchers studied 812 people in 376 households in 
seven areas around the world. Of the 185 households randomly 
assigned to ivermectin, 171 were free from head lice after 2 weeks, 
compared with only 151 of the 191 households randomly assigned 


to malathion.©2 


a. Calculate the difference (Ivermectin — Malathion) in the 
proportion of households that were free from head lice in the two 
groups. 
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One hundred trials of a simulation were performed to see what 
differences in proportions would occur due only to chance 
variation in the random assignment, assuming that the type of 
medication doesn’t matter. The results are shown in the dotplot. 
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in proportion who were free from head lice 


. There is one dot at 0.09. Explain what this dot means in this 


context. 


Use the results of the simulation to determine if the difference in 
proportions from part (a) is statistically significant. Explain your 
reasoning. 


101. Acupuncture and pregnancy A study sought to determine if the 
ancient Chinese art of acupuncture could help infertile women 


become pregnant.®2 A total of 160 healthy women undergoing 
assisted reproductive therapy were recruited for the study. Half of 
the subjects were randomly assigned to receive acupuncture 
treatment 25 minutes before embryo transfer and again 25 minutes 
after the transfer. The remaining 80 subjects were instructed to lie 
still for 25 minutes after the embryo transfer. Results: In the 
acupuncture group, 34 women became pregnant. In the control 
group, 21 women became pregnant. 


a. 


Why did researchers randomly assign the subjects to the two 
treatments? 


The difference in the percent of women who became pregnant in 
the two groups is statistically significant. Explain what this 
means to someone who knows little statistics. 


Explain why the design of the study prevents us from concluding 
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that acupuncture caused the difference in pregnancy rates. 


102. Do diets work? Dr. Linda Stern and her colleagues recruited 132 
obese adults at the Philadelphia Veterans Affairs Medical Center in 
Pennsylvania. Half the participants were randomly assigned to a 
low-carbohydrate diet and the other half to a low-fat diet. 
Researchers measured each participant’s change in weight and 
cholesterol level after six months and again after one year. Subjects 
in the low-carb diet group lost significantly more weight than 
subjects in the low-fat diet group during the first six months. At the 
end of a year, however, the average weight loss for subjects in the 
two groups was not significantly different.®4 


a. Why did researchers randomly assign the subjects to the diet 
treatments? 


b. Explain to someone who knows little statistics what “lost 
significantly more weight” means. 


c. The subjects in the low-carb diet group lost an average of 5.1 kg 
in a year. The subjects in the low-fat diet group lost an average of 
3.1 kg in a year. Explain how this information could be consistent 
with the fact that weight loss in the two groups was not 
significantly different. 


103. pg 276 |) Foster care versus orphanages Do abandoned children 


placed in foster homes do better than similar children placed in an 
institution? The Bucharest Early Intervention Project found 
statistically significant evidence that they do. The subjects were 136 
young children abandoned at birth and living in orphanages in 
Bucharest, Romania. Half of the children, chosen at random, were 
placed in foster homes. The other half remained in the 
orphanages.©2 (Foster care was not easily available in Romania at 
the time and so was paid for by the study.) What conclusion can we 
draw from this study? Explain your reasoning. 


104. Frozen batteries Will storing batteries in a freezer make them last 
longer? To find out, a company that produces batteries takes a 
random sample of 100 AA batteries from its warehouse. The 
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company statistician randomly assigns 50 batteries to be stored in 
the freezer and the other 50 to be stored at room temperature for 3 
years. At the end of that time period, each battery’s charge is tested. 
Result: Batteries stored in the freezer had a significantly higher 
average charge. What conclusion can we draw from this study? 
Explain your reasoning. 


105. Attend church, live longer? One of the better studies of the effect of 
regular attendance at religious services gathered data from a random 
sample of 3617 adults. The researchers then measured lots of 
variables, not just the explanatory variable (religious activities) and 
the response variable (length of life). A news article said: 
“Churchgoers were more likely to be nonsmokers, physically active, 
and at their right weight. But even after health behaviors were taken 
into account, those not attending religious services regularly still 
were significantly more likely to have died.”©© What conclusion can 
we draw from this study? Explain your reasoning. 


106. Rude surgeons Is a friendly surgeon a better surgeon? In a study of 
more than 32,000 surgical patients from 7 different medical centers, 
researchers classified surgeons by the number of unsolicited 
complaints that had been recorded about their behavior. The 
researchers found that surgical complications were significantly 
more common in patients whose surgeons had received the most 
complaints, compared with patients whose surgeons had received 
the fewest complaints.£4 What conclusion can we draw from this 
study? Explain your reasoning. 


107. Berry good Eating blueberries and strawberries might improve heart 
health, according to a long-term study of 93,600 women who 
volunteered to take part. These berries are high in anthocyanins due 
to their pigment. Women who reported consuming the most 
anthocyanins had a significantly smaller risk of heart attack 
compared to the women who reported consuming the least. What 
conclusion can we draw from this study? Explain your reasoning.©8 


108. Exercise and memory A study of strength training and memory 
randomly assigned 46 young adults to two groups. After both 
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groups were shown 90 pictures, one group had to bend and extend 
one leg against heavy resistance 60 times. The other group stayed 
relaxed, while the researchers used the same exercise machine to 
bend and extend their legs with no resistance. Two days later, each 
subject was shown 180 pictures—the original 90 pictures plus 90 
new pictures and asked to identify which pictures were shown two 
days earlier. The resistance group was significantly more successful 
in identifying these pictures than was the relax group. What 
conclusions can we draw from this study? Explain your reasoning.&2 


109. * Minimal risk? You have been invited to serve on a college’s 
institutional review board. You must decide whether several 
research proposals qualify for lighter review because they involve 
only minimal risk to subjects. Federal regulations say that “minimal 
risk” means the risks are no greater than “those ordinarily 
encountered in daily life or during the performance of routine 
physical or psychological examinations or tests.” That’s vague. 
Which of these do you think qualifies as “minimal risk”? 


a. Draw a drop of blood by pricking a finger to measure blood 
sugar. 


b. Draw blood from the arm for a full set of blood tests. 


c. Insert a tube that remains in the arm so that blood can be drawn 
regularly. 


110. * Who reviews? Government regulations require that institutional 
review boards consist of at least five people, including at least one 
scientist, one nonscientist, and one person from outside the 
institution. Most boards are larger, but many contain just one 
outsider. 


a. Why should review boards contain people who are not scientists? 


b. Do you think that one outside member is enough? How would 
you choose that member? (For example, would you prefer a 
medical doctor? A religious leader? An activist for patients’ 
rights?) 
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111. * Facebook emotions In cooperation with researchers from Cornell 
University, Facebook randomly selected almost 700,000 users for an 
experiment in “emotional contagion.” Users’ news feeds were 
manipulated (without their knowledge) to selectively show postings 
from their friends that were either more positive or more negative in 
tone, and the emotional tone of their own subsequent postings was 
measured. The researchers found evidence that people who read 
emotionally negative postings were more likely to post messages 
with a negative tone, whereas those who read positive messages 
were more likely to post messages with a positive tone. The research 
was widely criticized for being unethical. Explain why-2 


112. * No consent needed? In which of the circumstances listed here 
would you allow collecting personal information without the 
subjects’ consent? 


a. A government agency takes a random sample of income tax 
returns to obtain information on the average income of people in 
different occupations. Only the incomes and occupations are 
recorded from the returns, not the names. 


b. A social psychologist attends public meetings of a religious 
group to study the behavior patterns of its members. 


c. A social psychologist pretends to be converted to membership in 
a religious group and attends private meetings to study the 
behavior patterns of its members. 


113. * Anonymous? Confidential? One of the most important 
nongovernment surveys in the United States is the National Opinion 
Research Center’s General Social Survey (GSS). The GSS regularly 
monitors public opinion on a wide variety of political and social 
issues. Interviews are conducted in person in the subject’s home. 
Are a subject’s responses to GSS questions anonymous, 
confidential, or both? Explain your answer. 


114. * Anonymous? Confidential? Texas A&M, like many universities, 
offers screening for HIV, the virus that causes AIDS. Students may 
choose either anonymous or confidential screening. An 
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announcement says, “Persons who sign up for screening will be 
assigned a number so that they do not have to give their name.” 
They can learn the results of the test by telephone, still without 
giving their name. Does this describe anonymous or confidential 
screening? Why? 


115. * The Willowbrook hepatitis studies In the 1960s, children entering 
the Willowbrook State School, an institution for the intellectually 
disabled on Staten Island in New York, were deliberately infected 
with hepatitis. The researchers argued that almost all children in the 
institution quickly became infected anyway. The studies showed for 
the first time that two strains of hepatitis existed. This finding 
contributed to the development of effective vaccines. Despite these 
valuable results, the Willowbrook studies are now considered an 
example of unethical research. Explain why, according to current 
ethical standards, useful results are not enough to allow a study. 


116. * Unequal benefits Researchers on aging proposed to investigate the 
effect of supplemental health services on the quality of life of older 
people. Eligible patients on the rolls of a large medical clinic were 
to be randomly assigned to treatment and control groups. The 
treatment group would be offered hearing aids, dentures, 
transportation, and other services not available without charge to the 
control group. The review board believed that providing these 
services to some but not other persons in the same institution raised 
ethical questions. Do you agree? 


Multiple Choice Select the best answer for Exercises 117 and 118. 


117. Do product labels influence customer perceptions? To find out, 
researchers recruited more than 500 adults and asked them to 
estimate the number of calories, amount of added sugar, and amount 
of fat in a variety of food products. Half of the subjects were 
randomly assigned to evaluate products with the word “Natural” on 
the label, while the other half were assigned to evaluate the same 
products without the “Natural” label. On average, the products with 
the “Natural” label were judged to have significantly fewer calories. 
Based on this study, is it reasonable to conclude that including the 
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word “Natural” on the label causes a reduction in estimated 
calories? 


a. No, because the adults weren’t randomly selected from the 
population of all adults. 


b. No, because there wasn’t a control group for comparison. 
c. No, because association doesn’t imply causation. 


d. Yes, because the adults were randomly assigned to the 
treatments. 


e. Yes, because there were a large number of adults involved in the 
study. 

118. Some news organizations maintain a database of customers who have 
volunteered to share their opinions on a variety of issues. Suppose 
that one of these databases includes 9000 registered voters in 
California. To measure the amount of support for a controversial 
ballot issue, 1000 registered voters in California are randomly 
selected from the database and asked their opinion. Which of the 
following is the largest population to which the results of this survey 
should be generalized? 


a. The 1000 people in the sample 

b. The 9000 registered voters from California in the database 
c. All registered voters in California 

d. All California residents 


e. All registered voters in the United States 


Review and Recycle 


19. Animal testing (1.1) “Tt is right to use animals for medical testing if 
it might save human lives.” The General Social Survey asked 1152 
adults to react to this statement. Here is the two-way table of their 
responses: 
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Gender 


Male Female Total 
Strongly agree 135 
ie 17 
Opinion = 5 
about using Neither agree se 
animals for _‘10r disagree 
medical Disagree 184 
testi 
ea? Strongly agree 90 
Total 516 636 1152 


a. Construct segmented bar graphs to display the distribution of 
opinion for males and for females. 


b. Is there an association between gender and opinion for the 
members of this sample? Explain your answer. 
20. Initial public offerings (1.3) The business magazine Forbes reports 
that 4567 companies sold their first stock to the public between 
1990 and 2000. The mean change in the stock price of these 
companies since the first stock was issued was +111%. The median 
change was —31%.4 Explain how this difference could happen. 


*Exercises 109-116: This is an important topic, but it is not required for 


the AP® Statistics exam. 
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_Mé _. FRAPPY! FREE RESPONSE 


AP® PROBLEM, 
YAY! 


The following problem is modeled after actual AP® Statistics exam 
free-response questions. Your task is to generate a complete, concise 
response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you 
use, because you will be scored on the correctness of your methods 
as well as on the accuracy and completeness of your results and 
explanations. 


In a recent study, 166 adults from the St. Louis area were recruited 
and randomly assigned to receive one of two treatments for a sinus 
infection. Half of the subjects received an antibiotic (amoxicillin) and 
the other half received a placebo.2 


a. Describe how the researchers could have assigned treatments 
to subjects if they wanted to use a completely randomized 
design. 


b. All the subjects in the experiment had moderate, severe, or 
very severe symptoms at the beginning of the study. Describe 
one statistical benefit and one statistical drawback for using 
subjects with moderate, severe, or very severe Symptoms 
instead of just using subjects with very severe symptoms. 


c. At different stages during the next month, all subjects took the 
sino-nasal outcome test. After 10 days, the difference in 
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average test scores was not statistically significant. In this 
context, explain what it means for the difference to be not 
Statistically significant. 


d. One possible way that researchers could have improved the 
study is to use a randomized block design. Explain how the 
researchers could have incorporated blocking in their design. 


After you finish, you can view two example solutions on the book’s 
website (highschool.bfwpub.com/tps6e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 


“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide you with a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own 
score 
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Chapter 4 Review 


Section 4.1: Sampling and Surveys 


In this section, you learned that a population is the group of all individuals 
that we want information about. A sample is the subset of the population 
that we use to gather this information. The goal of most sample surveys is 
to use information from the sample to draw conclusions about the 
population. Choosing people for a sample because they are located nearby 
or letting people choose whether or not to be in the sample are poor ways 
to select a sample. Because convenience sampling and voluntary response 
sampling will produce estimates that are likely to underestimate or likely 
to overestimate the value you want to know, these methods of choosing a 
sample are biased. 


To avoid bias in the way the sample is formed, the members of the 
sample should be chosen at random. One way to do this is with a simple 
random sample (SRS), which is equivalent to selecting well-mixed slips of 
paper from a hat. It is often more convenient to select an SRS using 
technology or a table of random digits. 


Two other random sampling methods are stratified sampling and 
cluster sampling. To obtain a stratified random sample, divide the 
population into groups (strata) of individuals that are likely to have similar 
responses, take an SRS from each stratum, and combine the chosen 
individuals to form the sample. Stratified random samples can produce 
estimates with much greater precision than simple random samples. To 
obtain a cluster sample, divide the population into groups (clusters) of 
individuals that are in similar locations, randomly select clusters, and use 
every individual in the chosen clusters. Cluster samples are easier to obtain 
than simple random samples or stratified random samples, but they may 
not produce very precise estimates. 

Finally, you learned about other issues in sample surveys that can lead 
to bias: undercoverage occurs when the sampling method systematically 
underrepresents one part of the population. Nonresponse describes when 
answers cannot be obtained from some people that were chosen to be in 
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the sample. Bias can also result when some people in the sample don’t 
give accurate responses due to question wording, interviewer 
characteristics, or other factors. 


Section 4.2: Experiments 


In this section, you learned about the difference between observational 
studies and experiments. Experiments deliberately impose a treatment to 
see if there is a cause-and-effect relationship between two variables. 
Observational studies look at relationships between two variables, but 
make it difficult to show cause and effect because other variables may be 
confounded with the explanatory variable. Variables are confounded when 
it is impossible to determine which of the variables is causing a change in 
the response variable. 


A common type of comparative experiment uses a completely 
randomized design. In this type of design, the experimental units are 
assigned to the treatments at random. With random assignment, the 
treatment groups will be roughly equivalent at the beginning of the 
experiment. Replication means giving each treatment to as many 
experimental units as possible. This makes it easier to see the effects of the 
treatments because the effects of other variables are more likely to be 
balanced among the treatment groups. 


During an experiment, it is important that other variables be controlled 
(kept the same) for each experimental unit. Doing so helps avoid 
confounding and removes a possible source of variation in the response 
variable. Also, beware of the placebo effect—the tendency for people to 
improve because they expect to, not because of the treatment they are 
receiving. One way to make sure that all experimental units have the same 
expectations is to make them blind—unaware of which treatment they are 
receiving. When the people interacting with the subjects and measuring the 
response variable are also blind, the experiment is called double-blind. 


Blocking in experiments is similar to stratifying in sampling. To form 
blocks, group together experimental units that are similar with respect to a 
variable that is associated with the response. Then randomly assign the 
treatments within each block. A randomized block design that uses blocks 
with two experimental units is called a matched pairs design. Blocking 
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helps us estimate the effects of the treatments more precisely because we 
can account for the variability introduced by the variables used to form the 
blocks. 


Section 4.3: Using Studies Wisely 


In this section, you learned that the types of conclusions we can draw 
depend on how the data are produced. When samples are selected at 
random, we can make inferences about the population from which the 
sample was drawn. However, the estimates we calculate from sample data 
rarely equal the true population value because of sampling variability. We 
can reduce sampling variability by increasing the sample size. 

When treatments are applied to groups formed at random in an 
experiment, we can make an inference about cause and effect. Making a 
cause-and-effect conclusion is often difficult because it is impossible or 
unethical to perform certain types of experiments. Good data ethics 
requires that studies should be approved by an institutional review board, 
subjects should give informed consent, and individual data must be kept 
confidential. 

Finally, the results of a study are statistically significant if they are too 
unusual to occur by chance alone. 


What Did You Learn? 


Related 
Example on Relevant Chapter 
Learning Target Section Page(s) Review Exercise(s) 
Identify the population 4.1 221 R4.1 


and sample in a 
Statistical study. 


Identify voluntary 4.1 224 R4.2 
response sampling and 

convenience sampling 

and explain how these 

sampling methods can 

lead to bias. 
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Describe how to select 4.1 228 
a simple random 

sample with 

technology or a table 

of random digits. 


Describe how to select 4.1 231 
a sample using 

stratified random 

sampling and cluster 

sampling, distinguish 

stratified random 

sampling from cluster 

sampling, and give an 

advantage of each 

method. 


Explain how 4.1 234 
undercoverage, 

nonresponse, question 

wording, and other 

aspects of a sample 

survey can lead to 

bias. 


Explain the concept of 4.2 243 
confounding and how 

it limits the ability to 

make cause-and-effect 

conclusions. 


Distinguish between 4.2 244 
an observational study 

and an experiment, 

and identify the 

explanatory and 

response variables in 

each type of study. 


Identify the 4.2 246 
experimental units and 

treatments in an 

experiment. 


Describe the placebo 4.2 250 
effect and the purpose 
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of blinding in an 
experiment. 


Describe how to 4.2 251 
randomly assign 

treatments in an 

experiment using slips 

of paper, technology, 

or a table of random 

digits. 

Explain the purpose of 4.2 254 
comparison, random 

assignment, control, 

and replication in an 

experiment. 


Describe a completely 4.2 256 
randomized design for 
an experiment. 


Describe a randomized 4.2 259, 260 
block design and a 

matched pairs design 

for an experiment and 

explain the purpose of 

blocking in an 

experiment. 


Explain the concept of 4.3 271 
sampling variability 

when making an 

inference about a 

population and how 

sample size affects 

sampling variability. 

Explain the meaning 4.3 274 
of statistically 

significant in the 

context of an 

experiment and use 

simulation to 

determine if the results 

of an experiment are 

statistically 

significant. 
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Identify when it is 4.3 276 R4.7 
appropriate to make an 

inference about a 

population and when it 

is appropriate to make 

an inference about 

cause and effect. 


Evaluate if a statistical 4.3 278 R4.10 
study has been carried 

out in an ethical 

manner.i 


iThis is an important topic, but it is not required for the AP® Statistics exam. 
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Chapter 4 Review Exercises 


R4.1 |) Nurses are the best A recent random sample of n = 805 adult 


U.S. residents found that the proportion who rated the honesty and 
ethical standards of nurses as high or very high is 0.85. This is 0.15 
higher than the proportion recorded for doctors, the next highest- 


ranked profession.22 


a. Identify the sample and the population in this setting. 


b. Do you think that the proportion of all U.S. residents who would 
rate the honesty and ethical standards of nurses as high or very 
high is exactly 0.85? Explain your answer. 


c. What is the benefit of increasing the sample size in this context? 


R4.2 |») Parking problems The administration at a high school with 


1800 students wants to gather student opinion about parking for 
students on campus. It isn’t practical to contact all students. 


a. Give an example of a way to choose a voluntary response sample 
of students. Explain how this method could lead to bias. 


b. Give an example of a way to choose a convenience sample of 
students. Explain how this method could lead to bias. 


c. Describe how to select an SRS of 50 students from the school. 


d. Explain how the method you described in part (c) avoids the 
biases you described in parts (a) and (b). 


R4.3 |») Surveying NBA fans The manager of a sports arena wants to 


learn more about the financial status of the people who are attending 
an NBA basketball game. He would like to give a survey to a 
representative sample of about 10% of the fans in attendance. Ticket 
prices for the game vary a great deal: seats near the court cost over 
$200 each, while seats in the top rows of the arena cost $50 each. 
The arena is divided into 50 numbered sections, from 101 to 150. 
Each section has rows of seats labeled with letters from A (nearest 
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the court) to ZZ (top row of the arena). 


a. Explain why it might be difficult to give the survey to an SRS of 
fans. 


b. Explain why it would be better to select a stratified random 
sample using the lettered rows rather than the numbered sections 
as strata. What is the benefit of using a stratified sample in this 
context? 


c. Explain how to select a cluster sample of fans. What is the 
benefit of using a cluster sample in this context? 


R4.4 |») Been to the movies? An opinion poll calls 2000 randomly 


chosen residential telephone numbers, then asks to speak with an 
adult member of the household. The interviewer asks, “Box office 
revenues are at an all-time high. How many movies have you 
watched in a movie theater in the past 12 months?” In all, 1131 
people responded. The researchers used the responses to estimate the 
mean number of movies adults had watched in a movie theater over 
the past 12 months. 


a. Describe a potential source of bias related to the wording of the 
question. Suggest a change that would help fix this problem. 


b. Describe how using only residential phone numbers might lead to 
bias and how this will affect the estimate. 


c. Describe how nonresponse might lead to bias and how this will 
affect the estimate. 


R4.5 |p) Are anesthetics safe? The National Halothane Study was a 


major investigation of the safety of anesthetics used in surgery. 
Records of over 850,000 operations performed in 34 major hospitals 


showed the following death rates for four common anesthetics: 
Anesthetic A B C D 


Death rate 1.7% 1.7% 3.4% 1.9% 


There seems to be a clear association between the anesthetic used 
and the death rate of patients. Anesthetic C appears to be more 
dangerous. 
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a. Explain why we call the National Halothane Study an 
observational study rather than an experiment, even though it 
compared the results of using different anesthetics in actual 
surgery. 


b. Identify the explanatory and response variables in this study. 


c. When the study looked at other variables that are related to a 
doctor’s choice of anesthetic, it found that Anesthetic C was not 
causing extra deaths. Explain the concept of confounding in this 
context and identify a variable that might be confounded with the 
doctor’s choice of anesthetic. 


R4.6 |») Ugly fries Few people want to eat discolored french fries. To 


prevent spoiling and to preserve flavor, potatoes are kept 
refrigerated before being cut for french fries. But immediate 
processing of cold potatoes causes discoloring due to complex 
chemical reactions. The potatoes must therefore be brought to room 
temperature before processing. Researchers want to design an 
experiment in which tasters will rate the color and flavor of french 
fries prepared from several groups of potatoes. The potatoes will be 
freshly picked or stored for a month at room temperature or stored 
for a month refrigerated. They will then be sliced and cooked either 
immediately or after an hour at room temperature. 


a. Identify the experimental units, the factors, the number of levels 
for each factor, and the treatments. 


b. Describe a completely randomized design for this experiment 
using 300 potatoes. 


c. A single supplier has made 300 potatoes available to the 
researchers. Describe a statistical benefit and a statistical 
drawback of using potatoes from only one supplier. 


d. The researchers decided to do a follow-up experiment using 
potatoes from several different suppliers. Describe how they 
should change the design of the experiment to account for the 
addition of other suppliers. 


R4.7 |) Don’t catch a cold! A recent study of 1000 students at the 


630 


University of Michigan investigated how to prevent catching the 
common cold. The students were randomly assigned to three 
different cold prevention methods for 6 weeks. Some wore masks, 
some wore masks and used hand sanitizer, and others took no 
precautions. The two groups who used masks reported 10-50% 


fewer cold symptoms than those who did not wear a mask.22 


a. Does this study allow for inference about a population? Explain 
your answer. 


b. Does this study allow for inference about cause and effect? 
Explain your answer. 


R4.8 |») An herb for depression? Does the herb St. John’s wort relieve 


major depression? Here is an excerpt from the report of one study of 
this issue: “Design: Randomized, Double-Blind, Placebo-Controlled 
Clinical Trial.”22 The study concluded that the difference in 
effectiveness of St. John’s wort and a placebo was not statistically 
significant. 


a. Describe the placebo effect in this context. How did the design of 
this experiment account for the placebo effect? 


b. Explain the purpose of the random assignment. 
c. Why is a double-blind design a good idea in this setting? 
d. Explain what “not statistically significant” means in this context. 


R4.9 |») How long did I work? A psychologist wants to know if the 


difficulty of a task influences our estimate of how long we spend 
working at it. She designs two sets of mazes that subjects can work 
through on a computer. One set has easy mazes and the other has 
difficult mazes. Subjects work until told to stop (after 6 minutes, but 
subjects do not know this). They are then asked to estimate how 
long they worked. The psychologist has 30 students available to 
serve as subjects. 


a. Describe an experiment using a completely randomized design to 
learn the effect of difficulty on estimated time. Make sure to 
carefully explain your method of assigning treatments. 
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b. Describe a matched pairs experimental design using the same 30 
subjects. 


c. Which design would be more likely to detect a difference in the 
effects of the treatments? Explain your answer. 


R4.1 01 ») Deceiving subjects Students sign up to be subjects in a 


psychology experiment. When they arrive, they are told that 
interviews are running late and are taken to a waiting room. The 
experimenters then stage the theft of a valuable object left in the 
waiting room. Some subjects are alone with the thief, and others are 
present in pairs—these are the treatments being compared. Will the 
subject report the theft? 


a. The students had agreed to take part in an unspecified study, and 
the true nature of the experiment is explained to them afterward. 
Does this meet the requirement of informed consent? Explain 
your answer. 


b. What two other ethical principles should be followed in this 
study? 
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Chapter 4 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T4.1 When we take a census, we attempt to collect data from 
a. a Stratified random sample. 
b. every individual chosen in a simple random sample. 
c. every individual in the population. 
d. a voluntary response sample. 
e. a convenience sample. 


T4.2 You want to take a simple random sample (SRS) of 50 of the 816 
students who live in a dormitory on campus. You label the 
students 001 to 816 in alphabetical order. In the table of random 
digits, you read the entries 


95592 94007 69769 33547 72450 16632 81194 14873 


The first three students in your sample have labels 
a. 955, 929, 400. 
b. 400, 769, 769. 
c. 559, 294, 007. 
d. 929, 400, 769. 
e. 400, 769, 335. 


T4.3 A study of treatments for angina (pain due to low blood supply to 
the heart) compared bypass surgery, angioplasty, and use of drugs. 
The study looked at the medical records of thousands of angina 
patients whose doctors had chosen one of these treatments. It 
found that the average survival time of patients given drugs was 
the highest. What do you conclude? 


a. This study proves that drugs prolong life and should be the 


633 


treatment of choice. 


. We can conclude that drugs prolong life because the study was 


a comparative experiment. 


We can’t conclude that drugs prolong life because the patients 
were volunteers. 


We can’t conclude that drugs prolong life because the groups 
might differ in ways besides the treatment. 


We can’t conclude that drugs prolong life because no placebo 
was used. 


T4.4 Tonya wanted to estimate the average amount of time that students 
at her school spend on Facebook each day. She gets an 
alphabetical roster of students in the school from the registrar’s 
office and numbers the students from 1 to 1137. Then Tonya uses 
a random number generator to pick 30 distinct labels from 1 to 
1137. She surveys those 30 students about their Facebook use. 
Tonya’s sample is a simple random sample because 


a. 
b. 


Cc. 


d. 


e. 


it was selected using a chance process. 
it gave every individual the same chance to be selected. 


it gave every possible sample of size 30 an equal chance to be 
selected. 


it doesn’t involve strata or clusters. 


it is guaranteed to be representative of the population. 


T4.5 Consider an experiment to investigate the effectiveness of different 
insecticides in controlling pests and their impact on the 
productivity of tomato plants. What is the best reason for 
randomly assigning treatment levels (spraying or not spraying) to 
the experimental units (farms)? 


a. 


Random assignment eliminates the effects of other variables, 
like soil fertility. 


Random assignment eliminates chance variation in the 
responses. 


Random assignment allows researchers to generalize 
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conclusions about the effectiveness of the insecticides to all 
farms. 


d. Random assignment will tend to average out all other 
uncontrolled factors such as soil fertility so that they are not 
confounded with the treatment effects. 


e. Random assignment helps avoid bias due to the placebo effect. 


T4.6 Researchers randomly selected 1700 people from Canada who had 
never suffered a heart attack and rated the happiness of each 
person. Ten years later, the researchers followed up with each 
person and found that people who were initially rated as happy 
were less likely to have a heart problem.2 Which of the following 
is the most appropriate conclusion based on this study? 


a. Happiness causes better heart health for all people. 
b. Happiness causes better heart health for Canadians. 


c. Happiness causes better heart health for the 1700 people in the 
study. 


d. Happier people in Canada are less likely to have heart 
problems. 


e. Happier people in the study were less likely to have heart 
problems. 


T4.7 A TV station wishes to obtain information on the TV viewing 
habits in its market area. The market area contains one city of 
population 170,000, another city of 70,000, and four towns of 
about 5000 residents each. The station suspects that the viewing 
habits may be different in larger and smaller cities and in the rural 
areas. Which of the following sampling designs would yield the 
type of information the station requires? 


a. A stratified sample from the cities and towns in the market area 
b. A cluster sample using the cities and towns as clusters 


c. A convenience sample from the market area 


a 


A simple random sample from the market area 


e. An online poll that invites all people from the cities and towns 
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in the market area to participate 


T4.8 Bias in a sampling method is 


a. 


e. 


any difference between the sample result and the truth about the 
population. 


the difference between the sample result and the truth about the 
population due to using chance to select a sample. 


any difference between the sample result and the truth about the 
population due to practical difficulties such as contacting the 
subjects selected. 


any difference between the sample result and the truth about the 
population that tends to occur in the same direction whenever 
you use this sampling method. 


racism or sexism on the part of those who take the sample. 


T4.9 You wonder if TV ads are more effective when they are longer or 
repeated more often or both. So you design an experiment. You 
prepare 30-second and 60-second ads for a camera. Your subjects 
all watch the same TV program, but you assign them at random to 
four groups. One group sees the 30-second ad once during the 
program; another sees it three times; the third group sees the 60- 
second ad once; and the last group sees the 60-second ad three 
times. You ask all subjects how likely they are to buy the camera. 
Which of the following best describes the design of this 
experiment? 


a. 


This is a randomized block design, but not a matched pairs 
design. 


This is a matched pairs design. 


This is a completely randomized design with one explanatory 
variable (factor). 


This is a completely randomized design with two explanatory 
variables (factors). 


This is a completely randomized design with four explanatory 
variables (factors). 
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T4.10 Can texting make you healthier? Researchers randomly assigned 
700 Australian adults to either receive usual health care or usual 
heath care plus automated text messages with positive messages, 
such as “Walking is cheap. It can be done almost anywhere. All 
you need is comfortable shoes and clothing.” The group that 
received the text messages showed a Statistically significant 
increase in physical activity.22 What is the meaning of “statistically 
significant” in this context? 


a. The results of this study are very important. 
b. The results of this study should be generalized to all people. 


c. The difference in physical activity for the two groups is greater 
than 0. 


d. The difference in physical activity for the two groups is very 
large. 


e. The difference in physical activity for the two groups is larger 
than the difference that could be expected to happen by chance 
alone. 


T4.11 You want to know the opinions of American high school teachers 
on the issue of establishing a national proficiency test as a 
prerequisite for graduation from high school. You obtain a list of 
all high school teachers belonging to the National Education 
Association (the country’s largest teachers’ union) and mail a 
survey to arandom sample of 2500 teachers. In all, 1347 of the 
teachers return the survey. Of those who responded, 32% say that 
they favor some kind of national proficiency test. Which of the 
following statements about this situation is true? 


a. Because random sampling was used, we can feel confident that 
the percent of all American high school teachers who would say 
they favor a national proficiency test is close to 32%. 


b. We cannot trust these results, because the survey was mailed. 
Only survey results from face-to-face interviews are considered 
valid. 


c. Because over half of those who were mailed the survey actually 
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responded, we can feel fairly confident that the actual percent of 
all American high school teachers who would say they favor a 
national proficiency test is close to 32%. 


d. The results of this survey may be affected by undercoverage 
and nonresponse. 


e. The results of this survey cannot be trusted due to voluntary 
response bias. 


Section II: Free Response Show all your work. Indicate clearly the 
methods you use, because you will be graded on the correctness of your 
methods as well as on the accuracy and completeness of your results and 
explanations. 


T4.12 Elephants sometimes damage trees in Africa. It turns out that 
elephants dislike bees. They recognize beehives in areas where 
they are common and avoid them. Can this information be used to 
keep elephants away from trees? Researchers want to design an 
experiment to answer these questions using 72 acacia trees and 


three treatments: active hives, empty hives, and no hives.Z2 


a. Identify the experimental units in this experiment. 


b. Explain why it is beneficial to include some trees that have no 
hives. 


c. Describe how the researchers could carry out a completely 
randomized design for this experiment. Include a description of 
how the treatments should be assigned. 


T4.13 Google and Gallup teamed up to survey a random sample of 1673 
U.S. students in grades 7-12. One of the questions was “How 
confident are you that you could learn computer science if you 
wanted to?” Overall, 54% of students said they were very 


confident.22 


a. Identify the population and the sample. 


b. Explain why it was better to randomly select the students rather 
than putting the survey question on a website and inviting 
students to answer the question. 
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c. Do you expect that the percent of all U.S. students in grades 7— 
12 who would say “very confident” is exactly 54%? Explain 
your answer. 


d. The report also broke the results down by gender. For this 
question, 62% of males and 48% of females said they were very 
confident. Which of the three estimates (54%, 62%, 48%) do 
you expect is closest to the value it is trying to estimate? 
Explain your answer. 


T4.14 Many people start their day with a jolt of caffeine from coffee or a 
soft drink. Most experts agree that people who consume large 
amounts of caffeine each day may suffer from physical withdrawal 
symptoms if they stop ingesting their usual amounts of caffeine. 
Researchers recruited 11 volunteers who were caffeine dependent 
and who were willing to take part in a caffeine withdrawal 
experiment. The experiment was conducted on two 2-day periods 
that occurred one week apart. During one of the 2-day periods, 
each subject was given a capsule containing the amount of caffeine 
normally ingested by that subject in one day. During the other 
study period, the subjects were given placebos. The order in which 
each subject received the two types of capsules was randomized. 
The subjects’ diets were restricted during each of the study periods. 
At the end of each 2-day study period, subjects were evaluated 
using a tapping task in which they were instructed to press a button 
200 times as fast as they could.24 


a. Identify the explanatory and response variables in this 
experiment. 


b. How was blocking used in the design of this experiment? What 
is the benefit of blocking in this context? 


c. Researchers randomized the order of the treatments to avoid 
confounding. Explain how confounding might occur if the 
researchers gave all subjects the placebo first and the caffeine 
second. In this context, what problem does confounding cause? 


d. Could this experiment have been carried out in a double-blind 
manner? Explain your answer. 
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Chapter 4 Project Response Bias 


In this project, your team will design and conduct an experiment to 


investigate the effects of response bias in surveys.22 You may choose the 
topic for your surveys, but you must design your experiment so that it can 
answer at least one of the following questions. 


Does the wording of a question affect the response? 

Do the characteristics of the interviewer affect the response? 
Does anonymity change the responses to sensitive questions? 
Does manipulating the answer choices affect the response? 


Can revealing other peoples’ answers to a question change the response? 


. Write a proposal describing the design of your experiment. Be sure to 


include the following items: 


a. Your chosen topic and which of the above questions you’ ll try to 
answer. 


b. A detailed description of how you will obtain your subjects 
(minimum of 50). Your plan must be practical! 


c. An explanation of the treatments in your experiment and how you 
will determine which subjects get which treatment. 


d. A clear explanation of how you will implement your design. 
e. Precautions you will take to collect data ethically. 


Here are two examples of successful student experiments. 


“Make-Up,” by Caryn S. and Trisha T. (all questions asked to 
males): 


i. “Do you find females who wear makeup attractive?” (Questioner 
wearing makeup: 75% answered “Yes.”) 


ii. “Do you find females who wear makeup attractive?” (Questioner 
not wearing makeup: 30% answered “Yes.”) 


“Cartoons” by Sean W. and Brian H.: 
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i. “Do you watch cartoons?” (90% answered “Yes.”) 
ii. “Do you still watch cartoons?” (60% answered “Yes.”) 


. Once your teacher has approved your design, carry out the experiment. 
Record your data in a table. 


. Analyze your data. What conclusion do you draw? Provide appropriate 
graphical and numerical evidence to support your answer. 


. Prepare a report that includes the data you collected, your analysis from 
Step 3, and a discussion of any problems you encountered and how you 
dealt with them. 
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Cumulative AP® Practice Test 1 


Section I: Multiple Choice Choose the best answer for Questions AP1.1— 
AP1.14. 


AP1.1 You look at real estate ads for houses in Sarasota, Florida. Many 
houses have prices from $200,000 to $400,000. The few houses on 
the water, however, have prices up to $15 million. Which of the 
following statements best describes the distribution of home prices 
in Sarasota? 


a. The distribution is most likely skewed to the left, and the mean 
is greater than the median. 


b. The distribution is most likely skewed to the left, and the mean 
is less than the median. 


c. The distribution is roughly symmetric with a few high outliers, 
and the mean is approximately equal to the median. 


d. The distribution is most likely skewed to the right, and the mean 
is greater than the median. 


e. The distribution is most likely skewed to the right, and the mean 
is less than the median. 


AP1.2 A child is 40 inches tall, which places her at the 90th percentile of 
all children of similar age. The heights for children of this age form 
an approximately Normal distribution with a mean of 38 inches. 
Based on this information, what is the standard deviation of the 
heights of all children of this age? 


a. 0.20 inch 
b. 0.31 inch 
c. 0.65 inch 
d. 1.21 inches 
e. 1.56 inches 


AP1.3 A large set of test scores has mean 60 and standard deviation 18. If 
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each score is doubled, and then 5 is subtracted from the result, the 
mean and standard deviation of the new scores are 


a. mean 115 and standard deviation 31. 
b. mean 115 and standard deviation 36. 
c. mean 120 and standard deviation 6. 

d. mean 120 and standard deviation 31. 
e. mean 120 and standard deviation 36. 


AP1.4 For a certain experiment, the available experimental units are eight 
rats, of which four are female (F1, F2, F3, F4) and four are male 
(M1, M2, M3, M4). There are to be four treatment groups, A, B, C, 
and D. If a randomized block design is used, with the experimental 
units blocked by gender, which of the following assignments of 
treatments is impossible? 


a. A = (Fl, M1), B = (F2, M2), C = (F3, M3), D — (F4, M4) 
b. A = (F1, M2), B = (F2, M3), C = (F3, M4), D — (F4, M1) 
c. A = (Fl, M2), B = (F3, F2), C = (F4, M1), D = (M3, M4) 
d. A = (F4, M1), B = (F2, M3), C = (F3, M2), D = (F1, M4) 
e. A = (F4, M1), B = (F1, M4), C = (F3, M2), D = (F2, M3) 


AP1.5 For a biology project, you measure the weight in grams (g) and the 
tail length in millimeters (mm) of a group of mice. The equation of 
the least-squares line for predicting tail length from weight is 


predicted tail length = 20 + 3 x weight 
predicted tail length = 20+ 3 x weight 


Which of the following is not correct? 


a. The slope is 3, which indicates that a mouse’s predicted tail 
length should increase by about 3 mm for each additional gram 
of weight. 


b. The predicted tail length of a mouse that weighs 38 grams is 
134 millimeters. 


c. By looking at the equation of the least-squares line, you can see 
that the correlation between weight and tail length is positive. 
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d. If you had measured the tail length in centimeters instead of 
millimeters, the slope of the regression line would have been 


3/10 = 0.3. 
e. Mice that have a weight of 0 grams will have a tail of length 20 
mm. 


AP1.6 The figure shows a Normal density curve. Which of the following 
gives the best estimates for the mean and standard deviation of this 
Normal distribution? 


I T I I I I 
50 100 =6150)§ 620006250 §=696300)§=6350)3Ss 400 


a. HW = 200, 0= 50 
b. p= 200, 0= 25 
c. w= 225,0=50 
d. p=225,0=25 
e. w= 225,0= 275 


AP1.7 The owner of a chain of supermarkets notices that there is a 
positive correlation between the sales of beer and the sales of ice 
cream over the course of the previous year. During seasons when 
sales of beer were above average, sales of ice cream also tended to 
be above average. Likewise, during seasons when sales of beer 
were below average, sales of ice cream also tended to be below 
average. Which of the following would be a valid conclusion from 
these facts? 


a. Sales records must be in error. There should be no association 
between beer and ice cream sales. 


b. Evidently, for a significant proportion of customers of these 
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supermarkets, drinking beer causes a desire for ice cream or 
eating ice cream causes a thirst for beer. 


A scatterplot of monthly ice cream sales versus monthly beer 
sales would show that a straight line describes the pattern in the 
plot, but it would have to be a horizontal line. 


There is a clear negative association between beer sales and ice 
cream sales. 


The positive correlation is most likely a result of the variable 
temperature; that is, as temperatures increase, so do both beer 
sales and ice cream sales. 


AP1.8 Here are the IQ scores of 10 randomly chosen fifth-grade students: 


145 139 126 122 125 130 96 = 110 118 118 


Which of the following statements about this data set is not true? 


a. 


The student with an IQ of 96 is considered an outlier by the 1.5 
<x IQR rule. 


. The five-number summary of the 10 IQ scores is 96, 118, 123.5, 


130, 145. 


If the value 96 were removed from the data set, the mean of the 
remaining 9 IQ scores would be greater than the mean of all 10 
IQ scores. 


. If the value 96 were removed from the data set, the standard 


deviation of the remaining 9 IQ scores would be less than the 
standard deviation of all 10 IQ scores. 


If the value 96 were removed from the data set, the JQR of the 
remaining 9 IQ scores would be less than the JQR of all 10 IQ 
scores. 


AP1.9 Before he goes to bed each night, Mr. Kleen pours dishwasher 
powder into his dishwasher and turns it on. Each morning, Mrs. 
Kleen weighs the box of dishwasher powder. From an examination 
of the data, she concludes that Mr. Kleen dispenses a rather 
consistent amount of powder each night. Which of the following 
statements is true? 
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I. There is a high positive correlation between the number of 
days that have passed since the box of dishwasher powder was 
opened and the amount of powder left in the box. 


II. A scatterplot with days since purchase as the explanatory 
variable and total amount of dishwasher powder used as the 
response variable would display a strong positive association. 

III. The correlation between the amount of powder left in the box 
and the amount of powder used should be -1. 


a. I only 
b. II only 

c. Ill only 

d. Il and III only 
e. I, II, and III 


.P1.10 The General Social Survey (GSS), conducted by the National 
Opinion Research Center at the University of Chicago, is a major 
source of data on social attitudes in the United States. Once each 
year, 1500 adults are interviewed in their homes all across the 
country. The subjects are asked their opinions about sex and 
marriage; attitudes toward women, welfare, foreign policy; and 
many other issues. The GSS begins by selecting a sample of 
counties from the 3000 counties in the country. The counties are 
divided into urban, rural, and suburban; a separate sample of 
counties is chosen at random from each group. This is a 


a. simple random sample. 

b. systematic random sample. 
c. cluster sample. 

d. stratified random sample. 
e. voluntary response sample. 


.P1.11 You are planning an experiment to determine the effect of the brand 
of gasoline and the weight of a car on gas mileage measured in 
miles per gallon. You will use a single test car, adding weights so 
that its total weight is 3000, 3500, or 4000 pounds. The car will 
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drive on a test track at each weight using each of Amoco, 
Marathon, and Speedway gasoline. Which is the best way to 
organize the study? 


a. 


Start with 3000 pounds and Amoco and run the car on the test 
track. Then do 3500 and 4000 pounds. Change to Marathon and 
go through the three weights in order. Then change to Speedway 
and do the three weights in order once more. 


. Start with 3000 pounds and Amoco and run the car on the test 


track. Then change to Marathon and then to Speedway without 
changing the weight. Then add weights to get 3500 pounds and 
go through the three gasolines in the same order. Then change to 
4000 pounds and do the three gasolines in order again. 


Choose a gasoline at random, and run the car with this gasoline 
at 3000, 3500, and 4000 pounds in order. Choose one of the two 
remaining gasolines at random and again run the car at 3000, 
then 3500, then 4000 pounds. Do the same with the last gasoline. 


. There are nine combinations of weight and gasoline. Run the car 


several times using each of these combinations. Make all these 
runs in random order. 


. Randomly select an amount of weight and a brand of gasoline, 


and run the car on the test track. Repeat this process a total of 30 
times. 


.P1.12 A linear regression was performed using the following five data 
points: A(2, 22), B(10, 4), C(6, 14), D(14, 2), E(18, —4). The 
residual for which of the five points has the largest absolute value? 


Cc. 


d. 


e. 


a. 
b. 


A 
B 
C 
D 
E 


.P1.13 The frequency table summarizes the distribution of time that 140 
patients at the emergency room of a small-city hospital waited to 
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receive medical attention during the last month. 


Waiting time Frequency 
Less than 10 minutes 5 

At least 10 but less than 20 minutes 24 

At least 20 but less than 30 minutes 45 

At least 30 but less than 40 minutes 38 

At least 40 but less than 50 minutes 19 

At least 50 but less than 60 minutes 

At least 60 but less than 70 minutes 2 


Which of the following represents possible values for the median 
and JQR of waiting times for the emergency room last month? 


a. median = 27 minutes and JQR = 15 minutes 
b. median = 28 minutes and JQR = 25 minutes 
c. median = 31 minutes and JQR = 35 minutes 
d. median = 35 minutes and JQR = 45 minutes 
e. median = 45 minutes and JQR = 55 minutes 


.P1.14 Boxplots of two data sets are shown. 


Plot 1] +} 


Based on the boxplots, which of the following is true? 


a. The range of both plots is about the same. 
b. The means of both plots are approximately equal. 


c. Plot 2 contains more data points than Plot 1. 


jar 


. The medians are approximately equal. 


e. Plot 1 is more symmetric than Plot 2. 


Section II: Free Response Show all your work. Indicate clearly the 
methods you use, because you will be graded on the correctness of your 
methods as well as on the accuracy and completeness of your results and 
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explanations. 


.P1.15 The manufacturer of exercise machines for fitness centers has 
designed two new elliptical machines that are meant to increase 
cardiovascular fitness. The two machines are being tested on 30 
volunteers at a fitness center near the company’s headquarters. The 
volunteers are randomly assigned to one of the machines and use it 
daily for two months. A measure of cardiovascular fitness is 
administered at the start of the experiment and again at the end. 
The following stemplot contains the differences (After — Before) in 
the two scores for the two machines. Note that higher scores 
indicate larger gains in fitness. 


Machine A Machine B 


2 
0 (Key: 2|1 represents a 
) 


54 
876320 
97411 
61 


159 difference (After — Before 
2489 \ ot 21 in fitness scores. 


257 
359 


a. Write a few sentences comparing the distributions of 
cardiovascular fitness gains from the two elliptical machines. 


b. Which machine should be chosen if the company wants to 
advertise it as achieving the highest overall gain in 
cardiovascular fitness? Explain your reasoning. 


c. Which machine should be chosen if the company wants to 
advertise it as achieving the most consistent gain in 
cardiovascular fitness? Explain your reasoning. 


d. Give one reason why the advertising claims of the company (the 
scope of inference) for this experiment would be limited. 
Explain how the company could broaden that scope of inference. 


.P1.16 Those who advocate for monetary incentives in a work environment 
claim that this type of incentive has the greatest appeal because it 
allows the winners to do what they want with their winnings. Those 
in favor of tangible incentives argue that money lacks the 
emotional appeal of, say, a weekend for two at a romantic country 
inn or elegant hotel, or a weeklong trip to Europe. 
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A few years ago a national tire company, in an effort to improve 
sales of a new line of tires, decided to test which method—offering 
cash incentives or offering non-cash prizes such as vacations—was 
more successful in increasing sales. The company had 60 retail 
sales districts of various sizes across the country and data on the 
previous sales volume for each district. 


a. Describe a completely randomized design using the 60 retail 
sales districts that would help answer this question. 


b. Explain how you would use the following excerpt from the table 
of random digits to do the random assignment that your design 
requires. Then use your method to make the first three 
assignments. 


07511 88915 41267 16853 84569 79367 32337 03316 
81486 69487 60513 09297 00412 71238 27649 39950 


c. One of the company’s officers suggested that it would be better 
to use a matched pairs design instead of a completely 
randomized design. Explain how you would change your design 
to accomplish this. 


.P1.17 In retail stores, there is a lot of competition for shelf space. There 
are national brands for most products, and many stores carry their 
own line of in-house brands, too. Because shelf space is not 
infinite, the question is how many linear feet to allocate to each 
product and which shelf (top, bottom, or somewhere in the middle) 
to put it on. The middle shelf is the most popular and lucrative, 
because many shoppers, if undecided, will simply pick the product 
that is at eye level. 


A local store that sells many upscale goods is trying to determine 
how much shelf space to allocate to its own brand of men’s 
personal-grooming products. The middle shelf space is randomly 
varied between 3 and 6 linear feet over the next 12 weeks, and 
weekly sales revenue (in dollars) from the store’s brand of 
personal-grooming products for men is recorded. Here is some 
computer output from the study, along with a scatterplot: 
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Predictor Coef SE Coef T P 


Constant 317.940 31.32 10.15 0.000 
Shelf length 152.680 6.445 23.69 0.000 
S = 22.9212 R-Sq = 98.2% R-Sq(adj) = 98.1% 
1300 + 2 
1200 a a 
@ 1100 - r ° 
4 1000 + 7 : 
900 ~ 
8004 « i 


T T T l i 
3.0 3.5 4.0 4.5 5.0 55 6.0 


Shelf length (ft) 


a. Describe the relationship between shelf length and sales. 


b. Write the equation of the least-squares regression line. Be sure to 
define any variables you use. 


c. If the store manager were to decide to allocate 5 linear feet of 
shelf space to the store’s brand of men’s grooming products, 
what is the best estimate of the weekly sales revenue? 


d. Interpret the value of s. 
e. Identify and interpret the coefficient of determination. 


.P1.18 The manager of the store in the preceding exercise calculated the 
residual for each point in the scatterplot and made a dotplot of the 
residuals. The distribution of residuals is roughly Normal with a 
mean of $0 and standard deviation of $22.92. 


a. What percent of the actual sales amounts do you expect to be 
within $5 of their expected sales amount? 


b. The middle 95% of residuals should be between which two 
values? Use this information to give an interval of plausible 
values for the weekly sales revenue if 5 linear feet are allocated 
to the store’s brand of men’s grooming products. 
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Chapter 3 Probability: What Are 


the Chances? | / 
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INTRODUCTION 


Chance is all around us. You and your friend play rock-paper-scissors to 
determine who gets the last slice of pizza. A coin toss decides which team 
gets to receive the ball first in a football game. Many adults regularly play 
the lottery, hoping to win a big jackpot with a few lucky numbers. Others 
head to casinos or racetracks, hoping that some combination of luck and 
skill will pay off. People young and old play games of chance involving 
cards or dice or spinners. The traits that children inherit—hair and eye 
color, blood type, handedness, dimples, whether they can roll their tongues 
—are determined by the chance involved in which genes their parents pass 
along. 

The mathematics of chance behavior is called probability. Probability 
is the topic of this chapter. Here is an activity that gives you some idea of 
what lies ahead. 


ACTIVITY The “1 in 6 wins” game 


In this activity, you and your classmates will use simulation to test 
whether a company’s claim is believable. 


As a special promotion for its 20-ounce bottles of soda, a soft drink 
company printed a message on the inside of each bottle cap. Some of the 
caps said, “Please try again!” while others said, “You’re a winner!” The 
company advertised the promotion with the slogan “1 in 6 wins a prize.” 
The prize is a free 20-ounce bottle of soda. 

Grayson’s statistics class wonders if the company’s claim holds true 
for the bottles at a nearby convenience store. To find out, all 30 students 
in the class go to the store and each student buys one 20-ounce bottle of 
the soda. Two of them get caps that say, “You’re a winner!” Does this 
result give convincing evidence that the company’s 1-in-6 claim is 
inaccurate? 

For now, let’s assume that the company is telling the truth, and that 
every 20-ounce bottle of soda it fills has a 1-in-6 chance of getting a cap 
that says, “You’re a winner!” We can model the status of an individual 
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bottle with a six-sided die: let 1 through 5 represent “Please try again!” 
and 6 represent “You’re a winner!” 


1. Roll your die 30 times to imitate the process of the students in 
Grayson’s statistics class buying their sodas. How many of them won 
a prize? 

2. Your teacher will draw and label axes for a class dotplot. Plot on the 
graph the number of prize winners you got in Step 1. 


3. Repeat Steps 1 and 2, if needed, to get a total of at least 40 repetitions 
of the simulation for your class. 


4. Discuss the results with your classmates. What percent of the time did 
the simulation yield two or fewer prize winners in a class of 30 
students, just by chance? Does it seem plausible (believable) that the 
company is telling the truth, but that the class just got unlucky? Or is 
there convincing evidence that the 1-in-6 claim is wrong? Explain 
your reasoning. 


NIMA Stock/Alamy 


As the activity shows, simulation is a powerful method for modeling 
chance behavior. Section 5.1 begins by examining the idea of probability 
and then illustrates how simulation can be used to estimate probabilities. In 
Sections 5.2 and 5.3, we develop the basic rules and techniques of 
probability. 

Probability calculations are the basis for inference. When we produce 
data by random sampling or randomized comparative experiments, the 
laws of probability answer the question “What would happen if we 
repeated the random sampling or random assignment process many 
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times?” Many of the examples, exercises, and activities in this chapter 
focus on the connection between probability and inference. 
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SECTION 5.1 Randomness, Probability, and 
Simulation 


LEARNING TARGETS. By the end of the section, you should be able to: 


e Interpret probability as a long-run relative frequency. 


e Use simulation to model chance behavior. 


Imagine tossing a coin 10 times. How likely are you to get a run of 3 or 
more consecutive heads? An airline knows that a certain percent of 
customers who purchase tickets will not show up for a flight. If the airline 
overbooks a particular flight, what are the chances that they’!l have enough 
seats for the passengers who show up? A couple plans to have children 
until they have at least one boy and one girl. How many children should 
they expect to have? To answer these questions, you need a better 
understanding of how chance behavior operates. 


The Idea of Probability 


In football, a coin toss helps determine which team gets the ball first. Why 
do the rules of football require a coin toss? Because tossing a coin seems a 
“fair” way to decide. What does that mean exactly? The following activity 
should help shed some light on this question. 


ACTIVITY what is probability? 


If you toss a fair coin, what’s the probability that it shows heads? It’s 
1/2, or 0.5, right? But what does probability 1/2 really mean? In this 
activity, you will investigate by flipping a coin several times. 


1. Toss your coin once. Record whether you get heads or tails. 


2. Toss your coin a second time. Record whether you get heads or tails. 
What proportion of your first two tosses is heads? 
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3. Toss your coin 8 more times so that you have 10 tosses in all. Record 


whether you get heads or tails on each toss in a table like the one that 
follows. 


4. Calculate the overall proportion of heads after each toss and record 
these values in the bottom row of the table. For instance, suppose you 
got tails on the first toss and heads on the second toss. Then your 
overall proportion of heads would be 0/1 = 0.00 after the first toss and 
1/2 = 0.50 after the second toss. 


mes +2] 3]*[5[*]7|*[s][o 


rear | | | | ||.) 
Proportion afheads | | | | | | | | | | 


pePLe> 


5. ¢€ Let’s use technology to speed things up. Launch the Idea of 


Probability applet at highschool.bfwpub.com/tps6e. Set the number 
of tosses to 10 and click toss. What proportion of the tosses were 
heads? Click “Reset” and toss the coin 10 more times. What 
proportion of heads did you get this time? Repeat this process several 
more times. What do you notice? 


Probability of Heads: 0.5 Cl 


Heads = 4/10 = 0.400 
rH 


Number of Tosses: 10 
(eee 1 


Tails = 6/10 = 0.600 
™ Show true probability 
0.7 
Heads 
0.5 
0.4 


10 
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What if you toss the coin 100 times? Reset the applet and have it do 
100 tosses. Is the proportion of heads exactly equal to 0.5? Close to 
0.5? 


. Keep on tossing without hitting “Reset.” What happens to the 


proportion of heads? 


. Asa class, discuss what the following statement means: “If you toss a 


fair coin, the probability of heads is 0.5.” 


If you toss a coin, it can land heads or tails. If you “toss” a thumbtack, 
it can land with the point sticking up or with the point down. Does 
that mean the probability of a tossed thumbtack landing point up is 
0.5? How can you find out? Discuss with your classmates. 


Figure 5.1 shows some results from the preceding activity. The 


proportion of tosses that land heads varies from 0.30 to 1.00 in the first 10 
tosses. As we make more and more tosses, however, the proportion of 
heads gets closer to 0.5 and stays there. 


Proportion of heads 


Proportion of heads 


ny 0 


l 10 | 500 


Number of tosses (b) Number of tosses 


FIGURE 5.1 (a) The proportion of heads in the first 10 tosses of a coin. (b) The 
proportion of heads in the first 500 tosses of a coin. 


When we watch coin tosses or the results of random sampling and 


random assignment closely, a remarkable fact emerges: Chance behavior 
is unpredictable in the short run but has a regular and predictable pattern 
in the long run. This is the basis for the idea of probability. 


DEFINITION Probability 
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The probability of any outcome of a chance process is a number 
between 0 and 1 that describes the proportion of times the outcome 
would occur in a very long series of repetitions. 


A repetition of a chance process is sometimes called a trial. 


Outcomes that never occur have probability 0. An outcome that 
happens on every repetition has probability 1. An outcome that happens 
half the time in a very long series of trials has probability 0.5. 


The fact that the proportion of heads in many tosses eventually closes 
in on 0.5 is guaranteed by the law of large numbers. You can see this in 
Figure 5.1(b). The horizontal line represents the probability, and the 
proportion of heads in the simulation approaches this value as the number 
of repetitions becomes large. 


DEFINITION Law of large numbers 


The law of large numbers says that if we observe more and more 
repetitions of any chance process, the proportion of times that a 
specific outcome occurs approaches its probability. 


Probability gives us a language to describe the long-term regularity of 
chance behavior that is guaranteed by the law of large numbers. The 
outcome of a coin toss and the sex of the next baby born in a local hospital 
are both random. So is the result of a random sample or a random 
assignment. Even life insurance is based on the fact that deaths occur at 
random among many individuals. Because men are more likely to die at a 
younger age than women, insurance companies sometimes charge a man 
up to 3 times more for a life insurance policy than a woman of the same 
age. 


Recall from Chapter 4 that random doesn’t mean “haphazard.” In statistics, random 


means “by chance.” 
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EXAMPLE | Who drinks coffee? (») 
Interpreting probability 


A. and |. Kruk/Shutterstock.com 


PROBLEM: According to The Book of Odds, the probability that a 
randomly selected U.S. adult drinks coffee on a given day is 0.56. 


a. Interpret this probability as a long-run relative frequency. 


b. If aresearcher randomly selects 100 U.S. adults, will exactly 56 of 
them drink coffee that day? Explain your answer. 


SOLUTION: 


a. If you take a very large random sample of U.S. adults, about 56% of 
them will drink coffee that day. 


The chance process consists of randomly selecting a U.S. adult and 
recording whether or not the person drinks coffee that day. 


b. Probably not. With only 100 randomly selected adults, the number 
who drink coffee that day may not be very close to 56. 


Probability describes what happens in many, many repetitions (way 


more than 100) of a chance process. 


FOR PRACTICE, TRY EXERCISE 1 
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Life insurance companies, casinos, and others who make important 
decisions based on probability rely on the long-run predictability of 
random behavior. 


UNDERSTANDING RANDOMNESS The idea of probability seems 
straightforward. It answers the question “What would happen if we did this 
many times?” Understanding chance behavior is important for making 
decisions, especially when our data collection process includes random 
sampling or random assignment. But understanding randomness isn’t 
always easy, as the following activity illustrates. 


ACTIVITY Investigating randomness 


In this activity, you and your classmates will test your ability to imitate 
chance behavior. 


1. Pretend that you are flipping a fair coin. Without actually flipping a 
coin, imagine the first toss. Write down the result you see in your 
mind, heads (H) or tails (T). 


2. Imagine a second coin flip. Write down the result. 


3. Keep doing this until you have recorded the results of 50 imaginary 
flips. Write your results in groups of 5 to make them easier to read, 
like this: HTHTH TTHHT, and so on. 


4. Arun is a repetition of the same result. In the example in Step 3, 
there is a run of two tails followed by a run of two heads in the first 
10 coin flips. Read through your 50 imagined coin flips and find the 
longest run. 


5. Your teacher will draw and label a number line for a class dotplot. 
Plot on the graph the length of the longest run you got in Step 4. 


6. Use an actual coin, Table D, or technology to generate a similar list of 
50 coin flips. Find the longest run that you have. 


7. Your teacher will draw and label a number line with the same scale 
immediately above or below the one in Step 5. Plot on the new 
dotplot the length of the longest run you got in Step 6. 
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8. Compare the distributions of longest run from imagined tosses and 
random tosses. What do you notice? 


Eyebyte/Alamy 


The idea of probability is that randomness is predictable in the long 
run. Unfortunately, our intuition about chance behavior tries to tell us that 
randomness should also be predictable in the short run. When it isn’t, we 
look for some explanation other than chance variation. 


Suppose you toss a coin 6 times and get TTTTTT. Some people think 
that the next toss must be more likely to give a head. It’s true that in the 
long run, heads will appear half the time. What is a myth is that future 
outcomes must make up for an imbalance like six straight tails. 


Coins and dice have no memories. A coin doesn’t know that the first 6 
outcomes were tails, and it can’t try to get a head on the next toss to even 
things out. Of course, things do even out in the long run. That’s the law of 
large numbers in action. After 10,000 tosses, the results of the first six 
tosses don’t matter. They are overwhelmed by the results of the next 9994 
tosses. 


Some people use the phrase law of averages to refer to the misguided belief that the 


results of a chance process have to even out in the short run. 


When asked to predict the sex—boy (B) or girl (G)—of the next seven 
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babies born in a local hospital, most people will guess something like B-G- 
B-G-B-G-G. Few people would say G-G-G-B-B-B-G because this 
sequence of outcomes doesn’t “look random.” In fact, these two sequences 
of births are equally likely. “Runs” consisting of several of the same 
outcomes in a row are surprisingly common in chance behavior. Many 
students are not aware of this fact when they imagine a sequence of 50 
coin tosses in the “Investigating randomness” activity! 


Is there such a thing as a “hot hand” in basketball? Belief that runs 
must result from something other than “just chance” influences behavior. 
If a basketball player makes several consecutive shots, both the fans and 
her teammates believe that she has a “hot hand” and is more likely to make 
the next shot. Several early studies of the hot hand theory showed that runs 
of baskets made or missed are no more frequent in basketball than would 
be expected if the result of each shot is unrelated to the outcomes of the 
player’s previous shots. If a player makes half her shots in the long run, 
her made shots and misses behave just like tosses of a coin—which means 
that runs of makes and misses are more common than our intuition 


expects.4 


Two more recent studies provide some evidence that there is a small 
hot hand effect for basketball players. These studies also suggest that “hot” 
shooters take riskier shots, which then masks the hot hand effect.2 


CHECK YOUR UNDERSTANDING 


R 
J 


1. Pedro drives the same route to work on Monday through Friday. His 
route includes one traffic light. According to the local traffic 
department, there is a 55% probability that the light will be red when 
Pedro reaches the light. Interpret the probability. 


2. Probability is a measure of how likely an outcome is to occur. Match 
one of the probabilities that follow with each statement. Be prepared 
to defend your answer. 
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00.0010.30.60.9919 9-001 0.3 0.6 0.99 1 


a. This outcome is impossible. It can never occur. 
b. This outcome is certain. It will occur on every trial. 


c. This outcome is very unlikely, but it will occur once in a while in a 
long sequence of trials. 


d. This outcome will occur more often than not. 


3. A husband and wife decide to have children until they have at least 
one child of each sex. The couple has had seven girls in a row. Their 
doctor assures them that they are much more likely to have a boy 
next. Explain why the doctor is wrong. 


Simulation 


We can model chance behavior and estimate probabilities with a 
simulation. 


DEFINITION Simulation 


Simulation is the imitation of chance behavior, based on a model that 
accurately reflects the situation. 


You already have some experience with simulations. In the “Hiring 
discrimination—it just won’t fly!” activity in Chapter 1 (page 6), you drew 
beads or slips of paper to imitate a random lottery to choose which pilots 
would become captains. The “Analyzing the caffeine experiment” activity 
in Chapter 4 (page 273) asked you to shuffle and deal piles of cards to 
mimic the random assignment of subjects to treatments. The “1 in 6 wins” 
game that opened this chapter had you roll a die several times to simulate 
buying 20-ounce sodas and looking under the cap. 

The goal in each of these cases was to use simulation to answer a 
question of interest about some chance process. Different chance “devices” 
were used to perform the simulations—beads, slips of paper, cards, or 
dice. But the same basic strategy was followed each time. 
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THE SIMULATION PROCESS 


e Describe how to use a chance device to imitate one trial (repetition) 


of the simulation. Tell what you will record at the end of each trial. 


e Perform many trials of the simulation. 


e Use the results of your simulation to answer the question of interest. 


For the 1-in-6 wins game, we wanted to estimate the probability of 
getting two or fewer prize winners in a Class of 30 students if the 
company’s 1-in-6 wins claim is true. 


e We rolled a six-sided die 30 times to determine the outcome for each 
person’s bottle of soda: 6 = wins a prize, 1 to 5 = no prize, and recorded 
the number of winners. The dotplot shows the number of winners in 40 
trials of this simulation. 


e In 4 of the 40 trials, two or fewer of the students won a prize. So our 
estimate of the probability is 4/40 = 0.10 = 10%. According to these 
results, getting 2 winners isn’t very likely, but it isn’t unusual enough to 
conclude that the company is lying. 


0 2 4 6 8 10 12 


Simulated number of winners in a class of 30 students 


EXAMPLE | NASCAR cards and cereal boxes » 


Performing simulations 
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Daniel Shirey/Getty Images 


PROBLEM: In an attempt to increase sales, a breakfast cereal company 
decides to offer a NASCAR promotion. Each box of cereal will contain 
a collectible card featuring one of the following NASCAR drivers: Joey 
Lagano, Kevin Harvick, Chase Elliott, Danica Patrick, or Jimmie 
Johnson. The company claims that each of the 5 cards is equally likely 
to appear in any box of cereal. A NASCAR fan decides to keep buying 
boxes of the cereal until she has all 5 drivers’ cards. She is surprised 
when it takes her 23 boxes to get the full set of cards. Does this outcome 
provide convincing evidence that the 5 cards are not equally likely? To 
help answer this question, we want to perform a simulation to estimate 
the probability that it will take 23 or more boxes to get a full set of 5 
NASCAR collectible cards. 


a. Describe how to use a random number generator to perform one trial 
of the simulation. 

The dotplot shows the number of cereal boxes it took to get all 5 

drivers’ cards in 50 trials. 

b. Explain what the dot at 20 represents. 


c. Use the results of the simulation to estimate the probability that it will 
take 23 or more boxes to get a full set of cards. Does this outcome 
provide convincing evidence that the 5 cards are not equally likely? 


Simulated number of boxes required 
SOLUTION: 
a. Let 1 = Joey Lagano, 2 = Kevin Harvick, 3 = Chase Elliott, 4 = 
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Danica Patrick, and 5 = Jimmie Johnson. Generate a random integer 
from 1 to 5 to simulate buying one box of cereal and looking at 
which card is inside. Keep generating random integers until all five 
labels from 1 to 5 appear. Record the number of boxes it takes to get 
all 5 cards. 


You can use the command RandInt(1,5) on the TI-83/84 to generate a 


random integer from 1 to 5. 


. One trial where it took 20 boxes to get all 5 drivers’ cards. 


Be sure to tell what you will record at the end of a trial. 


; a 
Probability ~ = OF so there’s about a 0% chance 


. Probability*050=0 
it would take 23 or more boxes to get a full set. Because it is so 

unlikely that it would take 23 or more boxes to get a full set, this 
result provides convincing evidence that the 5 NASCAR drivers’ 


cards are not equally likely to appear in each box of cereal. 


FOR PRACTICE, TRY EXERCISE 9 


It took our NASCAR fan 23 boxes to complete the set of 5 cards. Does 


that mean the company didn’t tell the truth about how the cards were 
distributed? Not necessarily. Our simulation says that it’s very unlikely for 
someone to have to buy 23 boxes to get a full set if each card is equally 
likely to appear in a box of cereal. The evidence suggests that the 
company’s statement is incorrect. It’s still possible, however, that the 
NASCAR fan was just very unlucky. 


Here’s one more example that shows the simulation process in action. 


EXAMPLE | Golden ticket parking lottery > 


Performing simulations 
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Ann Heath 


PROBLEM: Ata local high school, 95 students have permission to 
park on campus. Each month, the student council holds a “golden ticket 
parking lottery” at a school assembly. The two lucky winners are given 
reserved parking spots next to the school’s main entrance. Last month, 
the winning tickets were drawn by a student council member from the 
AP® Statistics class. When both golden tickets went to members of that 
same class, some people thought the lottery had been rigged. There are 
28 students in the AP® Statistics class, all of whom are eligible to park 
on campus. We want to perform a simulation to estimate the probability 
that a fair lottery would result in two winners from the AP® Statistics 
class. 


a. Describe how you would use a table of random digits to carry out this 
simulation. 


b. Perform 3 trials of the simulation using the random digits given. 

Make your procedure clear so that someone can follow what you did. 
70708 41098 55181 94904 43563 56934 48394 51719 
70708 41098 55181 94904 43563 56934 48394 51719 

c. In 9 of the 100 trials of the simulation, both golden tickets were won 
by members of the AP® Statistics class. Do these results give 
convincing evidence that the lottery was not carried out fairly? 
Explain your reasoning. 
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SOLUTION: 


a. Label the students in the AP® Statistics class from 01 to 28, and 
label the remaining students from 29 to 95. Numbers from 96 to 00 
will be skipped. Moving left to right across a row, look at pairs of 
digits until we come across two different labels from 01 to 95. The 
two students with these labels win the prime parking spaces. Record 
whether or not both winners come from the AP® Statistics class. 
Perform many simulated lotteries. See what percent of the time both 
winners come from this statistics class. 


Describe how to use a chance device to imitate one trial of the 


simulation. Tell what you will record at the end of each trial. 


; 904 43563 56934 48394 51719 
Xx Rot /T00X\vY v 
NO big | YES 


Perform many trials. 


There was one trial out of the first 3 in which both golden parking 
tickets went to members of the AP® Statistics class. 


Use the results of your simulation to answer the question of interest. 


c. No; there’s about a 9% chance of getting both winners from the AP® 
Statistics class in a fair lottery. Because this probability isn’t very 
small, we don’t have convincing evidence that the lottery was unfair. 
Outcomes like this could occur by chance alone in a fair lottery. 


| Does that mean the lottery was conducted fairly? Not necessarily. | 
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FOR PRACTICE, TRY EXERCISE 11 


In the golden ticket lottery example, we ignored repeated numbers 
from 01 to 95 within a given repetition. That’s because the chance process 
involved sampling students without replacement. In the NASCAR 
example, we allowed repeated numbers from 1 to 5 in a given repetition. 
That’s because we were selecting a small number of cards from a very 
large population of cards in thousands of cereal boxes. So the probability 
of getting, say, a Danica Patrick card in the next box of cereal was still 
very close to 1/5 even if we had already selected a Danica Patrick card. 


AP® EXAM TIP 


On the AP® Statistics exam, you may be asked to describe how to 
perform a simulation using rows of random digits. If so, provide a clear 
enough description of your process for the reader to get the same results 
from only your written explanation. Remember that every label needs to 
be the same length. In the golden ticket lottery example, the labels 
should be 01 to 95 (all two digits), not 1 to 95. When sampling without 
replacement, be sure to mention that repeated numbers should be 
ignored. 


CHECK YOUR UNDERSTANDING 


(\ 


\ 


y A basketball announcer suggests that a certain player is a streaky 
shooter. That is, the announcer believes that if the player makes a shot, 
the player is more likely to make the next shot. As evidence, the 
announcer points to a recent game where the player took 30 shots and 
had a streak of 10 made shots in a row. Is this convincing evidence of 
streaky shooting by the player? Assume that this player makes 50% of 
the shots and that the results of a shot don’t depend on previous shots. 
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1. Describe how you would carry out a simulation to estimate the 
probability that a 50% shooter who takes 30 shots in a game would 
have a streak of 10 or more made shots. 

The dotplot displays the results of 50 simulated games in which this 
player took 30 shots. 
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Longest streak in simulated game 


2. Explain what the two dots above 9 indicate. 


3. What conclusion would you draw about whether this player was 
streaky? Explain your answer. 


Section 5.1 Summary 


e Chance behavior is unpredictable in the short run but has a regular and 
predictable pattern in the long run. 


e The long-run relative frequency of an outcome after many repetitions of 
a chance process is its probability. A probability is a number between 0 
(never occurs) and 1 (always occurs). 


e The law of large numbers says that in many repetitions of the same 
chance process, the proportion of times that a particular outcome occurs 
will approach its probability. 


e Simulation can be used to imitate chance behavior and to estimate 
probabilities. To perform a simulation: 


» Describe how to use a chance device to imitate one trial of the 
simulation. Tell what you will record at the end of each trial. 


« Perform many trials of the simulation. 


: Use the results of your simulation to answer the question of interest. 
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Section 5.1 Exercises 


1, 


pg 302 |») Another commercial If Aaron tunes into his favorite 


radio station at a randomly selected time, there is a 0.20 probability 
that a commercial will be playing. 


a. Interpret this probability as a long-run relative frequency. 


b. If Aaron tunes into this station at 5 randomly selected times, will 
there be exactly one time when a commercial is playing? Explain 
your answer. 


Genetics There are many married couples in which the husband and 
wife both carry a gene for cystic fibrosis but don’t have the disease 
themselves. Suppose we select one of these couples at random. 
According to the laws of genetics, the probability that their first child 
will develop cystic fibrosis is 0.25. 


a. Interpret this probability as a long-run relative frequency. 


b. If researchers randomly select 4 such couples, is one of these 
couples guaranteed to have a first child who develops cystic 
fibrosis? Explain your answer. 


Mammograms Many women choose to have annual mammograms 
to screen for breast cancer after age 40. A mammogram isn’t 
foolproof. Sometimes the test suggests that a woman has breast 
cancer when she really doesn’t (a “false positive”). Other times the 
test says that a woman doesn’t have breast cancer when she actually 
does (a “false negative”). Suppose the false negative rate for a 
mammogram is 0.10. 


a. Explain what this probability means. 


b. Which is a more serious error in this case: a false positive or a false 
negative? Justify your answer. 


Liar, liar! Sometimes police use a lie detector test to help determine 
whether a suspect is telling the truth. A lie detector test isn’t foolproof 
—sometimes it suggests that a person is lying when he or she is 
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actually telling the truth (a “false positive”). Other times, the test says 
that the suspect is being truthful when he or she is actually lying (a 
“false negative”). For one brand of lie detector, the probability of a 
false positive is 0.08. 


a. Explain what this probability means. 


b. Which is a more serious error in this case: a false positive or a false 
negative? Justify your answer. 


5. Three pointers The figure shows the results of a basketball player 
attempting many 3-point shots. Explain what this graph tells you 
about chance behavior in the short run and long run. 


100 - 


Percent made 
wn 
i) 


Number of shots 
6. Keep on tossing The figure shows the results of two different sets of 


5000 coin tosses. Explain what this graph tells you about chance 
behavior in the short run and the long run. 


Proportion of heads 


1 5000 
Number of tosses 


7. Anunenlightened gambler 


674 


a. A gambler knows that red and black are equally likely to occur on 
each spin of a roulette wheel. He observes that 5 consecutive reds 
have occurred and bets heavily on black at the next spin. Asked 
why, he explains that “black is due.” Explain to the gambler what 
is wrong with this reasoning. 


b. After hearing you explain why red and black are still equally likely 
after 5 reds on the roulette wheel, the gambler moves to a card 
game. He is dealt 5 straight red cards from a standard deck with 26 
red cards and 26 black cards. He remembers what you said and 
assumes that the next card dealt in the same hand is equally likely 
to be red or black. Explain to the gambler what is wrong with this 
reasoning. 


8. Due for a hit A very good professional baseball player gets a hit 
about 35% of the time over an entire season. After the player failed to 
hit safely in six straight at-bats, a TV commentator said, “He is due 
for a hit.” Explain why the commentator is wrong. 


9. pg 305 \») Will Luke pass the quiz? Luke’s teacher has assigned 


each student in his class an online quiz, which is made up of 10 
multiple-choice questions with 4 options each. Luke hasn’t been 
paying attention in class and has to guess on each question. However, 
his teacher allows each student to take the quiz three times and will 
record the highest of the three scores. A passing score is 6 or more 
correct out of 10. We want to perform a simulation to estimate the 
score that Luke will earn on the quiz if he guesses at random on all 
the questions. 


a. Describe how to use a random number generator to perform one 
trial of the simulation. 
The dotplot shows Luke’s simulated quiz score in 50 trials of the 
simulation. 
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b. Explain what the dot at 1 represents. 


c. Use the results of the simulation to estimate the probability that 
Luke passes the quiz. 


d. Doug is in the same class and claims to understand some of the 
material. If he scored 8 points on the quiz, is there convincing 
evidence that he understands some of the material? Explain your 
answer. 


10. Double fault! A professional tennis player claims to get 90% of her 
second serves in. In a recent match, the player missed 5 of her first 20 
second serves. Is this a surprising result if the player’s claim is true? 
Assume that the player has a 0.10 probability of missing each second 
serve. We want to carry out a simulation to estimate the probability 
that she would miss 5 or more of her first 20 second serves. 


a. Describe how to use a random number generator to perform one 
trial of the simulation. 
The dotplot displays the number of second serves missed by the 
player out of the first 20 second serves in 100 simulated matches. 
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b. Explain what the dot at 6 represents. 


c. Use the results of the simulation to estimate the probability that the 
player would miss 5 or more of her first 20 second serves in a 
match. 


d. Is there convincing evidence that the player misses more than 10% 
of her second serves? Explain your answer. 
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11. 


12, 


pg 306 |») Airport security The Transportation Security 


Administration (TSA) is responsible for airport safety. On some 
flights, TSA officers randomly select passengers for an extra security 
check prior to boarding. One such flight had 76 passengers—12 in 
first class and 64 in coach class. Some passengers were surprised 
when none of the 10 passengers chosen for screening were seated in 
first class. We want to perform a simulation to estimate the 
probability that no first-class passengers would be chosen in a truly 
random selection. 


a. Describe how you would use a table of random digits to carry out 
this simulation. 


b. Perform one trial of the simulation using the random digits that 
follow. Copy the digits onto your paper and mark directly on or 
above them so that someone can follow what you did. 


71487 09984 29077 14863 61683 47052 62224 51025 


c. In 15 of the 100 trials of the simulation, none of the 10 passengers 
chosen was seated in first class. Does this result provide 
convincing evidence that the TSA officers did not carry out a truly 
random selection? Explain your answer. 


Scrabble In the game of Scrabble, each player begins by randomly 
selecting 7 tiles from a bag containing 100 tiles. There are 42 vowels, 
56 consonants, and 2 blank tiles in the bag. Cait chooses her 7 tiles 
and is surprised to discover that all of them are vowels. We want to 
perform a simulation to determine the probability that a player will 
randomly select 7 vowels. 


a. Describe how you would use a table of random digits to carry out 
this simulation. 


b. Perform one trial of the simulation using the random digits given. 
Copy the digits onto your paper and mark directly on or above 
them so that someone can follow what you did. 


00694 05977 19664 65441 20903 62371 22725 53340 


677 


c. In 2 of the 1000 trials of the simulation, all 7 tiles were vowels. 
Does this result give convincing evidence that the bag of tiles was 
not well mixed? 


13. Bull’s-eye! In a certain archery competition, each player continues to 
shoot until he or she misses the center of the target twice. Quinn is 
one of the archers in this competition. Based on past experience, she 
has a 0.60 probability of hitting the center of the target on each shot. 
We want to design a simulation to estimate the probability that Quinn 
stays in the competition for at least 10 shots. Describe how you would 
use each of the following chance devices to perform one trial of the 
simulation. 


a. Slips of paper 
b. Random digits table 


c. Random number generator 


14, Free-throw practice At the end of basketball practice, each player on 
the team must shoot free throws until he makes 10 of them. Dwayne 
is a 70% free-throw shooter. That is, his probability of making any 
free throw is 0.70. We want to design a simulation to estimate the 
probability that Dwayne makes 10 free throws in at most 12 shots. 
Describe how you would use each of the following chance devices to 
perform one trial of the simulation. 


a. Slips of paper 
b. Random digits table 


c. Random number generator 


In Exercises 15—18, determine whether the simulation design is valid. 
Justify your answer. 


15. Smartphone addiction? A media report claims that 50% of U.S. 
teens with smartphones feel addicted to their devices.2 A skeptical 
researcher believes that this figure is too high. She decides to test the 
claim by taking a random sample of 100 U.S. teens who have 
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16. 


WA 


smartphones. Only 40 of the teens in the sample feel addicted to their 
devices. Does this result give convincing evidence that the media 
report’s 50% claim is too high? To find out, we want to perform a 
simulation to estimate the probability of getting 40 or fewer teens 
who feel addicted to their devices in a random sample of size 100 
from a very large population of teens with smartphones in which 50% 
feel addicted to their devices. 


Let 1 = feels addicted and 2 = doesn’t feel addicted. Use a random 
number generator to produce 100 random integers from 1 to 2. 
Record the number of 1’s in the simulated random sample. Repeat 
this process many, many times. Find the percent of trials on which the 
number of 1’s was 40 or less. 


Lefties A website claims that 10% of U.S. adults are left-handed. A 
researcher believes that this figure is too low. She decides to test this 
claim by taking a random sample of 20 U.S. adults and recording how 
many are left-handed. Four of the adults in the sample are left- 
handed. Does this result give convincing evidence that the website’s 
10% claim is too low? To find out, we want to perform a simulation 
to estimate the probability of getting 4 or more left-handed people in 
a random sample of size 20 from a very large population in which 
10% of the people are left-handed. 


Let 00 to 09 indicate left-handed and 10 to 99 represent right- 
handed. Move left to right across a row in Table D. Each pair of digits 
represents one person. Keep going until you get 20 different pairs of 
digits. Record how many people in the simulated sample are left- 
handed. Repeat this process many, many times. Find the proportion of 
trials in which 4 or more people in the simulated sample were left- 
handed. 


Notebook check Every 9 weeks, Mr. Millar collects students’ 
notebooks and checks their homework. He randomly selects 4 
different assignments to inspect for all of the students. Marino is one 
of the students in Mr. Millar’s class. Marino completed 20 homework 
assignments and did not complete 10 assignments. He is surprised 
when Mr. Millar only selects 1 assignment that he completed. Should 
he be surprised? To find out, we want to design a simulation to 
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18. 


19. 


estimate the probability that Mr. Millar will randomly select 1 or 
fewer of the homework assignments that Marino completed. 


Get 30 identical slips of paper. Write “N” on 10 of the slips and 
“C” on the remaining 20 slips. Put the slips into a hat and mix well. 
Draw 1 slip without looking to represent the first randomly selected 
homework assignment, and record whether Marino completed it. Put 
the slip back into the hat, mix again, and draw another slip 
representing the second randomly selected assignment. Record 
whether Marino completed this assignment. Repeat this process two 
more times for the third and fourth randomly selected homework 
assignments. Record the number out of the 4 randomly selected 
homework assignments that Marino completed in this trial of the 
simulation. Perform many trials. Find the proportion of trials in which 
Mr. Millar randomly selects 1 or fewer of the homework assignments 
that Marino completed. 


Random assignment Researchers recruited 20 volunteers—8 men 
and 12 women—to take part in an experiment. They randomly 
assigned the subjects into two groups of 10 people each. To their 
surprise, 6 of the 8 men were randomly assigned to the same 
treatment. Should they be surprised? We want to design a simulation 
to estimate the probability that a proper random assignment would 
result in 6 or more of the 8 men ending up in the same group. 


Get 20 identical slips of paper. Write “M” on 8 of the slips and 
“W” on the remaining 12 slips. Put the slips into a hat and mix well. 
Draw 10 of the slips without looking and place into one pile 
representing Group 1. Place the other 10 slips in a pile representing 
Group 2. Record the largest number of men in either of the two 
groups from this simulated random assignment. Repeat this process 
many, many times. Find the percent of trials in which 6 or more men 
ended up in the same group. 


Color-blind men About 7% of men in the United States have some 
form of red—green color blindness. Suppose we randomly select one 
U.S. adult male at a time until we find one who is red—green color- 
blind. Should we be surprised if it takes us 20 or more men? Describe 
how you would carry out a simulation to estimate the probability that 
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20. 


21, 


22. 


we would have to randomly select 20 or more U.S. adult males to find 
one who is red—green color-blind. Do not perform the simulation. 


Taking the train According to New Jersey Transit, the 8:00 A.M. 
weekday train from Princeton to New York City has a 90% chance of 
arriving on time. To test this claim, an auditor chooses 6 weekdays at 
random during a month to ride this train. The train arrives late on 2 of 
those days. Does the auditor have convincing evidence that the 
company’s claim is false? Describe how you would carry out a 
simulation to estimate the probability that a train with a 90% chance 
of arriving on time each day would be late on 2 or more of 6 days. Do 
not perform the simulation. 


Recycling Do most teens recycle? To find out, an AP® Statistics 

class asked an SRS of 100 students at their school whether they 
regularly recycle. In the sample, 55 students said that they recycle. Is 
this convincing evidence that more than half of the students at the 
school would say they regularly recycle? The dotplot shows the 
results of taking 200 SRSs of 100 students from a population in which 
the true proportion who recycle is 0.50. 


T T T T 
0.35 0.40 0.45 0.50 0.55 0.60 0.65 
Simulated proportion who say “Yes” 


a. Explain why the sample result (55 out of 100 said “Yes”) does not 
give convincing evidence that more than half of the school’s 
students recycle. 


b. Suppose instead that 63 students in the class’s sample had said 


“Yes.” Explain why this result would give convincing evidence 
that a majority of the school’s students recycle. 


Brushing teeth, wasting water? A recent study reported that fewer 
than half of young adults turn off the water while brushing their teeth. 
Is the same true for teenagers? To find out, a group of statistics 
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students asked an SRS of 60 students at their school if they usually 
brush with the water off. In the sample, 27 students said “Yes.” The 
dotplot shows the results of taking 200 SRSs of 60 students from a 
population in which the true proportion who brush with the water off 
is 0.50. 


a. 
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cual T is T T T T T 
0.30 0.35 0.40 0.45 0.50 0.55 0.60 0.65 0.70 


Simulated proportion who say “Yes” 


Explain why the sample result (27 of the 60 students said “Yes”) 
does not give convincing evidence that fewer than half of the 
school’s students brush their teeth with the water off. 


. Suppose instead that 18 of the 60 students in the class’s sample had 


said “Yes.” Explain why this result would give convincing 
evidence that fewer than 50% of the school’s students brush their 
teeth with the water off. 


Multiple Choice Select the best answer for Exercises 23-28. 


23. You read in a book about bridge that the probability that each of the 
four players is dealt exactly one ace is approximately 0.11. This 
means that 


a. 


D: 


in every 100 bridge deals, each player has 1 ace exactly 11 times. 


in 1 million bridge deals, the number of deals on which each player 
has 1 ace will be exactly 110,000. 


in a very large number of bridge deals, the percent of deals on 
which each player has 1 ace will be very close to 11%. 


. ina very large number of bridge deals, the average number of aces 


in a hand will be very close to 0.11. 
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24. 


e. If each player gets an ace in only 2 of the first 50 deals, then each 
player should get an ace in more than 11% of the next 50 deals. 


If I toss a fair coin five times and the outcomes are TTTTT, then the 
probability that tails appears on the next toss is 


a. 0.5. 

b. less than 0.5. 

c. greater than 0.5. 
d. 0. 


e. 1. 


Exercises 25 to 27 refer to the following setting. A basketball player 
claims to make 47% of her shots from the field. We want to simulate the 
player taking sets of 10 shots, assuming that her claim is true. 


25. 


26. 


To simulate the number of makes in 10 shot attempts, you would 
perform the simulation as follows: 


a. Use 10 random one-digit numbers, where 0-4 are a make and 5—9 
are a miss. 


b. Use 10 random two-digit numbers, where 00—46 are a make and 
47—99 are a miss. 


c. Use 10 random two-digit numbers, where 00-47 are a make and 
48—99 are a miss. 


d. Use 47 random one-digit numbers, where 0 is a make and 1-9 area 
miss. 


e. Use 47 random two-digit numbers, where 00O—-46 are a make and 
47—99 are a miss. 


A total of 25 repetitions of the simulation were performed. The 
number of makes in each set of 10 simulated shots was recorded on 
the dotplot. What is the approximate probability that a 47% shooter 
makes 5 or more shots in 10 attempts? 
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e. 


T i; 
6 7 


mn Peers 


wWw—e8ee 


N 
os 


Simulated number of made shots 


5/10 
3/10 
12/25 
3/25 
47/100 


Suppose this player attempts 10 shots in a game and makes only 3 of 
them. Does this provide convincing evidence that she is less than a 
47% shooter? 


a. 


b. 


Yes, because 3/10 (30%) is less than 47%. 
Yes, because she never made 47% of her shots in the simulation. 


No, because it is plausible (believable) that she would make 3 or 
fewer shots by chance alone. 


. No, because the simulation was only repeated 25 times. 


No, because more than half of the simulated results were less than 
A7%, 


Ten percent of U.S. households contain 5 or more people. You want 
to simulate choosing a household at random and recording “Yes” if it 
contains 5 or more people. Which of these is a correct assignment of 
digits for this simulation? 


a. 
b. 
C. 


d. 


Odd = Yes; Even = No 
0 = Yes; 1-9 = No 
0-5 = Yes; 6-9 = No 
0-4 = Yes; 5-9 = No 
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e. None of these 


Recycle and Review 


29. AARP and Medicare (4.1) To find out what proportion of 


30. 


Americans support proposed Medicare legislation to help pay medical 
costs, the AARP conducted a survey of their members (people over 
age 50 who pay membership dues). One of the questions was: “Even 
if this plan won’t affect you personally either way, do you think it 
should be passed so that people with low incomes or people with high 
drug costs can be helped?” Of the respondents, 75% answered 
“Yes.74 


a. Describe how undercoverage might lead to bias in this study. 
Explain the likely direction of the bias. 


b. Describe how the wording of the question might lead to bias in this 


study. Explain the likely direction of the bias. 


Waiting to park (1.3, 4.2) Do drivers take longer to leave their 
parking spaces when someone is waiting? Researchers hung out in a 
parking lot and collected some data. The graphs and numerical 
summaries display information about how long it took drivers to exit 
their spaces. 


Someone waiting? 


0 10 20 30 40 50 60 70 80 9% 
Time (sec) 


Descriptive Statistics: Time 
Waiting n Mean StDev’ Min Qi Median Q3 Max 


No 20 4442 1410 33.76 35.61 39.56 48.48 84.92 
Yes 20 54.11 1439 41.61 43.41 47.14 66.44 85.97 


685 


a. Write a few sentences comparing these distributions. 


b. Can we conclude that having someone waiting causes drivers to 
leave their spaces more slowly? Why or why not? 
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SECTION 5.2 Probability Rules 


LEARNING TARGETS By the end of the section, you should be able to: 


e Give a probability model for a chance process with equally likely 
outcomes and use it to find the probability of an event. 


Use basic probability rules, including the complement rule and the 


addition rule for mutually exclusive events. 


Use a two-way table or Venn diagram to model a chance process 
and calculate probabilities involving two events. 


Apply the general addition rule to calculate probabilities. 


The idea of probability rests on the fact that chance behavior is 
predictable in the long run. In Section 5.1, we used simulation to imitate 
chance behavior. Do we always need to repeat a chance process—tossing 
coins, rolling dice, drawing slips from a hat—many times to determine the 
probability of a particular outcome? Fortunately, the answer is no. 


Probability Models 


In Chapter 2, we saw that a Normal density curve could be used to model 
some distributions of data. In Chapter 3, we modeled linear relationships 
between two quantitative variables with a least-squares regression line. 
Now we’re ready to develop a model for chance behavior. 


Steve Gorton/Getty Images 


Many board games involve rolling dice. Imagine rolling two fair, six- 


687 


sided dice—one that’s red and one that’s blue. How do we develop a 

robability model for this chance process? Figure 5.2 displays the sample 
space of 36 possible outcomes. Because the dice are fair, each of these 
outcomes will be equally likely and have probability 1/36. 
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FIGURE 5.2 The 36 possible outcomes from rolling two six-sided dice, one red 
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and one blue. Each of these equally likely outcomes has probability 1/36. 


DEFINITION Probability model, Sample space 


A probability model is a description of some chance process that 
consists of two parts: a list of all possible outcomes and the 
probability for each outcome. 


The list of all possible outcomes is called the sample space. 


A sample space can be very simple or very complex. If we toss a coin 
once, there are only two possible outcomes in the sample space, heads and 
tails. When Gallup takes a random sample of 1523 U.S. adults and asks a 
survey question, the sample space consists of all possible sets of responses 
from 1523 of the over 240 million adults in the country. 

A probability model does more than just assign a probability to each 
outcome. It allows us to find the probability of an event. 


DEFINITION Event 


An event is any collection of outcomes from some chance process. 
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Events are usually designated by capital letters, like A, B, C, and so on. 
For rolling two six-sided dice, we can define event A as getting a sum of 5. 
We write the probability of event A as P(A) or P(sum is 5). 

It is fairly easy to find the probability of an event in the case of equally 
likely outcomes. There are 4 outcomes in event A: 


if f 
| | 


The probability that event A occurs is therefore 


P(A)=number of outcomes with sum of Stotal number of outcomes when ro 


P(A) = number of outcomes with ww of 5 | -.m 0.111 
total number of outcomes when rolling two dice 36 


FINDING PROBABILITIES: EQUALLY LIKELY 
OUTCOMES 


If all outcomes in the sample space are equally likely, the probability 


that event A occurs can be found using the formula 


P(A)=number of outcomes in event Atotal number of outcomes in sample 
number of outcomes in event A 
P(A) = 


total number of outcomes in sample space 


EXAMPLE | Spin the spinner » 
Probability models: Equally likely outcomes 


PROBLEM: A spinner has three equal sections: red, blue, and yellow. 
Suppose you spin the spinner two times. 


689 


a. Give a probability model for this chance process. 


b. Define event A as spinning blue at least once. Find P(A). 
SOLUTION: 


a. Sample space: RR RB RY BR BB BY YR YB YY. Because the 
spinner has equal sections, each of these outcomes will be equally 
likely and have probability 1/9. 


Remember: A probability model consists of a list of all possible 


outcomes and the probability of each outcome. 


b. There are 5 outcomes with at least one blue: RB BR BB BY YB. So 
P(A) = 2 = 0.556 


P(A)=59=0.556 


If all outcomes in the sample space are equally likely, 


P(A)=number of outcomes in event Atotal number of outcomes in sar 
number of outcomes in event A 
P(A) 


total number of outcomes in sample space 


FOR PRACTICE, TRY EXERCISE 31 


Basic Probability Rules 


Our work so far suggests three commonsense rules that a valid probability 
model must obey: 


_. If all outcomes in the sample space are equally likely, the 
probability that event A occurs is 


P(A)=number of outcomes in event Atotal number of outcomes in sampl 
number of outcomes in event A 
P(A) 


total number of outcomes in sample space 


’. The probability of any event is a number between 0 and 1. This rule 
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follows from the definition of probability in Section 5.1: the proportion 
of times the event would occur in many repetitions of the chance 
process. A proportion is a number between 0 and 1, so any probability is 
also a number between 0 and 1. 


3. All possible outcomes together must have probabilities that add up 
to 1. Because some outcome must occur on every trial of a chance 
process, the sum of the probabilities for all possible outcomes must be 
exactly 1. 

Here’s another rule that follows from the previous two: 

|. The probability that an event does not occur is 1 minus the 
probability that the event does occur. If an event occurs in (say) 70% 
of all trials, it fails to occur in the other 30%. The probability that an 
event occurs and the probability that it does not occur always add to 
100%, or 1. Earlier, we found that the probability of getting a sum of 5 
when rolling two fair, six-sided dice is 4/36. What’s the probability that 
the sum is not 5? 


P(sum is not 5)=1-P(sum is 5)=1—436=3236=0.889 
P(sum is not 5) = 1 — P(sum is 5) = 1 —=—— = 0,889 
We refer to the event “not A” as the complement of A and denote it by 
AC. For that reason, this handy result is known as the complement rule. 


Using the complement rule in this setting is much easier than counting all 
32 possible ways to get a sum that isn’t 5. 


DEFINITION Complement rule, Complement 


The complement rule says that P(A©) = 1—P(A), where A° is the 
complement of event A; that is, the event that A does not occur. 


Let’s consider one more event involving the chance process of rolling 
two fair, six-sided dice: getting a sum of 6. The outcomes in this event are 


(sum is 6) = = 


So P(sum is 6)=536" 36. What’s the probability that we 
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get a sum of 5 or a sum of 6? 


P(sum is 5 or sum is 6) = P(sum is 5) + P(sum is 6) = 436+536=936=0.25 
Oey 
36 36) «36 

Why does this formula work? Because the events “getting a sum of 5” 
and “getting a sum of 6” have no outcomes in common—that is, they can’t 
both happen at the same time. We say that these two events are mutually 
exclusive. As a result, this intuitive formula is known as the addition rule 


for mutually exclusive events. 


P(sum is 5 or sum is 6) = P(sum is 5) + P(sum is 6) = 


DEFINITION Mutually exclusive, Addition rule for 
mutually exclusive events 


Two events A and B are mutually exclusive if they have no 
outcomes in common and so can never occur together—that is, if P(A 
and B) = 0. 


The addition rule for mutually exclusive events A and B says that 


P(A or B) = P(A) + P(B)P(A oF B) = P(A) + P(B) 


Mutually exclusive events are sometimes called disjoint events. 


oo Note that this rule works only for mutually exclusive events. We 


will soon develop a more general rule for finding P(A or B) that works for 
any two events. 


We can summarize the basic probability rules more concisely in 
symbolic form. 


BASIC PROBABILITY RULES 


e For any event A, 0 < P(A) <1. 
e If S is the sample space in a probability model, P(S) = 1. 


e In the case of equally likely outcomes, 
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P(A)=number of outcomes in event Atotal number of outcomes in sam 
(A) number of outcomes in event A 


total number of outcomes in sample space 


¢ Complement rule: P(A°) = 1 — P(A). 


e Addition rule for mutually exclusive events: If A and B are 
mutually exclusive, P(A or B) = P(A) + P(B). 


The earlier dice-rolling and spin the spinner settings involved equally 
likely outcomes. Here’s an example that illustrates use of the basic 


probability rules when the outcomes of a chance process are not equally 
likely. 


EXAMPLE | Avoiding blue M&M’S® (») 


Basic probability rules 


Niels Poulsen std/Alamy 


PROBLEM: Suppose you tear open the corner of a bag of M&M’S® 
Milk Chocolate Candies, pour one candy into your hand, and observe 
the color. According to Mars, Inc., the maker of M&M’S, the 
probability model for a bag from its Cleveland factory is: 


Color Blue Orange Green Yellow Red Brown 
Probability 0.207 0.205 0.198 0.135 0.131 0.124 


a. Explain why this is a valid probability model. 
b. Find the probability that you don’t get a blue M&M. 
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c. What’s the probability that you get an orange or a brown M&M? 
SOLUTION: 


a. The probability of each outcome is a number between 0 and 1. The 
sum of the probabilities is 0.207 + 0.205 + 0.198 + 0.135 + 0.131 + 
0.124 = 1. 


b. P (not blue) = 1 — P (blue) = 1 — 0.207 = 0.793 


Using the complement rule: P (AS )=1-—P (A). 


c. P (orange or brown) = P (orange) + P (brown) 
= 0.205 + 0.124 = 0.329 


Using the addition rule for mutually exclusive events because an 


M&M® can’t be both orange and brown. 


FOR PRACTICE, TRY EXERCISE 35 


For part (b) of the example, we could also use an expanded version of 
the addition rule for mutually exclusive events: 
P(not blue)=P(orange or green or yellow or red or brown)=P(orange)+P(gre 
P(not blue) = P (orange or green or yellow or red or brown) 
= P (orange) + P (green) + P (yellow) + P (red) + P (brown) 
= 0.205 + 0.198 + 0.135 + 0.131 + 0.124 
= 0.793 


Using the complement rule is much simpler than adding 5 probabilities 
together! 


CHECK YOUR UNDERSTANDING 
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y Suppose we choose an American adult at random. Define two 
events: 
A = the person has a cholesterol level of 240 milligrams per 
deciliter of blood (mg/dl) or above (high cholesterol) 
B = the person has a cholesterol level of 200 to <240 mg/dl 
(borderline high cholesterol) 
According to the American Heart Association, P(A) = 0.16 and P(B) = 
0.29. 


1. Explain why events A and B are mutually exclusive. 


2. Say in plain language what the event “A or B” is. Then find P(A or 
B). 

3. Let C be the event that the person chosen has a cholesterol level 
below 200 mg/dl (normal cholesterol). Find P(C). 


Two-Way Tables, Probability, and the General Addition 
Rule 


So far, you have learned how to model chance behavior and some basic 
rules for finding the probability of an event. What if you’re interested in 
finding probabilities involving two events that are not mutually exclusive? 
For instance, a survey of all residents in a large apartment complex reveals 
that 68% use Facebook, 28% use Instagram, and 25% do both.2 Suppose 
we select a resident at random. What’s the probability that the person uses 
Facebook or uses Instagram? 

There are two different uses of the word or in everyday life. Ina 
restaurant, when you are asked if you want “soup or salad,” the waiter 
wants you to choose one or the other, but not both. However, when you 
order coffee and are asked if you want “cream or sugar,” it’s OK to ask for 
one or the other or both. 

Mutually exclusive events A and B cannot both happen at the same 
time. For such events, “A or B” means that only event A happens or only 
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event B happens. You can find P(A or B) with the addition rule for 
mutually exclusive events: 


P(A or B)=P(A)+P(B)? (A or B) = P(A) + P(B) 


How can we find P(A or B) when the two events are not mutually 
exclusive? Now we have to deal with the fact that “A or B” means one or 
the other or both. For instance, “uses Facebook or uses Instagram” in the 
scenario just described includes U.S. adults who do both. 

When you’re trying to find probabilities involving two events, like P(A 
or B), a two-way table can display the sample space in a way that makes 
probability calculations easier. 


EXAMPLE | Who has pierced ears? a) 
Two-way tables and probability 


Big Cheese Photo/Superstock 


PROBLEM: Students in a college statistics class wanted to find out 
how common it is for young adults to have their ears pierced. They 
recorded data on two variables—gender and whether or not the student 
had a pierced ear—for all 178 people in the class. The two-way table 
summarizes the data. 


Gender 
Male Female Total 
Yes 103 
Pierced ear 
No 75 
Total 178 
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Suppose we choose a student from the class at random. Define event A 
as getting a male student and event B as getting a student with a pierced 
ear. 

a. Find P(B). 

b. Find P(A and B). Interpret this value in context. 

c. Find P(A or B). 


SOLUTION: 


a. P(B) = P(pierced ear) = 103178 = 0.579 
P(B) = P(pierced ear) = a = 0.579 


b. P(A andB) = P(male and pierced ear) = 19178 = 0.107 


P(AandB) = P(male and pierced ear) = io = 0.107 Tiere 


about an 11% chance that a randomly selected student from this class 
is male and has a pierced ear. 
c. P(A or B)=P(male or pierced ear)=71+19+84178 = 174178 = 0.978 
P(AorB) = P(male or pierced ear) 
= 71+19+84 = as — 0.978 


178 178 


In statistics, or means “one or the other or both.” So “male or pierced 


ear” includes (i) male but no pierced ear; (ii) pierced ear but not male; 
and (iii) male and pierced ear. 


FOR PRACTICE, TRY EXERCISE 41 


When we found P(male and pierced ear) in part (b) of the example, we 
could have described this as either P(A and B) or P(B and A). Why? 
Because “male and pierced ear” describes the same event as “pierced ear 
and male.” Likewise, P(A or B) is the same as P(B or A). Don’t get so 
caught up in the notation that you lose sight of what’s really happening! 

Part (c) of the example reveals an important fact about finding the 
probability P(A or B): we can’t use the addition rule for mutually 
exclusive events unless events A and B have no outcomes in common. In 
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this case, there are 19 outcomes that are shared by events A and B—the 
students who are male and have a pierced ear. If we did add the 
probabilities of A and B, we’d get 90/178 + 103/178 = 193/178. This is 
clearly wrong because the probability is bigger than 1! As Figure 5.3 
illustrates, outcomes common to both events are counted twice when we 
add the probabilities of these two events. 


Outcomes here are double-counted 
P(A and B) = cS Mma 
Gender 


Pierced ear }———_________——__-__————_—_- i 


FIGURE 5.3 Two-way table showing events A and B from the pierced-ear 
example. These events are not mutually exclusive, so we can’t find P(A or B) by 
just adding the probabilities of the two events. 


We can fix the double-counting problem illustrated in the two-way 
table by subtracting the probability P(male and pierced ear) from the sum. 
That is, 


P(male or pierced ear)=P(male)+P(pierced ear) 
—P(male and pierced ear)=90/178+103/178-19/178=174/178 
P (male or pierced ear) = P (male) + P (pierced ear) — P (male and pierced ear) 
= 90/178 + 103/178 — 19/178 
174/178 


| 


This result is known as the general addition rule. 


DEFINITION General addition rule 


If A and B are any two events resulting from some chance process, 
the general addition rule says that 


P(A or B)=P(A)+P(B)-P(A and B) 
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P(A or B) = P(A) + P(B) — P(A and B) 


Sometimes it’s easier to label events with letters that relate to the 
context, as the following example shows. 


EXAMPLE | Facebook versus Instagram » 


General addition rule 


e000 Verizon > 4:23 PM 7 58% a> 


(9) {nstagrom V 


ey Reading Terminal Market 


THERE AINS NOTHU 


40 likes 
framedcooks SO. TRUE. 


2A0800 2 
Ann Heath 
PROBLEM: A survey of all residents in a large apartment complex 
reveals that 68% use Facebook, 28% use Instagram, and 25% do both. 
Suppose we select a resident at random. What’s the probability that the 
person uses Facebook or uses Instagram? 


SOLUTION: 


Let event F = uses Facebook and I = uses Instagram. 
P(F or I)=P(F)+P(1) — P(F and I)=0.68+0.28 — 0.25=0.71 
P(Forl) = P(F)+P(I) —P(FandI) 
0.68 + 0.28 — 0.25 
0.71 
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FOR PRACTICE, TRY EXERCISE 47 


AP® EXAM TIP 


Many probability problems involve simple computations that you can 
do on your calculator. It may be tempting to just write down your final 
answer without showing the supporting work. Don’t do it! A “naked 
answer,” even if it’s correct, will usually be penalized on a free- 
response question. 


What happens if we use the general addition rule for two mutually 
exclusive events A and B? In that case, P(A and B) = 0, and the formula 
reduces to 

P(A or B)=P(A)+P(B)-P(A and B)=P(A)+P(B)—0=P(A)+P(B) 
P(A or B) = P(A) + P(B) — P(A and B) = P(A) + P(B) —0= P(A) + P(B) 
In other words, the addition rule for mutually exclusive events is just a 
special case of the general addition rule. 

You might be wondering if there is also a rule for finding P(A and B). 
There is, but it’s not quite as intuitive. Stay tuned for that later. 


CHECK YOUR UNDERSTANDING 


(\ 


’ 


y Yellowstone National Park staff surveyed a random sample of 1526 
winter visitors to the park. They asked each person whether they 
belonged to an environmental club (like the Sierra Club). Respondents 
were also asked whether they owned, rented, or had never used a 
snowmobile. The two-way table summarizes the survey responses. 


Environmental club 


Yes 
Snowmobile Never used 212 
experience Renter 77 


Owner 279 16 


Suppose we choose one of the survey respondents at random. 


1. What’s the probability that the person is an environmental club 
member? 


2. Find P(not a snowmobile renter). 
3. What’s P(environmental club member and not a snowmobile renter)? 


4. Find the probability that the person is not an environmental club 
member or is a snowmobile renter. 


Venn Diagrams and Probability 


We have seen that two-way tables can be used to illustrate the sample 
space of a chance process involving two events. So can Venn diagrams, 
like the one shown in Figure 5.4. 


Sample space 


FIGURE 5.4 A typical Venn diagram that shows the sample space and the 
relationship between two events A and B. 


DEFINITION Venn diagram 


A Venn diagram consists of one or more circles surrounded by a 
rectangle. Each circle represents an event. The region inside the 
rectangle represents the sample space of the chance process. 


In an earlier example, we looked at data from a survey on gender and 
ear piercings for a large group of college students. The chance process was 
selecting a student in the class at random. Our events of interest were A: is 
male and B: has a pierced ear. Here is the two-way table that summarizes 
the data: 
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Gender 


Male Female Total 
Yes 103 
Pierced ear 
No 75 
Total 90 88 178 


The Venn diagram in Figure 5.5 displays the sample space in a slightly 
different way. There are four distinct regions in the Venn diagram. These 
regions correspond to the four (non-total) cells in the two-way table as 
follows. 


Male and 
pierced ear 


Female and 
no pierced ear 


FIGURE 5.5 The completed Venn diagram for the large group of college students. 
The circles represent the two events A = male and B = has a pierced ear. 


Region in Venn diagram In words Count 
In the intersection of two circles Male and pierced ear 19 
Inside circle A, outside circle B Male and no pierced ear 71 
Inside circle B, outside circle A Female and pierced ear 84 
Outside both circles Female and no pierced ear 4 


Because Venn diagrams have uses in other branches of mathematics, 
some standard vocabulary and notation have been developed that will 
make our work with Venn diagrams a bit easier. 


e We introduced the complement of an event earlier. In Figure 5.6(a), the 
complement A© contains the outcomes that are not in A. 


e Figure 5.6(b) shows the event “A and B.” You can see why this event is 
also called the intersection of A and B. The corresponding notation is A 
n B. 


e The event “A or B” is shown in Figure 5.6(c). This event is also known 
as the union of A and B. The corresponding notation is A U B. 


702 


ANB ANB 


(a) (b) L Ce a) 
FIGURE 5.6 The green shaded region in each Venn diagram shows: (a) the 
complement A* of event A, (b) the intersection of events A and B, and (c) the 
union of events A and B. 


DEFINITION Intersection, Union 


The event “A and B” is called the intersection of events A and B. It 
consists of all outcomes that are common to both events, and is 
denoted A n B. 


The event “A or B” is called the union of events A and B. It consists 
of all outcomes that are in event A or event B, or both, and is denoted 
A UB. 


Here’s a way to keep the symbols straight: U for union; o for intersection. 


With this new notation, we can rewrite the general addition rule in 
symbols as 


P(A U B)=P(A)+P(B)-P(AnB)? (AU B) = P(A) + P(B) — P(AN B) 


This Venn diagram shows why the formula works in the pierced-ear 
example. 


Outcomes here are 
double-counted by 
P(A) + P(B) 


P(B) = 103/178 


Event A 
male 
P(A) = 90/178 


Event A and B male 
and pierced ear 
P(A B)= 19/178 


For mutually exclusive events A and B, the two events have no 
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outcomes in common. So the corresponding Venn diagram consists of two 
non-overlapping circles. You can see from the figure at left why, in this 
special case, the general addition rule reduces to 


P(A UB)=P(A)+P(B)P(A U B) = P(A) + P(B) 


EXAMPLE | Facebook versus Instagram >» 


Venn diagrams and probability 


Ann Heath 


PROBLEM: A survey of all residents in a large apartment complex 
reveals that 68% use Facebook, 28% use Instagram, and 25% do both. 
Suppose we select a resident at random. 


a. Make a Venn diagram to display the sample space of this chance 
process using the events F: uses Facebook and I: uses Instagram. 


b. Find the probability that the person uses neither Facebook nor 
Instagram. 


SOLUTION: 
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Start with the intersection: P(F n I) = 0.25. 

We know that 68% of residents use Facebook. That figure includes the 
25% who also use Instagram. So 68% — 25% = 43% = 0.43 only use 
Facebook. 


We know that 28% of residents use Instagram. That figure includes 
the 25% who also use Facebook. So 28% — 25% = 3% = 0.03 only use 
Instagram. 

A total of 0.43 + 0.25 + 0.03 = 0.71 = 71% of residents use at least 
one of Facebook or Instagram. By the complement rule, 1 — 0.71 = 
0.29 = 29% use neither Facebook nor Instagram. 


FOR PRACTICE, TRY EXERCISE 51 


In the preceding example, the event “uses neither Facebook nor 
Instagram” is the complement of the event “uses at least one of Facebook 
or Instagram.” To solve part (b) of the problem, we could have used our 
answer from the example on page 321 and the complement rule: 

P(neither Facebook nor Instagram)=1-—P(at least one of Facebook or Instag! 
P(neither Facebook nor Instagram) 
= 1 — P(at least one of Facebook or Instagram) 
= 1] —0.71 = 0.29 
As you’ll see in Section 5.3, the fact that “none” is the opposite of “at least 
1” comes in handy for a variety of probability questions. 


An alternate solution to the example uses a two-way table. Here is a 
partially completed table with the information given in the problem 
statement. Do you see how we can fill in the missing entries? 
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Facebook use 


Yes No Total 
Yes 28% 
Instagram use 
No 
Total 68% 100% 


e 100% - 68% = 32% of residents do not use Facebook. 

e 100% — 28% = 72% of residents do not use Instagram. 

© 68% — 25% = 43% of residents use Facebook but do not use Instagram. 
e 28% — 25% = 3% of residents use Instagram but do not use Facebook. 


e 32% — 3% = 72% — 43% = 29% of residents do not use Facebook and 
do not use Instagram. 


The completed table is shown here. We can see the desired probability 
marked in bold in the table: P(neither Facebook nor Instagram) = 0.29. 


Facebook use 


Yes No Total 

Instagram use 
No 72% 
Total 68% 32% 100% 


Section 5.2 Summary 


e A probability model describes a chance process by listing all possible 
outcomes in the sample space and giving the probability of each 
outcome. A valid probability model requires that all possible outcomes 
have probabilities that add up to 1. 


e An event is a collection of possible outcomes from the sample space. 
The probability of any event is a number between 0 and 1. 


e The event “A or B” is known as the union of A and B, denoted A U B. 
It consists of all outcomes in event A, event B, or both. 


e The event “A and B” is known as the intersection of A and B, denoted 
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A 1B. It consists of all outcomes that are common to both events. 
e To find the probability that an event occurs, we use some basic rules: 
= If all outcomes in the sample space are equally likely, 


P(A)=number of outcomes in event Atotal number of outcomes in samy 
number of outcomes in event A 
P(A) 


total number of outcomes in sample space 


. Complement rule: P(A®) = 1 — P(A), where AC is the complement 
of event A; that is, the event that A does not happen. 


« General addition rule: For any two events A and B, 
P(A UB)=P(A)+P(B)-P(AnByP (A U B) = P(A) + P(B) — P(ANB) 


. Addition rule for mutually exclusive events: Events A and B are 
mutually exclusive if they have no outcomes in common. If A and B 
are mutually exclusive, P(A or B) = P(A) + P(B). 
e A two-way table or Venn diagram can be used to display the sample 


space and to help find probabilities for a chance process involving two 
events. 


Section 5.2 Exercises 


31. pg 315 |») Four-sided dice A four-sided die is a pyramid whose 


four faces are labeled with the numbers 1, 2, 3, and 4 (see image). 
Imagine rolling two fair, four-sided dice and recording the number 
that is showing at the base of each pyramid. For instance, you would 
record a 4 if the die landed as shown in the figure. 


ae 


a. Give a probability model for this chance process. 
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33. 


34. 


35. 


b. Define event A as getting a sum of 5. Find P(A). 


Tossing coins Imagine tossing a fair coin 3 times. 
a. Give a probability model for this chance process. 


b. Define event B as getting more heads than tails. Find P(B). 


Grandkids Mr. Starnes and his wife have 6 grandchildren: Connor, 
Declan, Lucas, Piper, Sedona, and Zayne. They have 2 extra tickets to 
a holiday show, and will randomly select which 2 grandkids get to see 
the show with them. 


a. Give a probability model for this chance process. 


b. Find the probability that at least one of the two girls (Piper and 
Sedona) get to go to the show. 


Who’s paying? Abigail, Bobby, Carlos, DeAnna, and Emily go to 
the bagel shop for lunch every Thursday. Each time, they randomly 
pick 2 of the group to pay for lunch by drawing names from a hat. 


a. Give a probability model for this chance process. 


b. Find the probability that Carlos or DeAnna (or both) ends up 
paying for lunch. 


pg 317 |») Mystery box Ms. Tyson keeps a Mystery Box in her 


classroom. If a student meets expectations for behavior, she or he is 
allowed to draw a slip of paper without looking. The slips are all of 
equal size, are well mixed, and have the name of a prize written on 
them. One of the “prizes”—extra homework— isn’t very desirable! 
Here is the probability model for the prizes a student can win: 


Prize Pencil Candy Stickers Homework pass — Extra homework 
Probability 0.40 0.25 0.15 0.15 0.05 


a. Explain why this is a valid probability model. 
b. Find the probability that a student does not win extra homework. 


c. What’s the probability that a student wins candy or a homework 
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pass? 


36. Languages in Canada Canada has two official languages, English 
and French. Choose a Canadian at random and ask, “What is your 
mother tongue?” Here is the distribution of responses, combining 


many separate languages from the broad Asia/Pacific region:® 
Language English French Asian/Pacific Other 
Probability 0.63 0.22 0.06 0.09 


a. Explain why this is a valid probability model. 


b. What is the probability that the chosen person’s mother tongue is 
not English? 


c. What is the probability that the chosen person’s mother tongue is 
one of Canada’s official languages? 


37. Household size In government data, a household consists of all 
occupants of a dwelling unit. Choose an American household at 
random and count the number of people it contains. Here is the 
assignment of probabilities for the outcome. The probability of 
finding 3 people in a household is the same as the probability of 
finding 4 people. 


Number of people 1 2 3 4 fs) 6 Tt 
Probability 0.25 0.32 ? ? 0.07 420.03 0.01 


a. What probability should replace “?” in the table? Why? 
b. Find the probability that the chosen household contains more than 
2 people. 


38. When did you leave? The National Household Travel Survey gathers 
data on the time of day when people begin a trip in their car or other 
vehicle. Choose a trip at random and record the time at which the trip 
started.2 Here is an assignment of probabilities for the outcomes: 


Time of day 10 P.M.—12:59 AM. 1TAM—S:59 A.M. 6A.M.-8:59 A.M. 
Probability 0.040 0.033 0.144 
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Time of day 9 A.M.—12:59 P.M. 1 P.M.—3:59 P.M. 4 P.M.—6:59 P.M. 


Probability 0.234 0.208 ? 
Time of day 7 P.M.—9:59 P.M. 
Probability 0.123 


a. What probability should replace “?” in the table? Why? 


b. Find the probability that the chosen trip did not begin between 9 


A.M. and 12:59 P.M. 


39. Education among young adults Choose a young adult (aged 25 to 


40. 


29) at random. The probability is 0.13 that the person chosen did not 
complete high school, 0.29 that the person has a high school diploma 
but no further education, and 0.30 that the person has at least a 
bachelor’s degree. 


a. What must be the probability that a randomly chosen young adult 
has some education beyond high school but does not have a 
bachelor’s degree? Why? 


b. Find the probability that the young adult completed high school. 


Which probability rule did you use to find the answer? 


c. Find the probability that the young adult has further education 


beyond high school. Which probability rule did you use to find the 
answer? 


Preparing for the GMAT A company that offers courses to prepare 
students for the Graduate Management Admission Test (GMAT) has 
collected the following information about its customers: 20% are 
undergraduate students in business, 15% are undergraduate students 
in other fields of study, and 60% are college graduates who are 
currently employed. Choose a customer at random. 


a. What must be the probability that the customer is a college 


graduate who is not currently employed? Why? 


b. Find the probability that the customer is currently an 


undergraduate. Which probability rule did you use to find the 
answer? 
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41. 


42. 


c. Find the probability that the customer is not an undergraduate 
business student. Which probability rule did you use to find the 
answer? 


pg 319 |») Who eats breakfast? Students in an urban school were 


curious about how many children regularly eat breakfast. They 
conducted a survey, asking, “Do you eat breakfast on a regular 
basis?” All 595 students in the school responded to the survey. The 
resulting data are shown in the two-way table.® 


Gender 
Male Female Total 
Eats breakfast Yes 300 
regularly No 295 
Total 320 275 Soo 


Suppose we select a student from the school at random. Define event 
F as getting a female student and event B as getting a student who 
eats breakfast regularly. 


a. Find P(B®). 
b. Find P(E and B®). Interpret this value in context. 


c. Find P(F or B®). 


Is this your card? A standard deck of playing cards (with jokers 
removed) consists of 52 cards in four suits—clubs, diamonds, hearts, 
and spades. Each suit has 13 cards, with denominations ace, 2, 3, 4, 5, 
6, 7, 8, 9, 10, jack, queen, and king. The jacks, queens, and kings are 
referred to as “face cards.” Imagine that we shuffle the deck 
thoroughly and deal one card. Define events F: getting a face card and 
H: getting a heart. The two-way table summarizes the sample space 
for this chance process. 


Card 
Face card Nonface card Total 
Heart 13 
Suit 
Nonheart 39 


Total | 12 40 52 


a. Find P(H°). 
b. Find P(H© and F). Interpret this value in context. 


c. Find P(H® or F). 


43. Cell phones The Pew Research Center asked a random sample of 


44. 


2024 adult cell-phone owners from the United States their age and 
which type of cell phone they own: iPhone, Android, or other 
(including non-smartphones). The two-way table summarizes the 
data. 


Age 
18-34 35-54 55+ Total 
iPhone 467 
Type or cel Android 503 
phone 
Other 1054 
Total 517 637 870 2024 


Suppose we select one of the survey respondents at random. What’s 
the probability that: 


a. The person is not age 18 to 34 and does not own an iPhone? 


b. The person is age 18 to 34 or owns an iPhone? 


Middle school values Researchers carried out a survey of fourth-, 
fifth-, and sixth-grade students in Michigan. Students were asked 
whether good grades, athletic ability, or being popular was most 
important to them. The two-way table summarizes the survey data.2 


Grade 
Ath grade 5th grade 6th grade Total 
Grades 168 
Nas Athletic 98 
important 
Popular 69 
Total 92 108 135 335 
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Suppose we select one of these students at random. What’s the 
probability of each of the following? 


a. The student is a sixth-grader or rated good grades as important. 


b. The student is not a sixth-grader and did not rate good grades as 
important. 


Roulette An American roulette wheel has 38 slots with numbers 1 
through 36, 0, and 00, as shown in the figure. Of the numbered slots, 
18 are red, 18 are black, and 2—the 0 and 00—are green. When the 
wheel is spun, a metal ball is dropped onto the middle of the wheel. If 
the wheel is balanced, the ball is equally likely to settle in any of the 
numbered slots. Imagine spinning a fair wheel once. Define events B: 
ball lands in a black slot, and E: ball lands in an even-numbered slot. 
(Treat 0 and 00 as even numbers.) 


em 1 00 zz Or = 


Cc 


a. Make a two-way table that displays the sample space in terms of 
events B and E. 


b. Find P(B) and P(E). 


c. Describe the event “B and E” in words. Then find the probability 
of this event. 


d. Explain why P(B or E) # P(B) + P(E). Then use the general 
addition rule to compute P(B or E). 


46. Colorful disks A jar contains 36 disks: 9 each of four colors—red, 
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47. 


48. 


49. 


green, blue, and yellow. Each set of disks of the same color is 
numbered from 1 to 9. Suppose you draw one disk at random from 
the jar. Define events R: get a red disk, and N: get a disk with the 
number 9. 


a. Make a two-way table that describes the sample space in terms of 
events R and N. 


b. Find P(R) and P(N). 


c. Describe the event “R and N” in words. Then find the probability 
of this event. 


d. Explain why P(R or N) # P(R) + P(N) Then use the general 
addition rule to compute P(R or N). 


pg 321 |p) Dogs and cats In one large city, 40% of all households 


own a dog, 32% own a cat, and 18% own both. Suppose we randomly 
select a household. What’s the probability that the household owns a 
dog or a cat? 


Reading the paper In a large business hotel, 40% of guests read the 
Los Angeles Times. Only 25% read the Wall Street Journal. Five 
percent of guests read both papers. Suppose we select a hotel guest at 
random and record which of the two papers the person reads, if either. 
What’s the probability that the person reads the Los Angeles Times or 
the Wall Street Journal? 


Mac or PC? A recent census at a major university revealed that 60% 
of its students mainly used Macs. The rest mainly used PCs. At the 
time of the census, 67% of the school’s students were undergraduates. 
The rest were graduate students. In the census, 23% of respondents 
were graduate students and used a Mac as their main computer. 
Suppose we select a student at random from among those who were 
part of the census. Define events G: is a graduate student and M: 
primarily uses a Mac. 


a. Find P(G U M). Interpret this value in context. 


b. Consider the event that the randomly selected student is an 
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undergraduate student and primarily uses a PC. Write this event in 
symbolic form and find its probability. 


50. Gender and political party In January 2017, 52% of U.S. senators 
were Republicans and the rest were Democrats or Independents. 
Twenty-one percent of the senators were females, and 47% of the 
senators were male Republicans. Suppose we select one of these 
senators at random. Define events R: is a Republican and M: is male. 


a. Find P(R U M). Interpret this value in context. 


b. Consider the event that the randomly selected senator is a female 
Democrat or Independent. Write this event in symbolic form and 
find its probability. 


ol. pg 324 |p) Dogs and cats Refer to Exercise 47. 


a. Make a Venn diagram to display the outcomes of this chance 
process using events D: owns a dog and C: owns a Cat. 


b. Find P(D n C°). 
52. Reading the paper Refer to Exercise 48. 


a. Make a Venn diagram to display the outcomes of this chance 
process using events L: reads the Los Angeles Times and W: reads 
the Wall Street Journal. 


b. Find P(L© n W). 


53. Union and intersection Suppose A and B are two events such that 
P(A) = 0.3, P(B) = 0.4, and P(A U B) = 0.58. Find P(A n B). 


54. Union and intersection Suppose C and D are two events such that 
P(C) = 0.6, P(D) = 0.45, and P(C _ U D) = 0.75. Find P(C n D). 


Multiple Choice Select the best answer for Exercises 55—58. 


55. The partially completed table that follows shows the distribution of 
scores on the 2016 AP® Statistics exam. 


Score 1 2 3 4 5 
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Probability 0.235 0.155 0.249 0.217 : 


Suppose we randomly select a student who took this exam. What’s 
the probability that he or she earned a score of at least 3? 


a. 0.249 
b. 0.361 
c. 0.390 
d. 0.466 
e. 0.610 


56. In a sample of 275 students, 20 say they are vegetarians. Of the 
vegetarians, 9 eat both fish and eggs, 3 eat eggs but not fish, and 7 eat 
neither. Choose one of the vegetarians at random. What is the 
probability that the chosen student eats fish or eggs? 


a. 9/20 
b. 13/20 
c. 22/20 
d. 9/275 


@ 22/275 


Exercises 57 and 58 refer to the following setting. The casino game craps 
is based on rolling two dice. Here is the assignment of probabilities to the 
sum of the numbers on the up-faces when two dice are rolled: 


Outcome 2 3 4 fs) 6 i 8 a 10 «11 = 12 
Probability 1/36 2/36 3/36 4/36 5/36 6/36 5/36 4/36 3/36 2/36 1/36 


57. The most common bet in craps is the “pass line.” A pass line bettor 
wins immediately if either a 7 or an 11 comes up on the first roll. This 
is called a natural. What is the probability that a natural does not 
occur? 


a. 2/36 


716 


b. 6/36 
c. 8/36 
d. 16/36 
e. 28/36 


58. If a player rolls a 2, 3, or 12, it is called craps. What is the probability 
of getting craps or an even sum on one roll of the dice? 


a. 4/36 
b. 18/36 
c. 20/36 
d. 22/36 


e. 32/36 
Recycle and Review 


59. Crawl before you walk (3.1, 3.2, 4.3) At what age do babies learn to 
crawl? Does it take longer to learn in the winter, when babies are 
often bundled in clothes that restrict their movement? Perhaps there 
might even be an association between babies’ crawling age and the 
average temperature during the month they first try to crawl (around 6 
months after birth). Data were collected from parents who brought 
their babies to the University of Denver Infant Study Center to 
participate in one of a number of studies. Parents reported the birth 
month and the age at which their child was first able to creep or crawl 
a distance of 4 feet within one minute. Information was obtained on 
414 infants (208 boys and 206 girls). Crawling age is given in weeks, 
and average temperature (in degrees Fahrenheit) is given for the 
month that is 6 months after the birth month12 


Birth month Average crawling age Average temperature 
January 29.84 66 
February 30.52 73 
March 29.70 J. 
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April 31.84 63 


May 28.58 52 
June 31.44 39 
July 33.64 33 
August 32.82 30 
September 33.83 33 
October 33.35 37 
November 33.38 48 
December 32.32 57 


a. Make an appropriate graph to display the relationship between 
average temperature and average crawling age. Describe what you 
see. 


Some computer output from a least-squares regression analysis of 
the data is shown. 


Term Coef SE Coef T-Value P-Value 
Constant 35.68 Lae 27.08 0.000 
Average -0.0777 0.0251 -3.10 0.011 
temperature 

S = 1.31920 R-Sq = 48.96% R-Sq(adj)= 43.86% 


b. What is the equation of the least-squares regression line that 
describes the relationship between average temperature and 
average crawling age? Define any variables that you use. 


c. Interpret the slope of the regression line. 


d. Can we conclude that warmer temperatures 6 months after babies 
are born causes them to crawl sooner? Justify your answer. 


60. Treating low bone density (4.2, 4.3) Fractures of the spine are 
common and serious among women with advanced osteoporosis (low 
mineral density in the bones). Can taking strontium ranelate help? A 
large medical trial was conducted to investigate this question. 
Researchers recruited 1649 women with osteoporosis who had 
previously had at least one fracture for an experiment. The women 
were assigned to take either strontium ranelate or a placebo each day. 
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All the women were taking calcium supplements and receiving 


standard medical care. One response variable was the number of new 


fractures over 3 years. 


a. Describe a completely randomized design for this experiment. 


b. Explain why it is important to keep the calcium supplements and 
medical care the same for all the women in the experiment. 


c. The women who took strontium ranelate had statistically 
significantly fewer new fractures, on average, than the women who 
took a placebo over a 3-year period. Explain what this means to 
someone who knows little statistics. 
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SECTION 5.3 Conditional Probability and 
Independence 


LEARNING TARGETS. By the end of the section, you should be able to: 
Calculate and interpret conditional probabilities. 


Determine if two events are independent. 


Use the general multiplication rule to calculate probabilities. 


Use a tree diagram to model a chance process involving a sequence 
of outcomes and to calculate probabilities. 


When appropriate, use the multiplication rule for independent 
events to calculate probabilities. 


The probability of an event can change if we know that some other event 
has occurred. For instance, suppose you toss a fair coin twice. The 
probability of getting two heads is 1/4 because the sample space consists 
of the 4 equally likely outcomes 


HH HT TH Tr  4HH HT TH TT 


Suppose that the first toss lands tails. Now what’s the probability of 
getting two heads? It’s 0. Knowing that the first toss is a tail changes the 
probability that you get two heads. 


This idea is the key to many applications of probability. 


What Is Conditional Probability? 


Let’s return to the college statistics class from Section 5.2. Earlier, we 
used the two-way table shown on the next page to find probabilities 
involving events A: is male and B: has a pierced ear for a randomly 
selected student. 


Gender 
Male Female Total 
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Yes {2 84 103 
75 
Total 90 88 178 


Pierced ear 


Here is a summary of our previous results: 


P(A)=P(male)=90/178P(A nB)=P(male and pierced 
ear)=19/178P(B)=P(pierced ear)=103/178P(A U B)=P(male or pierced 
ear)=174/178 
P(A) = P(male) = 90/178 P(A ™B) = P(male and pierced ear) = 19/178 


P(B) = P(pierced ear) = 103/178 P(AUB) = P(maleor pierced ear) = 174/178 


Now let’s turn our attention to some other interesting probability 
questions. 


. If we know that a randomly selected student has a pierced ear, what 
is the probability that the student is male? There are 103 students in 
the class with a pierced ear. We can restrict our attention to this group, 
since we are told that the chosen student has a pierced ear. Because 
there are 19 males among the 103 students with a pierced ear, the 
desired probability is 


P(male given pierced ear)=19/103=0.184=18.4% 
P(male given pierced ear) = 19/103 = 0.184 = 18.4% 


Gender 
Male Female Total 
Yes 103 
Pierced ear 
No 75 
Total 178 


’. If we know that a randomly selected student is male, what’s the 
probability that the student has a pierced ear? This time, our 
attention is focused on the males in the class. Because 19 of the 90 
males in the class have a pierced ear, 


P(pierced ear given male)=19/90=0.211=21.1% 
P(pierced ear given male) = 19/90 = 0.211 = 21.1% 


Gender 


| | | 
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Male Female Total 


Pierced ear Yes 103 
No 75 
Total 90 88 178 


These two questions sound alike, but they actually ask two very 
different things. Each of these probabilities is an example of a conditional 
probability. The name comes from the fact that we are trying to find the 
probability that one event will happen under the condition that some other 
event is already known to have occurred. We often use the phrase “given 
that” to signal the condition. 


DEFINITION Conditional probability 


The probability that one event happens given that another event is 
known to have happened is called a conditional probability. The 
conditional probability that event B happens given that event A has 
happened is denoted by P(B | A). 


With this new notation available, we can restate the answers to the two 
questions just posed as 


P(male|pierced ear)=P(A|B)=19/103andP(pierced ear|male)=P(B|A)=19/90 
P(male|pierced ear) = P(A|B) = 19/103 
and 
P(pierced ear|male) = P(B|A) = 19/90 
Here’s an example that illustrates how conditional probability works in 
a familiar setting. 


EXAMPLE |A Titanic disaster (») 


Two-way tables and conditional probabilities 
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PROBLEM: In 1912, the luxury liner Titanic, on its first voyage across 
the Atlantic, struck an iceberg and sank. Some passengers got off the 
ship in lifeboats, but many died. The two-way table gives information 
about adult passengers who survived and who died, by class of travel. 


Class of travel 


First Second Third Total 
Survival Survived 442 
status Died 765 
Total 319 261 627 1207 


Suppose we randomly select one of the adult passengers from the 
Titanic. Define events F: first-class passenger, S: survived, and T: third- 
class passenger. 


a. Find P(T|S). Interpret this value in context. 


b. Given that the chosen person is not a first-class passenger, what’s the 
probability that she or he survived? Write your answer as a 
probability statement using correct symbols for the events. 


SOLUTION: 


a. P(T|S) = P(third-class passenger | survived) = 151/442 = 0.342. 
Given that the randomly chosen person survived, there is about a 
34.2% chance that she or he was a third-class passenger. 


To answer part (a), only consider values in the “Survived” row. 
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b. P(survived|not first- 
class passenger)=P(S|FC)=94+151261+627=245888=0.276 
P(survived/|not first-class passenger ) 


2614-627 888 


To answer part (b), only consider values in the “Second class” and 


“Third class” columns. 


FOR PRACTICE, TRY EXERCISE 61 


Is there is a connection between conditional probability and 
conditional relative frequency from Chapter 1? Yes! In part (a) of the 
example, we found the conditional probability P(third-class 
passenger|survived) = 151/442 = 0.342. In Chapter 1, we asked, “What 
proportion of survivors were third-class passengers?” Our answer was also 
151/442 = 0.342. This is a conditional relative frequency because we are 
finding the percent or proportion of third-class passengers among those 
who survived. So a conditional probability is just a conditional relative 
frequency that comes from a chance process—in this case, randomly 
selecting an adult passenger. 


Let’s look more closely at how conditional probabilities are calculated 
using the data from the college statistics class. From the two-way table that 
follows, we see that 


P(male|pierced ear)=19103=number of students who are male and have a pi 


; 19 number of students who are male and have a pierced ear 
P(male|pierced ear) = ee naa a 


~ 103 number of students with a pierced ear 
Gender 
Male Female Total 
Yes 103 
Pierced ear 
No 75 
Total 90 88 178 


What if we focus on probabilities instead of numbers of students? 
Notice that 
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P(male and pierced ear)P(pierced ear)=19178103178=19103=P(male | 
19 


P(male and piercedear) 17g 19 
= — = — = P (male | pierced ear) 
P(pierced ear) 103 103 
pierced ear) 178 


This observation leads to a general formula for computing a 
conditional probability. 


CALCULATING CONDITIONAL PROBABILITIES 


To find the conditional probability P(A | B), use the formula 


P(A | B)=P(A and B)P(B)=P(A n 
B)P(B)=P(both events occur)P(given event occurs) 


P(A|B) = P(AandB) P(ANMB) _ P(both events occur) 
- P(B) 7 P(B) - P(given event occurs) 


By the same reasoning, 


P(B | A)=P(B and A)P(A)=P(B n A)P(A) 
P(BandA) P(BN A) 
P(B| A) =—————_ => ————_ 
P(A) P(A) 


EXAMPLE | Facebook or Instagram? (>) 
Calculating conditional probability 


PROBLEM: A survey of all residents in a large apartment complex 
reveals that 68% use Facebook, 28% use Instagram, and 25% do both. 
Suppose we select a resident at random. Given that the person uses 
Facebook, what’s the probability that she or he uses Instagram? 


SOLUTION: 
P(Instagram|Facebook)=P(I|F)=P(In F)P(F)=0.250.68=0.368 
P(Instagram|Facebook) = P(I|F) = ar = = = 0.368 
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Refer back to the example. If the person chosen is an Instagram user, 
what is the probability that he or she uses Facebook? By the conditional 
probability formula, it’s 


P(Facebook|Instagram)=P(F|ID)=P(F nI)P(1I)=0.250.28=0.893 


AP® EXAM TIP 


You can write statements like P (A | B) if events A and B are clearly 
defined in a problem. Otherwise, it’s probably easier to use contextual 
labels, like P (I | F) in the preceding example. Or you can just use 
words: P (Instagram | Facebook). 


If the chosen resident uses Instagram, it is extremely likely that he or 
she uses Facebook. However, if the chosen resident uses Facebook, he or 
she is not nearly so likely to use Instagram. 

You could also use a two-way table to help you find these conditional 
probabilities. Here’s the table that we made for this setting in Section 5.2 
(page 325). It is easy to see that 


P(Instagram|Facebook)=0.250.68=0.368andP(Facebook|Instagram)=0.250.: 


2 0.25 
P(Instagram|Facebook) = ete = 0.368 and P(Facebook|Instagram) = = 0.893 


0.68 0.28 
Facebook use 
No Total 
Yes 28% 
Instagram use 
No 72% 
Total 68% 32% 100% 


CHECK YOUR UNDERSTANDING 
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y Yellowstone National Park surveyed a random sample of 1526 
winter visitors to the park. They asked each person whether he or she 
owned, rented, or had never used a snowmobile. Respondents were also 
asked whether they belonged to an environmental organization (like the 
Sierra Club). The two-way table summarizes the survey responses. 


Environmental club 


No Yes Total 
Never used 657 
SIDSITD CS Tash 574 
experience 
Owner 295 
Total 1221 305 1526 


Suppose we randomly select one of the survey respondents. Define 
events E: environmental club member, S: snowmobile owner, and N: 
never used. 


1. Find P(N | E). Interpret this value in context. 


2. Given that the chosen person is not a snowmobile owner, what’s the 
probability that she or he is an environmental club member? Write 
your answer as a probability statement using correct symbols for the 
events. 


3. Is the chosen person more likely to not be an environmental club 
member if he or she has never used a snowmobile, is a snowmobile 
Owner, or a snowmobile renter? Justify your answer. 


Conditional Probability and Independence 


Suppose you toss a fair coin twice. Define events A: first toss is a head, 
and B: second toss is a head. We know that P(A) = 1/2 and P(B) = 1/2. 


e What’s P(B | A)? It’s the conditional probability that the second toss is a 
head given that the first toss was a head. The coin has no memory, so 
P(B | A) = 1/2. 
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° What’s P(B | A)? It’s the conditional probability that the second toss is 
a head given that the first toss was not a head. Getting a tail on the first 
toss does not change the probability of getting a head on the second toss, 
so P(B | A®) = 1/2. 

In this case, P(B | A) = P(B | A°) = P(B). Knowing the outcome of the 
first toss does not change the probability that the second toss is a head. We 
say that A and B are independent events. 


DEFINITION Independent events 


A and B are independent events if knowing whether or not one event 
has occurred does not change the probability that the other event will 
happen. In other words, events A and B are independent if 


P(AIB)=P(A|BC)=P(Ay’ (|B) = P(A|B®) = P(A) 


Alternatively, events A and B are independent if 


P(BIA)=P(BIAC)=P(By’ (BIA) = P(BIA®) = PCB) 


Let’s contrast the coin-toss scenario with our earlier pierced-ear 
example. In that case, the chance process involved randomly selecting a 
student from a college statistics class. The events of interest were A: is 
male, and B: has a pierced ear. Are these two events independent? 


Gender 
Male Female Total 
Yes 103 
Pierced ear 
No 75 
Total 90 88 178 


e Suppose that the chosen student is male. We can see from the two-way 
table that P(pierced ear | male) = P(B | A) = 19/90 = 0.211. 


e Suppose that the chosen student is female. From the two-way table, we 
see that P(pierced ear | female) = P(B | A°) = 84/88 = 0.955. 
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Knowing that the chosen student is a male changes (greatly reduces) the 
probability that the student has a pierced ear. So these two events are not 
independent. 


Another way to determine if two events A and B are independent is to 
compare P(A | B) to P(A) or P(B | A) to P(B). For the pierced-ear setting, 
P(pierced ear|male)=P(B|A)=19/90=0.211 
P(pierced ear|male) = P(B|A) = 19/90 = 0.211 
The unconditional probability that the chosen student has a pierced ear is 


P(pierced ear)=P(B)=103/178=0.579 
P(pierced ear) = P(B) = 103/178 = 0.579 


Again, knowing that the chosen student is male changes (reduces) the 
probability that the individual has a pierced ear. So these two events are 
not independent. 


EXAMPLE | Gender and handedness » 
Checking for independence 


Graeme Harris/Getty Images 


PROBLEM: Is there a relationship between gender and handedness? 
To find out, we used Census At School’s Random Data Selector to 
choose an SRS of 100 Australian high school students who completed a 
survey. The two-way table summarizes the relationship between gender 
and dominant hand for these students. 


Gender 
Male Female Total 
Dominant Right 90 
hand Left 10 


Total 46 34 100 


Suppose we choose one of the students in the sample at random. Are the 
events “male” and “left-handed” independent? Justify your answer. 


SOLUTION: 

P(left-handed|male)=7/46=0.152/ (left-handed|male) = 7/46 = 0.152 
P(left-handed|female)=3/5 A=0.056" (left-handed |female) = 3/54 = 0.056 
Because these probabilities are not equal, the events “male” and “left- 


handed” are not independent. Knowing that the student is male increases 
the probability that the student is left-handed. 
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In the example, we could have also determined that the two events are 
not independent by showing that 


P(left-handed | male)=7/46=0.1524P(left-handed)=10/100=0.100 
P (left-handed | male) = 7/46 = 0.152 # P (left-handed) = 10/100 = 0.100 


Or we could have focused on whether knowing that the chosen student is 
left-handed changes the probability that the person is male. Because 


P(male | left-handed)=7/10=0.70#P(male|right-handed)=39/90=0.433 
P (male | left-handed) = 7/10 = 0.70 # P (male|right-handed) = 39/90 = 0.433 


the events “male” and “left-handed” are not independent. 

You might have thought, “Surely there’s no connection between 
gender and handedness. The events ‘male’ and ‘left-handed’ are bound to 
be independent.” As the example shows, you can’t use your intuition to 
check whether events are independent. To be sure, you have to calculate 
some probabilities. 
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Percent 


100 = 
90 + 
80 4 
70 + 
60 
50 
40 
30 - 
20 + 
10 4 

0 


Male Female 
Gender 


Is there a connection between independence of events and association 
between two variables? Yes! In the preceding example, we found that the 
events “male” and “left-handed” were not independent for the sample of 
100 Australian high school students. Knowing a student’s gender helped 
us predict his or her dominant hand. By what you learned in Chapter 1, 
there is an association between gender and handedness for the students in 
the sample. The segmented bar graph shows the association in picture 
form. 

Does that mean an association exists between gender and handedness 
in the larger population? Maybe. If there is no association between the 
variables in the population, it would be surprising to choose a random 
sample of 100 students for which P(left-handed | male), P(left-handed | 
female), and P(left-handed) were exactly equal. But these probabilities 
should be close to equal if there’s no association between the variables in 
the population. How close is close? We’Il discuss this issue further in 


Chapter 11. 


CHECK YOUR UNDERSTANDING 


(\ 


\ 


y For each chance process given, determine whether the events are 
independent. Justify your answer. 


1. Shuffle a standard deck of cards, and turn over the top card. Put it 
back in the deck, shuffle again, and turn over the top card. Define 
events A: first card is a heart, and B: second card is a heart. 


2. Shuffle a standard deck of cards, and turn over the top two cards, one 
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at a time. Define events A: first card is a heart, and B: second card is 
a heart. 


3. The 28 students in Mr. Tabor’s AP® Statistics class took a quiz on 
conditional probability and independence. The two-way table 
summarizes the class’s quiz results based on gender and whether the 
student got an A. Choose a student from the class at random. The 
events of interest are “female” and “got an A.” 


Gender 
Female Male 
Yes 1 
Got anA 
No 6 


The General Multiplication Rule 


Suppose that A and B are two events resulting from the same chance 
process. We can find the probability P(A or B) with the general addition 
rule: 


P(A or B)=P(A)+P(B)-P(A and B) 
P(A or B) = P(A) + P(B) — P(A and B) 


How do we find the probability that both events happen, P(A and B)? 

Consider this situation: about 55% of high school students participate 
in a school athletic team at some level. Roughly 6% of these athletes go on 
to play on a college team in the NCAA.14 What percent of high school 
students play a sport in high school and go on to play on an NCAA team? 
About 6% of 55%, or roughly 3.3%. 

Let’s restate the situation in probability language. Suppose we select a 
high school student at random. What’s the probability that the student 
plays a sport in high school and goes on to play on an NCAA team? The 
given information suggests that 


P(high school sport)=0.55 and P(NCAA team | high school sport)=0.06 
P (high school sport) = 0.55 and P (NCAA team | high school sport) = 0.06 
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By the logic just stated, 


P(high school sport and NCAA team)=P(high school sport): P(NCAA team 
(0.55)(0.06)=0.033 
P(high school sport and NCAA team) 


= P(high school sport) - P(/NCAA team | high school sport) 
= (0.55)(0.06) = 0.033 


This is an example of the general multiplication rule. 


DEFINITION General multiplication rule 


For any chance process, the probability that events A and B both 
occur can be found using the general multiplication rule: 


P(A and B)=P(AnB)=P(A): P(BIA) 
P(AandB) = P(AMB) = P(A)- P(BJA) 


The general multiplication rule says that for both of two events to 
occur, first one must occur. Then, given that the first event has occurred, 
the second must occur. To confirm that this result is correct, start with the 
conditional probability formula 


_ P(BNA) 
P(BJA)=P(Bn A)P(A) P(A) 


The numerator gives the probability we want because P(B n A) is the 
same as P(A n B). Multiply both sides of the previous equation by P(A) to 
get 


P(A): P(BJA)=P(AnB)P (A) : P(BJA) = P(ANB) 


EXAMPLE | Teens and social media >» 


The general multiplication rule 


PROBLEM: The Pew Internet and American Life Project reported that 
79% of teenagers (ages 13 to 17) use social media, and that 39% of 
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teens who use social media feel pressure to post content that will be 
popular and get lots of comments or likes.42 Find the probability that a 
randomly selected teen uses social media and feels pressure to post 
content that will be popular and get lots of comments or likes. 


SOLUTION: 


P (use social media and feel pressure) =P (use social media) : P (feel pressure | 
use social media)=(0.79)(0.39) = 0.308 


P (use social media and feel pressure) = P (use social media) - P (feel pressure | use social media) 
= (0.79) (0.39) = 0.308 
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Tree Diagrams and Conditional Probability 


Shannon hits the snooze button on her alarm on 60% of school days. If she 
hits snooze, there is a 0.70 probability that she makes it to her first class on 
time. If she doesn’t hit snooze and gets up right away, there is a 0.90 
probability that she makes it to class on time. Suppose we select a school 
day at random and record whether Shannon hits the snooze button and 
whether she arrives in class on time. Figure 5.7 shows a tree diagram for 
this chance process. 


P(on time | hits snooze) 


Hits snooze 


0.60_- Pad button ia * 


0.40 0.90 
ema 
snooze button| —~_ 
= oe 
Late 


FIGURE 5.7 A tree diagram displaying the sample space of randomly choosing a 


school day and noting if Shannon hits the snooze button or not and whether she 
gets to her first class on time. 


There are only two possible outcomes at the first “stage” of this chance 
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process: Shannon hits the snooze button or she doesn’t. The first set of 
branches in the tree diagram displays these outcomes with their 
probabilities. The second set of branches shows the two possible results at 
the next “stage” of the process—Shannon gets to her first class on time or 
arrives late—and the probability of each result based on whether or not she 
hit the snooze button. Note that the probabilities on the second set of 
branches are conditional probabilities, like P(on time | hits snooze) = 0.70. 


DEFINITION Tree diagram 


A tree diagram shows the sample space of a chance process 
involving multiple stages. The probability of each outcome is shown 
on the corresponding branch of the tree. All probabilities after the 
first stage are conditional probabilities. 


We can ask some interesting questions related to the tree diagram: 


e What is the probability that Shannon hits the snooze button and is 
late for class on a randomly selected school day? The general 
multiplication rule provides the answer: 


P(hits snooze and late)=P(hits snooze) - P(late | hits snooze)=(0.60) 
(0.30)=0.18 
P (hits snooze and late) = P (hits snooze) - P (late | hits snooze) 
= (0.60) (0.30) 
= 0.18 


There is an 18% chance that Shannon hits the snooze button and is late 
for class. Note that the previous calculation amounts to multiplying 
probabilities along the branches of the tree diagram. 


e What’s the probability that Shannon is late to class on a randomly 
selected school day? Figure 5.8 on the next page illustrates two ways 
this can happen: Shannon hits the snooze button and is late or she 
doesn’t hit snooze and is late. Because these outcomes are mutually 
exclusive, 


P(late)=P(hits snooze and late)+P(doesn’t hits snooze and late) 
P (late) = P (hits snooze and late) + P (doesn’t hits snooze and late) 
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On time 


Hits snooze 
button 


P(hits snooze and late) 
= (0.06)(0.30) = 0.18 


0.60 


P(late) 
naa nes On time = 0.18 + 0.04 = 0.22 


snooze button| ~~ ——— 
0. 10 Late 


FIGURE 5.8 Tree diagram showing the two possible ways that Shannon can be 


P(doesn’t hit snooze and late) 
= (0.40)(0.10) = 0.04 


late to class on a randomly selected day. 
The general multiplication rule tells us that 


P(doesn’t hit snooze and late)=P(doesn’t hit snooze): P(late | 
doesn’t hit snooze)=(0.40)(0.10)=0.04 


P (doesn’t hit snooze and late) = P (doesn’t hit snooze) - P (late | doesn’t hit snooze) 
= (0.40) (0.10) 
= 0.04 


So P(late) = 0.18 + 0.04 = 0.22.. There is a 22% chance that Shannon 
will be late to class. 


Suppose that Shannon is late for class on a randomly chosen school 
day. What is the probability that she hit the snooze button that 
morning? To find this probability, we start with the given information 
that Shannon is late, which is displayed on the second set of branches in 
the tree diagram, and ask whether she hit the snooze button, which is 
shown on the first set of branches. We can use the information from the 
tree diagram and the conditional probability formula to do the required 
calculation: 


P(hit snooze button | late)=P(hit snooze button and 
late)P(late)=0.180.22=0.818 
P(hit snooze button and late) 


P (hit snooze button | late) = P(late) 
ate 

0.18 

0.22 


0.818 


Given that Shannon is late for school on a randomly selected day, there 
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is a 0.818 probability that she hit the snooze button. 


This method for solving conditional probability problems that involve “going 


backward” in a tree diagram is sometimes referred to as Bayes’s theorem. It was 


developed by the Reverend Thomas Bayes in the 1700s. 


Some interesting conditional probability questions—like this one about 
P(hit snooze button | late)—involve “going in reverse” on a tree diagram. 
Note that we just use the conditional probability formula and plug in the 
appropriate values to answer such questions. 


EXAMPLE |Do people read more ebooks or print 
books? |») 


Tree diagrams and probability 


Burlingham/Shuterstock.com 


PROBLEM: Recently, Harris Interactive reported that 20% of 
millennials, 25% of Gen Xers, 21% of baby boomers, and 17% of 
matures (age 68 and older) read more ebooks than print books. 
According to the U.S. Census Bureau, 34% of those 18 and over are 
millennials, 22% are Gen Xers, 30% are baby boomers, and 14% are 
matures. Suppose we select one U.S. adult at random and record which 
generation the person is from and whether she or he reads more ebooks 
or print books. 


a. Draw a tree diagram to model this chance process. 


b. Find the probability that the person reads more ebooks than print 
books. 
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c. Suppose the chosen person reads more ebooks than print books. 
What’s the probability that she or he is a millennial? 


SOLUTION: 


0.20 Reads more ebooks 
Millennial 


0.80 ~ Reads more print books 


0.25 Reads more ebooks 
Gen Xer 


Randomly 0.75 ~ Reads more print books 
chosen : 
US.acmit Baby 0.21 _ Reads more ebooks 


Boomer 0.79 Reads more print books 


. Reads more ebooks 


0.83 Reads more print books 


b. P(reads more ebooks) = (0.34)(0.20)+(0.22)(0.25)+(0.30)(0.21)+(0.14) 


(0.17)= 0.0680+0.0550+0.0630+0.0238= 0.2098 

P (reads more ebooks) = (0.34) (0.20) + (0.22) (0.25) + (0.30) (0.21) + (0.14) (0.17) 
0.0680 + 0.0550 + 0.0630 + 0.0238 

= 0.2098 


II 


c. P(millennial | reads more ebooks) = 


P(millennial and reads more ebooks)P(reads more ebooks)=0.0680.2098: 


é ‘ P(millennial and reads more ebooks) 
P (millennial|reads more ebooks) RARER 7 SOUS" PaRUESSEDT OTE EERE 


P(reads more ebooks) 
0.068 
0.2098 


0.3241 


P(A|B)=P(AnB)P(B) 
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FOR PRACTICE, TRY EXERCISE 81 


One of the most important applications of conditional probability is in 
the area of drug and disease testing. 


EXAMPLE |Mammograms (>) 


Tree diagrams and conditional probability 


Sam Edwards/Getty Images 


PROBLEM: Many women choose to have annual mammograms to 
screen for breast cancer after age 40. A mammogram isn’t foolproof. 
Sometimes the test suggests that a woman has breast cancer when she 
really doesn’t (a “false positive”). Other times, the test says that a 
woman doesn’t have breast cancer when she actually does (a “false 
negative”). 

Suppose that we know the following information about breast cancer 
and mammograms in a particular population: 


e One percent of the women aged 40 or over in this population have 
breast cancer. 


e For women who have breast cancer, the probability of a negative 
mammogram is 0.03. 


e For women who don’t have breast cancer, the probability of a positive 
mammogram is 0.06. 


A randomly selected woman aged 40 or over from this population tests 
positive for breast cancer in a mammogram. Find the probability that 
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she actually has breast cancer. 


SOLUTION: 


Positive 


Hes breast 0.97 mammogram 
cancer 
0.01 0.03 Negative 


mammogram 
Positive 
0.99 , 0.06 mammogram 
Doesn't have 
breast cancer ; 
0.94 Negative 
mammogram 


Start by making a tree diagram to summarize the possible outcomes. 

e Because 1% of women in this population have breast cancer, 99% 
don’t have breast cancer. 
Of those women who do have breast cancer, 3% would test negative 


on a mammogram. The remaining 97% would (correctly) test positive. 
Among the women who don’t have breast cancer, 6% would test 
positive on a mammogram. The remaining 94% would (correctly) test 
negative. 


P (breast cancer | positive mammogram) 
=P(breast cancer and positive mammogram)P(positive mammogram)=(0.01) 
(0.97)(0.01)(0.97)+(0.99)(0.06)=0.00970.0691=0.14 


P(breast cancer and positive mammogram) 


P (breast cancer|positive mammogram) — 
P(positive mammogram) 


(0.01)(0.97) 
(0.01)(0.97)+(0.99) (0.06) 
0.0097 
0.0691 
0.14 


II 
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Are you surprised by the final result of the example—given that a 
randomly selected woman from the population in question has a positive 
mammogram, there is only about a 14% chance that she has breast cancer? 
Most people are. Sometimes a two-way table that includes counts is more 
convincing. 
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To make calculations simple, we’ ll suppose that there are exactly 
10,000 women aged 40 or over in this population, and that exactly 100 
have breast cancer (that’s 1% of the women). 


e How many of those 100 would have a positive mammogram? It would 
be 97% of 100, or 97 of them. That leaves 3 who would test negative. 


e How many of the 9900 women who don’t have breast cancer would get 
a positive mammogram? Six percent of them, or (9900)(0.06) = 594 
women. The remaining 9900 — 594 = 9306 would test negative. 


e In total, 97 + 594 = 691 women would have positive mammograms and 
3 + 9306 = 9309 women would have negative mammograms. 


This information is summarized in the two-way table. 


Has breast cancer? 


Yes No Total 

Mammogram Positive 691 
result Negative 9309 
Total 100 9900 10,000 


Given that a randomly selected woman has a positive mammogram, the 
two-way table shows that the conditional probability 


P(breast cancer|positive mammogram)=97/691=0.14 
P(breast cancer|positive mammogram) = 97/691 = 0.14 


This example illustrates an important fact when considering proposals 
for widespread testing for serious diseases or illegal drug use: if the 
condition being tested is uncommon in the population, many positives will 
be false positives. The best remedy is to retest any individual who tests 
positive. 


CHECK YOUR UNDERSTANDING 


( 


y A computer company makes desktop, laptop, and tablet computers 
at factories in two states: California and Texas. The California factory 
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produces 40% of the company’s computers and the Texas factory makes 
the rest. Of the computers made in California, 25% are desktops, 30% 
are laptops, and the rest are tablets. Of those made in Texas, 10% are 
desktops, 20% are laptops, and the rest are tablets. All computers are 
first shipped to a distribution center in Missouri before being sent out to 
stores. Suppose we select a computer at random from the distribution 
center and observe where it was made and whether it is a desktop, 


laptop, or tablet..4 
1. Construct a tree diagram to model this chance process. 
2. Find the probability that the computer is a tablet. 


3. Given that a tablet computer is selected, what is the probability that it 
was made in California? 


The Multiplication Rule for Independent Events 


What happens to the general multiplication rule in the special case when 
events A and B are independent? In that case, P(B | A) = P(B). We can 
simplify the general multiplication rule as follows: 


P(A and B)=P(AnB)==P(A): P(B|A)P(A): P(B) 
P(AandB) = P(ANB) = P(A)-P(BIA) 
= P(A)-P(B) 


This result is known as the multiplication rule for independent events. 


DEFINITION Multiplication rule for independent events 
If A and B are independent events, the probability that A and B both 


occur is 


P(A and B)=P(AnB)=P(A): P(B) 
P(A andB) = P(AMB) = P(A) - P(B) 


oOo Note that this rule applies only to independent events. 


Suppose that Pedro drives the same route to work on Monday through 
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Friday. His route includes one traffic light. The probability that the light 
will be green when Pedro arrives is 0.42, yellow is 0.03, and red is 0.55. 


. What’s the probability that the light is green on Monday and red on 
Tuesday? Let event A be green light on Monday and event B be red 
light on Tuesday. These two events are independent because knowing 
that the light was green on Monday doesn’t help us predict the color of 
the light on Tuesday. By the multiplication rule for independent events, 


P(green on Monday and red on Tuesday)=P(A and B) =P(A): P(B)= 
(0.42)(0.55)=0.231 
P (green on Monday and red on Tuesday) = P(A and B) 
= P(A)-P(B) 
= (0.42) (0.55) 
= 0.231 


There’s about a 23% chance that the light will be green on Monday and 
red on Tuesday. 


’, What’s the probability that Pedro finds the light red on Monday 
through Friday? We can extend the multiplication rule for independent 
events to more than two events: 


P(red Monday and red Tuesday and red Wednesday and red 
Thursday and red Friday)=P(red Monday): P(red Tuesday) - P(red 
Wednesday): P(red Thursday) : P(red Friday)= (0.55)(0.55)(0.55)(0.55) 
(0.55)=(0.55)5=0.0503 


P(red Monday and red Tuesday and red Wednesday and red Thursday and red Friday) 
=P(red Monday) - P(red Tuesday) - P(red Wednesday) - P(red Thursday) - P(red Friday) 
=(0.55)(0.55) (0.55) (0.55) (0.55) 

—(0.55)° 

= 0.0503 

There is about a 5% chance that Pedro will encounter a red light on all 
five days in a work week. 


EXAMPLE |The Challenger disaster » 


Multiplication rule for independent events 
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PROBLEM: On January 28, 1986, the space shuttle Challenger 
exploded on takeoff. All seven crew members were killed. Following 
the disaster, scientists and statisticians helped analyze what went wrong. 
They determined that the failure of O-ring joints in the shuttle’s booster 
rockets was to blame. Under the cold conditions that day, experts 
estimated that the probability that an individual O-ring joint would 
function properly was 0.977. But there were six of these O-ring joints, 
and all six had to function properly for the shuttle to launch safely. 
Assuming that O-ring joints succeed or fail independently, find the 
probability that the shuttle would launch safely under similar conditions. 


SOLUTION: 
P(O-ring 1 OK and O-ring 2 OK and O-ring 3 OK and O-ring 4 OK and O- 
ring 5 OK and O-ring 6 OK)=P(O-ring 1 OK): P(O-ring 2 OK) - P(O- 
ring 3 OK): P(O-ring 4 OK) P(O-ring 5 OK) P(O-ring 6 OK)= (0.977)(0.977) 
(0.977)(0.977)(0.977)(0.977)=(0.977)6=0.870 
P(O-ring 1 OK and O-ring 2 OK and O-ring 3 OK and O-ring 4 OK and O-ring 5 OK and O-ring 6 OK) 
= P(O-ring 1 OK) - P(O-ring 2 OK) - P(O-ring 3 OK) - P(O-ring 4 OK) - P(O-ring 5 OK) - P(O-ring 6 OK) 
= (0.977) (0.977) (0.977) (0.977) (0.977) (0.977) 
= (0.977)° 
= 0.870 


FOR PRACTICE, TRY EXERCISE 89 


The multiplication rule for independent events can also be used to help 
find P(at least one). In the preceding example, the shuttle would not launch 
safely under similar conditions if 1 or 2 or 3 or 4 or 5 or all 6 O-ring joints 
fail—that is, if at least one O-ring fails. The only possible number of O- 
ring failures excluded is 0. So the events “at least one O-ring joint fails” 
and “no O-ring joints fail” are complementary events. By the complement 
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rule, 
P(at least one O-ring fails)=1—P(no O-ring fails)=1—0.87=0.13 
P (at least one O-ring fails) = 1-— P(no O-ring fails) 
= 1-087 
0.13 


That’s a very high chance of failure! As a result of this analysis following 
the Challenger disaster, NASA made important safety changes to the 
design of the shuttle’s booster rockets. 


EXAMPLE | Rapid HIV testing (») 
Finding the probability of “at least one” 


PROBLEM: Many people who visit clinics to be tested for HIV, the 
virus that causes AIDS, don’t come back to learn their test results. 
Clinics now use “rapid HIV tests” that give a result while the client 
waits. In a clinic in Malawi, for example, use of rapid tests increased the 
percentage of clients who learned their test results from 69% to over 
99%. 


The trade-off for fast results is that rapid tests are less accurate than 
slower laboratory tests. Applied to people who have no HIV antibodies, 
one rapid test has a probability of about 0.004 of producing a false 
positive (i.e., of falsely indicating that antibodies are present).12 


If a clinic tests 200 randomly selected people who are free of HIV 
antibodies, what is the probability that at least one false positive will 
occur? Assume that test results for different individuals are independent. 
SOLUTION: 


P(no false positives)=P(all 200 tests negative)=(0.996)(0.996):- 
(0.996)=0.996200=0.4486P(at least one false positive)=1 — 0.4486 = 
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P (no false positives) = P (all 200 tests negative) 
(0.996) (0.996) - - - (0.996) 
= 0.9967 

0.4486 

0.5514 P (at least one false positive) = 1 — 0.4486 = 0.5514 


Start by finding P (no false positives). 


The probability that any individual test result is negative is 1 — 0.004 = 


0.996. 


FOR PRACTICE, TRY EXERCISE 91 


USING THE MULTIPLICATION RULE FOR INDEPENDENT 
EVENTS WISELY The multiplication rule P(A and B) = P(A) - P(B) 
holds if A and B are independent but not otherwise. The addition rule P(A 
or B) = P(A) + P(B) holds if A and B are mutually exclusive but not 
otherwise. oo Resist the temptation to use these simple rules when the 


conditions that justify them are not met. 
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EXAMPLE | Watch the weather! (») 


Beware lack of independence! 


PROBLEM: Hacienda Heights and La Puente are two neighboring 
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suburbs in the Los Angeles area. According to the local newspaper, 
there is a 50% chance of rain tomorrow in Hacienda Heights and a 50% 
chance of rain in La Puente. Does this mean that there is a (0.5)(0.5) = 
0.25 probability that it will rain in both cities tomorrow? 


SOLUTION: 


No; it is not appropriate to multiply the two probabilities, because 
“raining tomorrow in Hacienda Heights” and “raining tomorrow in La 
Puente” are not independent events. If it is raining in one of these 
locations, there is a high probability that it is raining in the other 
location because they are geographically close to each other. 


FOR PRACTICE, TRY EXERCISE 93 


Is there a connection between mutually exclusive and independent? 
Let’s start with a new chance process. Choose a U.S. adult at random. 
Define event A: the person is male, and event B: the person is pregnant. 
It’s fairly clear that these two events are mutually exclusive (can’t happen 
together)! Are they also independent? 

If you know that event A has occurred, does this change the probability 
that event B happens? Of course! If we know the person is male, then the 
chance that the person is pregnant is 0. But the probability of selecting 
someone who is pregnant is greater than 0. Because P(B | A) # P(B), the 
two events are not independent. Two mutually exclusive events (with 
nonzero probabilities) can never be independent, because if one event 
happens, the other event is guaranteed not to happen. 


CHECK YOUR UNDERSTANDING 


. 


y Questions 1 and 2 refer to the following setting. New Jersey Transit 
claims that its 8:00 A.M. train from Princeton to New York has 
probability 0.9 of arriving on time. Assume that this claim is true. 


747 


1. Find the probability that the train arrives late on Monday but on time 
on Tuesday. 

2. What’s the probability that the train arrives late at least once in a 5- 
day week? 

3. Government data show that 8% of adults are full-time college 
students and that 15% of adults are age 65 or older. If we randomly 
select an adult, is P(full-time college student and age 65 or older) = 
(0.08)(0.15)? Why or why not? 


Section 5.3 Summary 


e A conditional probability describes the probability that one event 
happens given that another event is already known to have happened. 


e One way to calculate a conditional probability is to use the formula 


P(A|B)=P(A and B)P(B)=P(AnB)P(B)=P(both events occur)P(given 
P(A\B) = P(AandB) P(ANB)  P(both events occur) 


event occurs) P(B) P(B) P(given event occurs) 


e When knowing whether or not one event has occurred does not change 
the probability that another event happens, we say that the two events 
are independent. Events A and B are independent if 


P(A|B)=P(A|BC)=P(A)P(AIB) = P(A|B®) = P(A) 
or, alternatively, if 
P(BIA)=P(BIAC)=P(B)P (BIA) = P(BIA®) = P(B) 


e Use the general multiplication rule to calculate the probability that 
events A and B both occur: 


P(A and B)=P(AnB)=P(A): P(BIA) 
P(AandB) = P(AMB) = P(A) - P(BJA) 


e When a chance process involves multiple stages, a tree diagram can be 
used to display the sample space and to help answer questions involving 
conditional probability. 
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e In the special case of independent events, the multiplication rule 
becomes 


P(A and B)=P(AnB)=P(A): P(B)P(A and B) = P(A MB) = P(A)- P(B) 


Section 5.3 Exercises 


61. 


62. 


pg 332 |p) Superpowers A random sample of 415 children from 


England and the United States who completed a survey in a recent 
year was selected. Each student’s country of origin was recorded 
along with which superpower they would most like to have: the 
ability to fly, ability to freeze time, invisibility, superstrength, or 
telepathy (ability to read minds). The data are summarized in the two- 
way table. 


Country 
England US. Total 
Fly he 
Freeze time 96 
Superpower _ Invisibility 67 
Superstrength 43 
Telepathy 110 
Total 200 215 A415 


Suppose we randomly select one of these students. Define events E: 
England, T: telepathy, and S: superstrength. 


a. Find P(T | E). Interpret this value in context. 


b. Given that the student did not choose superstrength, what’s the 
probability that this child is from England? Write your answer as a 
probability statement using correct symbols for the events. 


Get rich A survey of 4826 randomly selected young adults (aged 19 
to 25) asked, “What do you think are the chances you will have much 
more than a middle-class income at age 30?” The two-way table 


summarizes the responses.1® 
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Gender 


Total 
Almost no 194 
chance 
Some chance 712 
but probably 

Opinion poe 

A 50-50 1416 
chance 
A good chance 1421 
Almost certain 1083 
Total 4826 


Choose a survey respondent at random. Define events G: a good 
chance, M: male, and N: almost no chance. 


a. Find P(G | M). 


Interpret this value in context. 


b. Given that the chosen survey respondent didn’t say “almost no 
chance,” what’s the probability that this person is female? Write 
your answer as a probability statement using correct symbols for 


the events. 


Body image A random sample of 1200 U.S. college students was 
asked, “What is your perception of your own body? Do you feel that 
you are overweight, underweight, or about right?” The two-way table 
below summarizes the data on perceived body image by gender.44 


Body image 


Gender 
Female Male 
About right 295 
Overweight 72 
Underweight 37 73 


Suppose we randomly select one of the survey respondents. 


a. Given that the person perceived his or her body image as about 
right, what’s the probability that the person is female? 


b. If the person selected is female, what’s the probability that she did 
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not perceive her body image as overweight? 


Temperature and hatching How is the hatching of water python 
eggs influenced by the temperature of a snake’s nest? Researchers 
randomly assigned newly laid eggs to one of three water 
temperatures: cold, neutral, or hot. Hot duplicates the extra warmth 
provided by the mother python, and cold duplicates the absence of the 
mother. 


Nest temperature 


Cold Neutral Hot 
Hatching Hatched 75 
status Didn’t hatch 11 18 29 


Suppose we select one of the eggs at random. 


a. Given that the chosen egg was assigned to hot water, what is the 
probability that it hatched? 


b. If the chosen egg hatched, what is the probability that it was not 


assigned to hot water? 


Foreign-language study Choose a student in grades 9 to 12 at 
random and ask if he or she is studying a language other than English. 
Here is the distribution of results: 


Language Spanish French German All others None 
Probability 0.26 0.09 0.03 0.03 0.59 


a. What’s the probability that the student is studying a language other 
than English? 


b. What is the probability that a student is studying Spanish given 


that he or she is studying some language other than English? 


Income tax returns Here is the distribution of the adjusted gross 
income (in thousands of dollars) reported on individual federal 
income tax returns in a recent year: 


Income <25 25-49 50-99 100-499 2500 
Probability 0.431 0.248 0.215 0.100 0.006 
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a. What is the probability that a randomly chosen return shows an 


adjusted gross income of $50,000 or more? 


b. Given that a return shows an income of at least $50,000, what is 


the conditional probability that the income is at least $100,000? 


Tall people and basketball players Select an adult at random. 
Define events T: person is over 6 feet tall, and B: person is a 
professional basketball player. Rank the following probabilities from 
smallest to largest. Justify your answer. 


P(T) P(B) P (T|B) P (BIT) 


Teachers and college degrees Select an adult at random. Define 
events D: person has earned a college degree, and T: person’s career 
is teaching. Rank the following probabilities from smallest to largest. 
Justify your answer. 


P(D) P(T) P (DIT) P (TID) 


pg 333 |p) Dogs and cats In one large city, 40% of all households 


own a dog, 32% own a cat, and 18% own both. Suppose we randomly 
select a household and learn that the household owns a cat. Find the 
probability that the household owns a dog. 


Mac or PC? A recent census at a major university revealed that 60% 
of its students mainly used Macs. The rest mainly used PCs. At the 
time of the census, 67% of the school’s students were undergraduates. 
The rest were graduate students. In the census, 23% of respondents 
were graduate students and used a Mac as their main computer. 
Suppose we select a student at random from among those who were 
part of the census and learn that the person mainly uses a Mac. Find 
the probability that the person is a graduate student. 


pg 336 |p) Who owns a home? What is the relationship between 


educational achievement and home ownership? A random sample of 
500 U.S. adults was selected. Each member of the sample was 
identified as a high school graduate (or not) and as a homeowner (or 
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not). The two-way table summarizes the data. 


High school graduate 
Total 
Yes 340 
Homeowner 
No 160 
Total 500 


Are the events “homeowner” and “high school graduate” 
independent? Justify your answer. 


. Is this your card? A standard deck of playing cards (with jokers 
removed) consists of 52 cards in four suits—clubs, diamonds, hearts, 
and spades. Each suit has 13 cards, with denominations ace, 2, 3, 4, 5, 
6, 7, 8, 9, 10, jack, queen, and king. The jacks, queens, and kings are 
referred to as “face cards.” Imagine that we shuffle the deck 
thoroughly and deal one card. The two-way table summarizes the 
sample space for this chance process based on whether or not the card 
is a face card and whether or not the card is a heart. 


Type of card 
Face card Nonface card Total 
Heart 13 
Suit 
Nonheart 39 
Total 52 


Are the events “heart” and “face card” independent? Justify your 
answer. 


. Cell phones The Pew Research Center asked a random sample of 
2024 adult cell-phone owners from the United States their age and 
which type of cell phone they own: iPhone, Android, or other 
(including non-smartphones). The two-way table summarizes the 
data. 


Age 
18-34 35-54 55+ Total 


iPhone 467 


Type of cell 
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phone Android 214 189 100 503 
Other 1054 


Total 517 637 870 2024 


Suppose we select one of the survey respondents at random. 
a. Find P(iPhone | 18-34). 


b. Use your answer from part (a) to help determine if the events 
“iPhone” and “18—34” are independent. 


. Middle school values Researchers carried out a survey of fourth-, 
fifth-, and sixth-grade students in Michigan. Students were asked 
whether good grades, athletic ability, or being popular was most 
important to them. The two-way table summarizes the survey data.12 


Grade 
4th 5th 6th Total 
Grades 168 
net Athletic 98 
important 
Popular 69 
Total 92 108 135 Sie ie. 


Suppose we select one of these students at random. 
a. Find P(athletic | 5th grade). 


b. Use your answer from part (a) to help determine if the events “5th 
grade” and “athletic” are independent. 


. Rolling dice Suppose you roll two fair, six-sided dice—one red and 
one green. Are the events “sum is 7” and “green die shows a 4” 
independent? Justify your answer. (See Figure 5.2 on page 314 for the 
sample space of this chance process.) 


. Rolling dice Suppose you roll two fair, six-sided dice—one red and 
one green. Are the events “sum is 8” and “green die shows a 4” 
independent? Justify your answer. (See Figure 5.2 on page 314 for the 
sample space of this chance process.) 
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pg 338 |») Free downloads? Illegal music downloading is a big 


problem: 29% of Internet users download music files, and 67% of 
downloaders say they don’t care if the music is copyrighted.42 Find 
the probability that a randomly selected Internet user downloads 
music and doesn’t care if it’s copyrighted. 


At the gym Suppose that 10% of adults belong to health clubs, and 
40% of these health club members go to the club at least twice a 
week. Find the probability that a randomly selected adult belongs to a 
health club and goes there at least twice a week. 


Box of chocolates According to Forrest Gump, “Life is like a box of 
chocolates. You never know what you’re gonna get.” Suppose a 
candymaker offers a special “Gump box” with 20 chocolate candies 
that look alike. In fact, 14 of the candies have soft centers and 6 have 
hard centers. Suppose you choose 3 of the candies from a Gump box 
at random. Find the probability that all three candies have soft 
centers. 


Sampling students A statistics class with 30 students has 10 males 
and 20 females. Suppose you choose 3 of the students in the class at 
random. Find the probability that all three are female. 


pg 341 >» Fill ’er up! In a certain month, 88% of automobile 


drivers filled their vehicles with regular gasoline, 2% purchased 
midgrade gas, and 10% bought premium gas.12 Of those who bought 
regular gas, 28% paid with a credit card; of customers who bought 
midgrade and premium gas, 34% and 42%, respectively, paid with a 
credit card. Suppose we select a customer at random. 


a. Draw a tree diagram to model this chance process. 
b. Find the probability that the customer paid with a credit card. 


c. Suppose the chosen customer paid with a credit card. What’s the 
probability that the customer bought premium gas? 


Media usage and good grades The Kaiser Family Foundation 
released a study about the influence of media in the lives of young 
people aged 8-18.22 In the study, 17% of the youth were classified as 
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light media users, 62% were classified as moderate media users, and 
21% were classified as heavy media users. Of the light users who 
responded, 74% described their grades as good (A’s and B’s), while 
only 68% of the moderate users and 52% of the heavy users described 
their grades as good. Suppose that we select one young person from 
the study at random. 


a. Draw a tree diagram to model this chance process. 


b. Find the probability that this person describes his or her grades as 
good. 


c. Suppose the chosen person describes his or her grades as good. 
What’s the probability that he or she is a heavy user of media? 


pg 342 |») First serve Tennis great Andy Murray made 60% of his 


first serves in a recent season. When Murray made his first serve, he 
won 76% of the points. When Murray missed his first serve and had 
to serve again, he won only 54% of the points.2! Suppose you 
randomly choose a point on which Murray served. You get distracted 
before seeing his first serve but look up in time to see Murray win the 
point. What’s the probability that he missed his first serve? 


Lactose intolerance Lactose intolerance causes difficulty in 
digesting dairy products that contain lactose (milk sugar). It is 
particularly common among people of African and Asian ancestry. In 
the United States (not including other groups and people who 
consider themselves to belong to more than one race), 82% of the 
population is White, 14% is Black, and 4% is Asian. Moreover, 15% 
of Whites, 70% of Blacks, and 90% of Asians are lactose intolerant.22 
Suppose we select a U.S. person at random and find that the person is 
lactose intolerant. What’s the probability that she or he is Asian? 


HIV testing Enzyme immunoassay (EIA) tests are used to screen 
blood specimens for the presence of antibodies to HIV, the virus that 
causes AIDS. Antibodies indicate the presence of the virus. The test 
is quite accurate but is not always correct. A false positive occurs 
when the test gives a positive result but no HIV antibodies are 
actually present in the blood. A false negative occurs when the test 
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gives a negative result but HIV antibodies are present in the blood. 
Here are approximate probabilities of positive and negative EIA 
outcomes when the blood tested does and does not actually contain 
antibodies to HIV: 22 


Test result 


Antibodies present 
Truth 


Antibodies absent 


Suppose that 1% of a large population carries antibodies to HIV in 
their blood. Imagine choosing a person from this population at 
random. If the person’s EIA test is positive, what’s the probability 
that the person has the HIV antibody? 


Metal detector A boy uses a homemade metal detector to look for 
valuable metal objects on a beach. The machine isn’t perfect—it 
beeps for only 98% of the metal objects over which it passes, and it 
beeps for 4% of the nonmetallic objects over which it passes. Suppose 
that 25% of the objects that the machine passes over are metal. 
Choose an object from this beach at random. If the machine beeps 
when it passes over this object, find the probability that the boy has 
found a metal object. 


Fundraising by telephone Tree diagrams can organize problems 
having more than two stages. The figure shows probabilities for a 
charity calling potential donors by telephone.24 Each person called is 
either a recent donor, a past donor, or a new prospect. At the next 
stage, the person called either does or does not pledge to contribute, 
with conditional probabilities that depend on the donor class to which 
the person belongs. Finally, those who make a pledge either do or 
don’t actually make a contribution. Suppose we randomly select a 
person who is called by the charity. 
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No pledge 


Contribute 


Not 


Contribute 


Not 


Contribute 


Not 


a. What is the probability that the person contributed to the charity? 


b. Given that the person contributed, find the probability that he or 


she is a recent donor. 


HIV and confirmation testing Refer to Exercise 85. Many of the 
positive results from EIA tests are false positives. It is therefore 
common practice to perform a second EJA test on another blood 
sample from a person whose initial specimen tests positive. Assume 
that the false positive and false negative rates remain the same for a 
person’s second test. Find the probability that a person who gets a 
positive result on both EIA tests has HIV antibodies. 


pg 345 |p) Merry and bright? A string of Christmas lights 


contains 20 lights. The lights are wired in series so that if any light 
fails, the whole string will go dark. Each light has probability 0.98 of 
working for a 3-year period. The lights fail independently of each 

other. Find the probability that the string of lights will remain bright 


for 3 years. 


Get rid of the penny Harris Interactive reported that 29% of all U.S. 
adults favor abolishing the penny. Assuming that responses from 
different individuals are independent, what is the probability of 
randomly selecting 3 U.S. adults who all say that they favor 


abolishing the penny? 


pg 346 |p) Is the package late? A shipping company claims that 
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90% of its shipments arrive on time. Suppose this claim is true. If we 
take a random sample of 20 shipments made by the company, what’s 
the probability that at least 1 of them arrives late? 


On a roll Suppose that you roll a fair, six-sided die 10 times. What’s 
the probability that you get at least one 6? 


pg 347 |») Who’s pregnant? According to the Current Population 


Survey (CPS), 27% of U.S. females are older than 55. The Centers for 
Disease Control and Prevention (CDC) report that 6% of all U.S. 
females are pregnant. Suppose that these results are accurate. If we 
randomly select a U.S. female, is P(pregnant and over 55) = (0.06) 
(0.27) = 0.0162? Why or why not? 


Late flights An airline reports that 85% of its flights arrive on time. 
To find the probability that a random sample of 4 of this airline’s 
flights into LaGuardia Airport in New York City on the same night all 
arrive on time, can we multiply (0.85)(0.85)(0.85)(0.85)? Why or 
why not? 


Fire or medical? Many fire stations handle more emergency calls for 
medical help than for fires. At one fire station, 81% of incoming calls 
are for medical help. Suppose we choose 4 incoming calls to the 
station at random. 


a. Find the probability that all 4 calls are for medical help. 


b. What’s the probability that at least 1 of the calls is not for medical 
help? 


c. Explain why the calculation in part (a) may not be valid if we 
choose 4 consecutive calls to the station. 


Broken links Internet sites often vanish or move so that references to 
them can’t be followed. In fact, 87% of Internet sites referred to in 
major scientific journals still work within two years of publication.22 
Suppose we randomly select 7 Internet references from scientific 


journals. 


a. Find the probability that all 7 references still work two years later. 
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b. What’s the probability that at least 1 of them doesn’t work two 
years later? 


c. Explain why the calculation in part (a) may not be valid if we 
choose 7 Internet references from one issue of the same journal. 


97. Mutually exclusive versus independent The two-way table 
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summarizes data on the gender and eye color of students in a college 
Statistics class. Imagine choosing a student from the class at random. 
Define event A: student is male, and event B: student has blue eyes.28 


Gender 
Male Female Total 
Blue 10 
Eye color 
Brown 40 
Total 50 


a. Copy and complete the two-way table so that events A and B are 
mutually exclusive. 


b. Copy and complete the two-way table so that events A and B are 
independent. 


c. Copy and complete the two-way table so that events A and B are 
not mutually exclusive and not independent. 


Independence and association The two-way table summarizes data 
from an experiment comparing the effectiveness of three different 
diets (A, B, and C) on weight loss. Researchers randomly assigned 
300 volunteer subjects to the three diets. The response variable was 
whether each subject lost weight over a 1-year period. 


Diet 
Total 
Lost Yes 180 
weight? No 120 
Total 90 100 110 300 


a. Suppose we randomly select one of the subjects from the 
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experiment. Show that the events “Diet B” and “Lost weight” are 
independent. 


b. Copy and complete the table so that there is no association 
between type of diet and whether a subject lost weight. 


c. Copy and complete the table so that there is an association between 
type of diet and whether a subject lost weight. 


99. Checking independence Suppose A and B are two events such that 
P(A) = 0.3, P(B) = 0.4, and P(A n B) = 0.12. Are events A and B 
independent? Justify your answer. 


100. Checking independence Suppose C and D are two events such that 
P(C) = 0.6, P(D) = 0.45, and P(C n D) = 0.3. Are events C and D 
independent? Justify your answer. 


101. The geometric distributions You are tossing a pair of fair, six-sided 
dice in a board game. Tosses are independent. You land in a danger 
zone that requires you to roll doubles (both faces showing the same 
number of spots) before you are allowed to play again. 


a. What is the probability of rolling doubles on a single toss of the 
dice? 


b. What is the probability that you do not roll doubles on the first 
toss, but you do on the second toss? 


c. What is the probability that the first two tosses are not doubles 
and the third toss is doubles? This is the probability that the first 
doubles occurs on the third toss. 


d. Do you see the pattern? What is the probability that the first 
doubles occurs on the kth toss? 


102. Matching suits A standard deck of playing cards consists of 52 cards 
with 13 cards in each of four suits: spades, diamonds, clubs, and 
hearts. Suppose you shuffle the deck thoroughly and deal 5 cards 
face-up onto a table. 


a. What is the probability of dealing five spades in a row? 
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b. Find the probability that all 5 cards on the table have the same 
suit. 


Multiple Choice Select the best answer for Exercises 103-106. 


03. An athlete suspected of using steroids is given two tests that operate 
independently of each other. Test A has probability 0.9 of being 
positive if steroids have been used. Test B has probability 0.8 of 
being positive if steroids have been used. What is the probability 
that neither test is positive if the athlete has used steroids? 


a. 0.08 
b. 0.28 
c. 0.02 
d. 0.38 


e. 0.72 

04. Inan effort to find the source of an outbreak of food poisoning at a 
conference, a team of medical detectives carried out a study. They 
examined all 50 people who had food poisoning and a random 
sample of 200 people attending the conference who didn’t get food 
poisoning. The detectives found that 40% of the people with food 
poisoning went to a cocktail party on the second night of the 
conference, while only 10% of the people in the random sample 
attended the same party. Which of the following statements is 
appropriate for describing the 40% of people who went to the party? 
(Let F = got food poisoning and A = attended party.) 


a. P(F|A) = 0.40 
b. P(A|F°) = 0.40 
c. P(F|A) = 0.40 
d. P(AC|F) = 0.40 
e. P(A|F) = 0.40 
05. Suppose a loaded die has the following probability model: 


Outcome 1 2 3 4 5 6 
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Probability 0.3 0.1 0.1 0.1 0.1 0.3 


If this die is thrown and the top face shows an odd number, what is 
the probability that the die shows a 1? 


a. 0.10 
b. 0.17 
c. 0.30 
d. 0.50 


e. 0.60 
06. If P(A) = 0.24, P(B) = 0.52, and A and B are independent events, 
what is P(A or B)? 


a. 0.1248 
b. 0.28 
c. 0.6352 
d. 0.76 


e. The answer cannot be determined from the information given. 
Recycle and Review 


07. BMI (2.2, 5.2, 5.3) Your body mass index (BMI) is your weight in 
kilograms divided by the square of your height in meters. Online 
BMI calculators allow you to enter weight in pounds and height in 
inches. High BMI is a common but controversial indicator of being 
overweight or obese. A study by the National Center for Health 
Statistics found that the BMI of American young women (ages 20 to 
29) is approximately Normally distributed with mean 26.8 and 


standard deviation 7.4.22 
a. People with BMI less than 18.5 are often classed as 


“underweight.” What percent of young women are underweight 
by this criterion? 


b. Suppose we select two American young women in this age group 
at random. Find the probability that at least one of them is 
classified as underweight. 
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08. Snappy dressers (4.2, 4.3) Matt and Diego suspect that people are 
more likely to agree to participate in a survey if the interviewers are 
dressed up. To test this idea, they went to the local grocery store to 
survey customers on two consecutive Saturday mornings at 10 A.M. 
On the first Saturday, they wore casual clothing (tank tops and 
jeans). On the second Saturday, they dressed in button-down shirts 
and nicer slacks. Each day, they asked every fifth person who 
walked into the store to participate in a survey. Their response 
variable was whether or not the person agreed to participate. Here 
are their results: 


Clothing 
Casual Nice 
bes Agreed 27 
Participation 
Declined 36 23 


a. Calculate the difference (Casual — Nice) in the proportion of 
subjects that agreed to participate in the survey in the two groups. 


b. Assume the study design is equivalent to randomly assigning 
shoppers to the “casual” or “nice” groups. A total of 100 trials of 
a simulation were performed to see what differences in 
proportions would occur due only to chance variation in this 
random assignment. Use the results of the simulation in the 
following dotplot to determine if the difference in proportions 
from part (a) is statistically significant. Explain your reasoning. 


T 
—().2 —0.1 0 


T 
0.2 


Simulated difference (Casual — Nice) in proportion who agreed 


c. What flaw in the design of this experiment would prevent Matt 
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and Diego from drawing a cause-and-effect conclusion about the 
impact of an interviewer’s attire on nonresponse in a survey? 
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_Mé _. FRAPPY! FREE RESPONSE 


AP® PROBLEM, 
YAY! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, concise 
response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you 
use, because you will be scored on the correctness of your methods 
as well as on the accuracy and completeness of your results and 
explanations. 


A statistics teacher has 40 students in his class, 23 females and 17 
males. At the beginning of class on a Monday, the teacher planned to 
spend time reviewing an assignment due that day. Unknown to the 
teacher, only 19 of the females and 11 of the males had completed 
the assignment. The teacher plans to randomly select students to do 
problems from the assignment on the whiteboard. 


a. What is the probability that a randomly selected student has 
completed the assignment? 


b. Are the events “selecting a female” and “selecting a student 
who completed the assignment” independent? Justify your 
answer. 

Suppose that the teacher randomly selects 4 students to do a problem 
on the whiteboard and only 2 of the students had completed the 
assignment. 
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c. Describe how to use a table of random digits to estimate the 
probability that 2 or fewer of the 4 randomly selected students 
completed the assignment. 


d. Complete three trials of your simulation using the random 
digits below and use the results to estimate the probability 
described in part (c). 


12975 =. 13258)=—s- 13048-45144) 72321) = 81940 3=— 00360 )=—— (002428 
96767 35964 23822 96012 94951 65194 50842 55372 
37609 59057 66967 83401 60705 02384 90597 93600 


After you finish, you can view two example solutions on the book’s 
website (highschool.bfwpub.com/tps6e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 
“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide you with a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own 
score. 
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Chapter 5 Review 


Section 5.1: Randomness, Probability, and Simulation 


In this section, you learned about the idea of probability. The law of large 
numbers says that when you repeat a chance process many, many times, 
the relative frequency of an outcome will approach a single number. This 
single number is called the probability of the outcome—how often we 
expect the outcome to occur in a very large number of repetitions of the 
chance process. Be sure to remember the “large” part of the law of large 
numbers. Although clear patterns emerge in a large number of repetitions, 
we shouldn’t expect such regularity in a small number of repetitions. 


Simulation is a powerful tool that we can use to imitate a chance 
process and estimate a probability. To perform a simulation, describe how 
to use a chance device to imitate one trial of the simulation. Tell what you 
will record at the end of each trial. Then perform many trials, and use the 
results of your simulation to answer the question of interest. If you are 
using random digits to perform your simulation, be sure to consider 
whether digits can be repeated within each trial. 


Section 5.2: Probability Rules 


In this section, you learned that chance behavior can be described by a 
probability model. Probability models have two parts, a list of possible 
outcomes (the sample space) and a probability for each outcome. The 
probability of each outcome in a probability model must be between 0 and 
1, and the probabilities of all the outcomes in the sample space must add to 
ie 

An event is a collection of possible outcomes from the sample space. 
The complement rule says the probability that an event occurs is 1 minus 
the probability that the event doesn’t occur. In symbols, the complement 
rule says that P(A) = 1 — P(A®). Given two events A and B from some 
chance process, use the general addition rule to find the probability that 
event A or event B occurs: 
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P(A or B)=P(A U B)=P(A)+P(B)-P(AnB) 
P(AorB) = P(A UB) = P(A) + P(B) — P(ANB) 
If the events A and B have no outcomes in common, use the addition 
rule for mutually exclusive events: P(A U B) = P(A) + P(B). 


Finally, you learned how to use two-way tables and Venn diagrams to 
display the sample space for a chance process involving two events. Using 
a two-way table or a Venn diagram is a helpful way to organize 
information and calculate probabilities involving the union (A or B) and 
the intersection (A and B) of two events. 


Section 5.3: Conditional Probability and Independence 


In this section, you learned that a conditional probability describes the 
probability of an event occurring given that another event is known to have 
already occurred. To calculate the probability that event A occurs given 
that event B has occurred, use the formula 


P(ANB) P(AandB) 

P(A|B) =————=—_————_- 

P(A|B)=P(AnB)P(B)=P(A and B)P(B) P(B) P(B) 

Two-way tables and tree diagrams are useful ways to organize the 
information provided in a conditional probability problem. Two-way 
tables are best when the problem describes the number or proportion of 
cases with certain characteristics. Tree diagrams are best when the 
problem provides the conditional probabilities of different events or 
describes a sequence of events. 


Use the general multiplication rule for calculating the probability that 
event A and event B both occur: 


P(A and B)=P(AnB)=P(A): P(BIA) 
P(AandB) = P(ANB) = P(A) - P(BIA) 


If knowing whether or not event B occurs doesn’t change the probability 
that event A occurs, then events A and B are independent. That is, events 
A and B are independent if P(A | B) = P(A | B®) = P(A). If events A and B 


are independent, use the multiplication rule for independent events to find 
the probability that events A and B both occur: P(ANB) = P(A) - P(B). 
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What Did You Learn? 


Learning Objective 


Interpret probability as 
a long-run relative 
frequency. 


Use simulation to 
model chance 
behavior. 


Give a probability 
model for a chance 
process with equally 
likely outcomes and 
use it to find the 
probability of an 
event. 


Use basic probability 
rules, including the 
complement rule and 
the addition rule for 
mutually exclusive 
events. 


Use a two-way table 
or Venn diagram to 
model a chance 
process and calculate 
probabilities involving 
two events. 


Apply the general 
addition rule to 
calculate probabilities. 


Calculate and interpret 
conditional 
probabilities. 


Determine if two 
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events are 
independent. 


Use the general 
multiplication rule to 
calculate probabilities. 


Use a tree diagram to 
model a chance 
process involving a 
sequence of outcomes 
and to calculate 
probabilities. 


When appropriate, use 
the multiplication rule 
for independent events 
to calculate 
probabilities. 


3.3 


5.3 


3.3 


338 


341, 342 


345, 346, 347 


771 


Chapter 5 Review Exercises 


These exercises are designed to help you review the important ideas and 
methods of the chapter. 


R5.1 |») Butter side down Researchers at Manchester Metropolitan 


University in England determined that if a piece of toast is dropped 
from a 2.5-foot-high table, the probability that it lands butter side 
down is 0.81. 


a. Explain what this probability means. 


Suppose that the researchers dropped 4 pieces of toast, and all of 
them landed butter side down. Does that make it more likely that 
the next piece of toast will land with the butter side up? Explain 
your answer. 


R5.2 |») Butter side down Refer to the preceding exercise. Maria 


decides to test this probability and drops 10 pieces of toast from a 
2.5-foot table. Only 4 of them land butter side down. Maria wants to 
perform a simulation to estimate the probability that 4 or fewer 
pieces of toast out of 10 would land butter side down if the 
researchers’ 0.81 probability value is correct. 


a. 


Describe how you would use a table of random digits to perform 
the simulation. 


. Perform 3 trials of the simulation using the random digits given. 


Copy the digits onto your paper and mark directly on or above 
them so that someone can follow what you did. 


29077 14863 61683 47052 62224 91025 
95052 90908 73592 75186 87136 95761 
27102 56027 5959892 33063 41842 81868 


. The dotplot displays the results of 50 simulated trials of dropping 


10 pieces of toast. Is there convincing evidence that the 
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researchers’ 0.81 probability value is incorrect? Explain your 
answer. 
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Simulated number of butter side down 


R5.3 |) Rock smashes scissors Almost everyone has played the game 


rock-paper-scissors at some point. Two players face each other and, 
at the count of 3, make a fist (rock), an extended hand, palm side 
down (paper), or a “V” with the index and middle fingers (scissors). 
The winner is determined by these rules: rock smashes scissors; 
paper covers rock; and scissors cut paper. If both players choose the 
same object, then the game is a tie. Suppose that Player 1 and Player 
2 are both equally likely to choose rock, paper, or scissors. 


a. Give a probability model for this chance process. 


b. Find the probability that Player 1 wins the game on the first 
throw. 


R5.4 |) What kind of vehicle? Randomly select a new vehicle sold in 
the United States in a certain month.22 The probability distribution 


for the type of vehicle chosen is given here. 


Vehicle type Passenger car Pickup truck SUV _ Crossover Minivan 
Probability 0.46 0.15 0.10 2 0.05 


a. What is the probability that the vehicle is a crossover? How do 
you know? 


b. Find the probability that the vehicle is not an SUV or a minivan. 


c. Given that the vehicle is not a passenger car, what is the 
probability that it is a pickup truck? 


R5.5 |) Drive to exercise? The two-way table summarizes the 


responses of 120 people to a survey in which they were asked, “Do 
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you exercise for at least 30 minutes four or more times per week?” 
and “What kind of vehicle do you drive?” 


Car type 
Truck 
: Yes 12 
Exercise? 
No 24 


Suppose one person from this sample is randomly selected. 
a. Find the probability that the person drives an SUV. 


b. Find the probability that the person drives a sedan or exercises 
for at least 30 minutes four or more times per week. 


c. Find the probability that the person does not drive a truck, given 
that she or he exercises for at least 30 minutes four or more times 
per week. 


R5.6 |») Mike’s pizza You work at Mike’s pizza shop. You have the 


following information about the 9 pizzas in the oven: 3 of the 9 have 
thick crust and 2 of the 3 thick-crust pizzas have mushrooms. Of the 
remaining 6 pizzas, 4 have mushrooms. 


a. Are the events “thick-crust pizza” and “pizza with mushrooms” 
mutually exclusive? Justify your answer. 


b. Are the events “thick-crust pizza” and “pizza with mushrooms” 
independent? Justify your answer. 


c. Suppose you randomly select 2 of the pizzas in the oven. Find the 
probability that both have mushrooms. 


R5.7 |) Does the new hire use drugs? Many employers require 


prospective employees to take a drug test. A positive result on this 
test suggests that the prospective employee uses illegal drugs. 
However, not all people who test positive use illegal drugs. The test 
result could be a false positive. A negative test result could be a false 
negative if the person really does use illegal drugs. Suppose that 4% 
of prospective employees use drugs, and that the drug test has a false 
positive rate of 5%, and a false negative rate of 10%.22 Imagine 
choosing a prospective employee at random. 
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a. Draw a tree diagram to model this chance process. 
b. Find the probability that the drug test result is positive. 


c. If the prospective employee’s drug test result is positive, find the 
probability that she or he uses illegal drugs. 


R5.8 |p) Lucky penny? Harris Interactive reported that 33% of U.S. 


adults believe that finding and picking up a penny is good luck. 
Assuming that responses from different individuals are independent, 
what is the probability of randomly selecting 10 U.S. adults and 
finding at least 1 person who believes that finding and picking up a 
penny is good luck? 
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Chapter 5 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 
Questions T5.1 to [5.3 refer to the following setting. A group of 125 truck 
owners were asked what brand of truck they owned and whether or not the 
truck has four-wheel drive. The results are summarized in the two-way 
table below. Suppose we randomly select one of these truck owners. 


Four-wheel drive? 


No 

Ford 17 

Brand of truck Chevy 18 
Dodge 10 


T5.1 What is the probability that the person owns a Dodge or has four- 
wheel drive? 


a. 20/80 

b. 20/125 
c. 80/125 
d. 90/125 
e. 110/125 


T5.2 What is the probability that the person owns a Chevy, given that 
the truck has four-wheel drive? 


a. 32/50 
b. 32/80 
c. 32/125 
d. 50/125 
e. 80/125 


T5.3 Which one of the following is true about the events “Owner has a 
Chevy” and “Owner’s truck has four-wheel drive’’? 
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These two events are mutually exclusive and independent. 
These two events are mutually exclusive, but not independent. 


These two events are not mutually exclusive, but they are 
independent. 


These two events are neither mutually exclusive nor 
independent. 


These two events are mutually exclusive, but we do not have 
enough information to determine if they are independent. 


T5.4 A spinner has three equally sized regions: blue, red, and green. 
Jonny spins the spinner 3 times and gets 3 blues in a row. If he 
spins the spinner 297 more times, how many more blues is he most 
likely to get? 


a. 97 
b. 

C. 
d. 


e. 


99 

100 
101 
103 


Questions T5.5 and 15.6 refer to the following setting. Wilt is a fine 
basketball player, but his free-throw shooting could use some work. For 
the past three seasons, he has made only 56% of his free throws. His coach 
sends him to a summer clinic to work on his shot, and when he returns, his 
coach has him step to the free-throw line and take 50 shots. He makes 34 
shots. Is this result convincing evidence that Wilt’s free-throw shooting 
has improved? We want to perform a simulation to estimate the probability 
that a 56% free-throw shooter would make 34 or more in a sample of 50 


shots. 


T5.5 Which of the following is a correct way to perform the simulation? 


a. 


Let integers from 1 to 34 represent making a free throw and 35 
to 50 represent missing a free throw. Generate 50 random 
integers from 1 to 50. Count the number of made free throws. 
Repeat this process many times. 
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b. Let integers from 1 to 34 represent making a free throw and 35 
to 50 represent missing a free throw. Generate 50 random 
integers from 1 to 50 with no repeats allowed. Count the 
number of made free throws. Repeat this process many times. 


c. Let integers from 1 to 56 represent making a free throw and 57 
to 100 represent missing a free throw. Generate 50 random 
integers from 1 to 100. Count the number of made free throws. 
Repeat this process many times. 


d. Let integers from 1 to 56 represent making a free throw and 57 
to 100 represent missing a free throw. Generate 50 random 
integers from 1 to 100 with no repeats allowed. Count the 
number of made free throws. Repeat this process many times. 


e. None of the above is correct. 


T5.6 The dotplot displays the number of made shots in 100 simulated 
sets of 50 free throws by someone with probability 0.56 of making 
a free throw. 


18 20 22 24 2 28 30 32 «34 ~« 36 
Simulated number of made shots 


Which of the following is an appropriate statement about Wilt’s 
free-throw shooting based on this dotplot? 


a. If Wilt were still only a 56% shooter, the probability that he 
would make at least 34 of his shots is about 0.03. 


b. If Wilt were still only a 56% shooter, the probability that he 
would make at least 34 of his shots is about 0.97. 


c. If Wilt is now shooting better than 56%, the probability that he 
would make at least 34 of his shots is about 0.03. 


d. If Wilt is now shooting better than 56%, the probability that he 
would make at least 34 of his shots is about 0.97. 


e. If Wilt were still only a 56% shooter, the probability that he 
would make at least 34 of his shots is about 0.01. 
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T5.7 The partially complete table that follows shows the distribution of 
scores on the AP® Statistics exam for a class of students. 


Score 1 2 3 4 5 
Probability 0.10 0.20 ??? 0.25 0.15 


Select a student from this class at random. If the student earned a 
score of 3 or higher on the AP® Statistics exam, what is the 
probability that the student scored a 5? 


a. 0.150 
b. 0.214 
c. 0.300 
d. 0.428 
e. 0.700 


T5.8 In a class, there are 18 girls and 14 boys. If the teacher selects two 
students at random to attend a party with the principal, what is the 
probability that the two students are the same sex? 


a. 0.49 
b. 0.50 
c. 0.51 
d. 0.52 
e. 0.53 


T5.9 Suppose that a student is randomly selected from a large high 
school. The probability that the student is a senior is 0.22. The 
probability that the student has a driver’s license is 0.30. If the 
probability that the student is a senior or has a driver’s license is 
0.36, what is the probability that the student is a senior and has a 
driver’s license? 


a. 0.060 
b. 0.066 
c. 0.080 
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d. 0.140 
e. 0.160 


T5.10 The security system in a house has two units that set off an alarm 
when motion is detected. Neither one is entirely reliable, but one or 
both always go off when there is motion anywhere in the house. 
Suppose that for motion in a certain location, the probability that 
detector A goes off and detector B does not go off is 0.25, and the 
probability that detector A does not go off is 0.35. What is the 
probability that detector B goes off? 


a. 0.1 
b. 0.35 
c. 0.4 
d. 0.65 
e. 0.75 


Section II: Free Response Show all your work. Indicate clearly the 
methods you use, because you will be graded on the correctness of your 
methods as well as on the accuracy and completeness of your results and 
explanations. 


T5.11 The two-way table summarizes data on whether students at a 
certain high school eat regularly in the school cafeteria by grade 


level. 
Grade 
9th 10th 11th 12th Total 
Eatin Yes 495 
cafeteria? No 310 
Total 148 209 210 238 805 


a. If you choose a student at random, what is the probability that 
the student eats regularly in the cafeteria and is not a 10th- 
grader? 


b. If you choose a student at random who eats regularly in the 
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cafeteria, what is the probability that the student is a 10th- 
grader? 


c. Are the events “10th-grader” and “eats regularly in the 
cafeteria” independent? Justify your answer. 


T5.12 Three machines—A, B, and C—are used to produce a large 
quantity of identical parts at a factory. Machine A produces 60% of 
the parts, while Machines B and C produce 30% and 10% of the 
parts, respectively. Historical records indicate that 10% of the parts 
produced by Machine A are defective, compared with 30% for 
Machine B and 40% for Machine C. Suppose we randomly select a 
part produced at the factory. 


a. Find the probability that the part is defective. 


b. Ifthe part is inspected and found to be defective, what’s the 
probability that it was produced by Machine B? 


T5.13 At Dicey Dave’s Diner, the dinner buffet usually costs $12.99. 
Once a month, Dave sponsors “lucky buffet” night. On that night, 
each patron can either pay the usual price or roll two fair, six-sided 
dice and pay a number of dollars equal to the product of the 
numbers showing on the two faces. The table shows the sample 
space of this chance process. 


First die 


Second 
die 


a. A customer decides to play Dave’s “lucky buffet” game. Find 
the probability that the customer will pay less than the usual 
cost of the buffet. 


b. A group of 4 friends comes to Dicey Dave’s Diner to play the 
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“lucky buffet” game. Find the probability that all 4 of these 
friends end up paying less than the usual cost of the buffet. 


c. Find the probability that at least 1 of the 4 friends ends up 
paying more than the usual cost of the buffet. 


T5.14 Based on previous records, 17% of the vehicles passing through a 
tollbooth have out-of-state plates. A bored tollbooth worker 
decides to pass the time by counting how many vehicles pass 
through until he sees two with out-of-state plates. We would like to 
perform a simulation to estimate the average number of vehicles it 


takes to find two with out-of-state plates.22 


a. Describe how you would use a table of random digits to 
perform the simulation. 


b. Perform 3 trials of the simulation using the random digits given 
here. Copy the digits onto your paper and mark directly on or 
above them so that someone can follow what you did. 


41050 92031 06449 05059 59884 31880 
93115 84469 94868 97967 05811 84514 
84177 06757 17613 15582 91506 81435 
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Chapter 6 Random Variables 


Stefano Paterna/Alamy 
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Section 6.1 Discrete and Continuous Random Variables 
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INTRODUCTION 


Do you drink bottled water or tap water? According to an online news 
report, about 75% of people drink bottled water regularly. Some people do 
so because they believe bottled water is safer than tap water. (There’s little 
evidence to support this belief.) Others say they prefer the taste of bottled 
water. Can people really tell the difference? 


ACTIVITY Bottled water versus tap water 


This activity will give you and your classmates a chance to discover 
whether or not you can taste the difference between bottled water and 
tap water. 


1. Before class begins, your teacher will prepare numbered stations with 
cups of water. You will be given an index card with a station number 
on it. 


2. Go to the corresponding station. Pick up three cups (labeled A, B, and 
C) and take them back to your seat. 


3. Your task is to determine which one of the three cups contains the 
bottled water. Drink all the water in Cup A first, then the water in 
Cup B, and finally the water in Cup C. Write down the letter of the 
cup that you think held the bottled water. Do not discuss your results 
with any of your classmates yet! 


4. While you’re tasting, your teacher will make a chart on the board like 
this one: 


Station Bottled water Truth 
number cup? 


5. When you are told to do so, go to the board and record your station 
number and the letter of the cup you identified as containing bottled 
water. 


6. Your teacher will now reveal the truth about the cups of drinking 
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water. How many students in the class identified the bottled water 
correctly? What percent of the class is this? 


7. Let’s assume that no one in your class can distinguish tap water from 
bottled water. In that case, students would just be guessing which cup 
of water tastes different. If so, what’s the probability that an 
individual student would guess correctly? 


8. How many correct identifications would you need to provide 
convincing evidence that the students in your class aren’t just 
guessing? With your classmates, design and carry out a simulation to 
answer this question. What do you conclude about your class’s ability 
to distinguish tap water from bottled water? 


magnetcreative/E+/Getty Images 


The ABC News program 20/20 set up a blind taste test in which people were asked to 


rate four different brands of bottled water and New York City tap water without 
knowing which they were drinking. Can you guess the result? Tap water came out 


the clear winner in terms of taste. 


When Mr. Hogarth’s class did the preceding activity, 13 out of 21 
students made correct identifications. If we assume that the students in his 
class can’t tell tap water from bottled water, then each one is basically 
guessing, with a 1/3 chance of being correct. So we’d expect about one- 
third of Mr. Hogarth’s 21 students (i.e., about 7 students) to guess 
correctly. How likely is it that 13 or more of the 21 students would guess 
correctly? To answer this question without a simulation, we need a 
different kind of probability model from the ones we saw in Chapter 5. 


Section 6.1 introduces the concept of a random variable, a numerical 
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outcome of some chance process (like the 13 students who guessed 
correctly in Mr. Hogarth’s class). Each random variable has a probability 
distribution that gives us information about the likelihood that a specific 
event happens (like 13 or more correct guesses out of 21) and about what’s 
expected to happen if the chance behavior is repeated many times. Section 
6.2 examines the effect of transforming and combining random variables 
on the shape, center, and variability of their probability distributions. In 
Section 6.3, we’ll look at two random variables with probability 
distributions that are used enough to have their own names— binomial and 
geometric. 
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SECTION 6.1 Discrete and Continuous Random 
Variables 


LEARNING TARGETS By the end of the section, you should be able to: 


e Use the probability distribution of a discrete random variable to 
calculate the probability of an event. 


Make a histogram to display the probability distribution of a 
discrete random variable and describe its shape. 


Calculate and interpret the mean (expected value) of a discrete 
random variable. 


Calculate and interpret the standard deviation of a discrete random 
variable. 


Use the probability distribution of a continuous random variable 
(uniform or Normal) to calculate the probability of an event. 


A probability model describes the possible outcomes of a chance process 
and the likelihood that those outcomes will occur. For example, suppose 
you toss a fair coin 3 times. The sample space for this chance process is 


HHH HHT HTH THH HTT THI TTH TIT 
HHH HHT HTH THH HTT THT TTH TTT 


Because there are 8 equally likely outcomes, the probability is 1/8 for each 
possible outcome. 


Define the random variable X = the number of heads obtained in 3 
tosses. The value of X will vary from one set of tosses to another, but it 
will always be one of the numbers 0, 1, 2, or 3. How likely is X to take 
each of those values? It will be easier to answer this question if we group 
the possible outcomes by the number of heads obtained: 


X=0: TTT X=1: HTT THT TTH X=2: HHT HTH THH 


X=3: HHH 
X =0:TTT X =1:HTT THT TTH X = 2:HHT HTH THH X = 3: HHH 


We can summarize the probability distribution of X in a table: 
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Value 0 af 2 3 
Probability 1/8 3/8 3/8 1/8 


DEFINITION Random variable, Probability distribution 


A random variable takes numerical values that describe the 
outcomes of a chance process. 


The probability distribution of a random variable gives its possible 
values and their probabilities. 


We use capital, italic letters (like X or Y) to designate random 
variables. There are two main types of probability distributions, 
corresponding to two types of random variables: discrete and continuous. 


Discrete Random Variables 


The random variable X in the coin-tossing setting is a discrete random 
variable. 


DEFINITION Discrete random variable 


A discrete random variable X takes a fixed set of possible values 
with gaps between them. 


We can list the possible values of X = the number of heads in 3 tosses 
of a coin as 0, 1, 2, 3. Note that there are gaps between these values on a 
number line. For instance, a gap exists between X=1X = 1 and X=2X = 2 
because X cannot take values such as 1.2 or 1.84. 


The probability distribution of X is 


Value 0) 1 2 3 
Probability 1/8 3/8 3/8 1/8 


This probability distribution is valid because all the probabilities are 
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between 0 and 1, and their sum is 


1/8+3/8+3/8+1/8=11/8 + 3/8 + 3/8 +1/8=1 


PROBABILITY DISTRIBUTION FOR A DISCRETE 
RANDOM VARIABLE 


The probability distribution of a discrete random variable X lists the 
values x; and their probabilities p;: 


Value X41 x2 X3 


Probability Pi =P2~ ?P3 


For the probability distribution to be valid, the probabilities p; must 
satisfy two requirements: 


1. Every probability p; is a number between 0 and 1, inclusive. 


2. The sum of the probabilities is 1:p1+p2+p3+. . .=1 
1 [Py + Po t+ p3t.. = 1 


We can use the probability distribution of a discrete random variable to 
find the probability of an event. For instance, what’s the probability that 
we get at least one head in three tosses of the coin? In symbols, we want to 


find P(x>1)?(* = 1). We know that 
P(X>1)=P(X=1 Or X=? or X=3)P (X = 1) = P(X — ah Bs) a er ee ee Re 3) 


Because the events X=1X = 1, X=2X = 2, and X=3X = 3 are mutually 
exclusive, we can add their probabilities to get the answer: 
P(X21)=P(X=1)+P(X=2)+P(X=3)=3/8+3/8+ 1/8=7/8 
P(X >1)= P(X =1)+ P(X = 2) + P(X = 3) 
= 3/8+3/8+1/8=7/8 


Or we could use the complement rule from Chapter 5: 


P(X>1)=1-P(X<1)=1-P(X=0)=1-1/8=7/8 
P(X>1) = 1-P(X<1)=1-P(X=0) 
= 1-1/8=7/8 
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EXAMPLE | Apgar scores: Babies’ health at birth >» 


Discrete random variables 


susaro/iStock/Getty Images 


PROBLEM: In 1952, Dr. Virginia Apgar suggested five criteria for 
measuring a baby’s health at birth: skin color, heart rate, muscle tone, 
breathing, and response when stimulated. She developed a 0-1-2 scale to 
rate a newborn on each of the five criteria. A baby’s Apgar score is the 
sum of the ratings on each of the five scales, which gives a whole- 
number value from 0 to 10. Apgar scores are still used today to evaluate 
the health of newborns. Although this procedure was later named for Dr. 
Apgar, the acronym APGAR also represents the five scales: 
Appearance, Pulse, Grimace, Activity, and Respiration. 

What Apgar scores are typical? To find out, researchers recorded the 
Apgar scores of over 2 million newborn babies in a single year 
Imagine selecting a newborn baby at random. (That’s our chance 
process.) Define the random variable X = Apgar score of a randomly 
selected newborn baby. The table gives the probability distribution of X. 


Value x; Ot ae! 2 3 4 3) 6 7 8 2 10 


Probability ??? 0.006 0.007 0.008 0.012 0.020 0.038 0.099 0.319 0.437 0.053 
Pj 


a. Write the event “the baby has an Apgar score of 0” in terms of X. 
Then find its probability. 


b. Doctors decided that Apgar scores of 7 or higher indicate a healthy 
newborn baby. What’s the probability that a randomly selected 
newborn baby is healthy? 


791 


SOLUTION: 


a. P(X=0)=1-(0.006+0.007+~+0.053)=1-0.999=0.001 
P(X =0) = 1—(0.006 + 0.007+--- + 0.053) 
= 1—0.999 
= 0.001 


Use the complement rule: 


P(X=0)=1-P(X#0) 


b. P(X27)=0.099+0.319+0.437+0.053=0.908 
P(X >7) = 0.099 + 0.319 + 0.437 + 0.053 


= 0.908 


The probability of choosing a healthy baby is 
Ps PO 7) Px 8) 


P(X = 9) + P(X =10) 


FOR PRACTICE, TRY EXERCISE 1 


Note that the probability of randomly selecting a newborn whose 
Apgar score is at least 7 is not the same as the probability that the baby’s 
Apgar score is greater than 7. The latter probability is 

P(X>7)=P(X=8)+P(X=9)+P(X=10)=0.319+0.437+0.053=0.809 
PAST) = P(A =8)+P(4—9) +4 =—10) 
0.319 + 0.437 + 0.053 
= 0.809 


I| 


The outcome X = 7 is included in “at least 7” but is not included in 
“sreater than 7.” @ Be sure to consider whether to include the 


boundary value in your calculations when dealing with discrete 
random variables. 


Analyzing Discrete Random Variables: Describing Shape 
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When we analyzed distributions of quantitative data in Chapter 1, we made 
it a point to discuss their shape, center, and variability. We’ll do the same 
with probability distributions of random variables. 

For the discrete random variable X = Apgar score of a randomly 
selected newborn baby, the probability distribution is 


Value x; 0 1 2 3 4 5 6 7 8 ) 10 
Probability p; 0.001 0.006 0.007 0.008 0.012 0.020 0.038 0.099 0.319 0.437 0.053 


We can display the probability distribution with a histogram. Values of the 
variable go on the horizontal axis and probabilities go on the vertical axis. 
There is one bar in the histogram for each value of X. The height of each 
bar gives the probability for the corresponding value of the variable. 
Figure 6.1 shows a histogram of the probability distribution of X. This 
distribution is skewed to the left with a single peak at an Apgar score of 9. 
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FIGURE 6.1 Histogram of the probability distribution for the random variable X = 
Apgar score of a randomly selected newborn baby. 


A probability distribution histogram is really just a relative frequency 
histogram because probabilities are long-run relative frequencies. 


EXAMPLE |Pete’s Jeep Tours (>) 
Displaying a probability distribution 
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Judy M. Starnes 


PROBLEM: Pete’s Jeep Tours offers a popular day trip in a tourist 
area. There must be at least 2 passengers for the trip to run, and the 
vehicle will hold up to 6 passengers. Pete charges $150 per passenger. 
Let C = the total amount of money that Pete collects on a randomly 
selected trip. The probability distribution of C is given in the table. 


Total collected c; 300 450 600 750 900 
Probability pj = 0.15 0.25 0.35 0.20 0.05 


Make a histogram of the probability distribution. Describe its shape. 


SOLUTION: 


0.4 
0.3 


02 


Probability 


0.7 


0.0 
0 6150 «300 #50 «600s #50=—- 900 
C = Money collected (+) 


The graph is roughly symmetric and has a single peak at $600. 


Remember: Values of the variable go on the horizontal axis and 


probabilities go on the vertical axis. Don’t forget to properly label and 
scale each axis! 


FOR PRACTICE, TRY EXERCISE 5 


Notice the use of the label C (collects) for the random variable in the 
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example. Sometimes we prefer contextual labels like this to the more 
generic X and Y. 


CHECK YOUR UNDERSTANDING 


( 


,, Indiana University Bloomington posts the grade distributions for its 
courses online.2 Suppose we choose a student at random from a recent 
semester of this university’s Business Statistics course. The student’s 
grade on a 4-point scale (with A = 4) is a random variable X with this 
probability distribution: 


Value 0 il y 3 4 
Probability 0.011 0.032 ??? 0.362 0.457 


1. Write the event “the student got a C” using probability notation. Then 
find this probability. 

2. Explain in words what P(X23)P (X > 3) means. What is this 
probability? 

3. Make a histogram of the probability distribution. Describe its shape. 


Measuring Center: The Mean (Expected Value) of a 
Discrete Random Variable 


In Chapter 1, you learned how to summarize the center of a distribution of 
quantitative data with either the median or the mean. For discrete random 
variables, the mean is typically used to summarize the center of a 
probability distribution. The mean x # of a quantitative data set with n 
observations is 

x =sum of data valuesnumber of data values=x1+x2+-::+xnn=y xin 
sum ofdatavalues = fy +@gte-+2n © 2 vi 


number of data values n n 


z= 
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How do we find the mean of a discrete random variable? 

Consider the random variable C = the total amount of money that Pete 
collects on a randomly selected jeep tour from the previous example. The 
probability distribution of C is given in the table. 


Total collected c; 300 450 600 750 900 
Probability p; 0.15 0.25 0.35 0.20 0.05 


What’s the average amount of money that Pete collects on his jeep tours? 


Imagine a hypothetical 100 trips. Pete will collect $300 on 15 of these 
trips, $450 on 25 trips, $600 on 35 trips, $750 on 20 trips, and $900 on 5 
trips. Pete’s average amount collected for these trips is 


300° 15+450 -25+600 -35+750 -20+900 -5100=300 - 15100+450 -25100+60( 
300-15+450-25+-600-35+750-20+-900-5 


100 
300-15 450-25 600-35 750-20 900-5 

100 100 100 100 100 

= 300 (0.15) + 450 (0.25) + 600 (0.35) + 750 (0.20) + 900 (0.05) 

= 562.50 


The third line of the calculation is just the values of the random variable C times 


their corresponding probabilities. 


That is, the mean of the discrete random variable C is »C=$562.50 
Hc = $562.50 This is also known as the expected value of C, denoted by 
E(C). 

The mean (expected value) of any discrete random variable is found in 
a similar way. It is an average of the possible outcomes, but a weighted 
average in which each outcome is weighted by its probability. 


DEFINITION Mean (expected value) of a discrete random 
variable 


The mean (expected value) of a discrete random variable is its 
average value over many, many repetitions of the same chance 
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process. 


Suppose that X is a discrete random variable with probability 
distribution 


Value X41 x2 X3 


Probability Pi P2 ?p3 


To find the mean (expected value) of X, multiply each possible value 
of X by its probability, then add all the products: 


UX=E(X)=x1p1+x2p2+x3p3+...=); xipi 
x = E(X) = 2p, + repo + 3p3 +... 


= So aipi 


Recall that the mean is the balance point of a distribution. For Pete’s 
distribution of money collected on a randomly selected jeep tour, the 
histogram balances at pC=562.50HC = 562.50 How do we interpret this 
value? If we randomly select many, many jeep tours, Pete will make about 
$562.50 per trip, on average. 


0.4 


Probability 
o Ss 
N Ww 


9 
ra 


© 
° 


0 150 300 450 600 750 900 
b, = 562.50 


C = Money collected ($) 


EXAMPLE | Apgar scores: What’s typical? >» 


Finding and interpreting the mean 


PROBLEM: Earlier, we defined the random variable X to be the Apgar 
score of a randomly selected newborn baby. The table gives the 
probability distribution of X once again. 
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Value x; 0 1 2 3 4 5 6 7 8 3 10 


Probability 0.001 0.006 0.007 0.008 0.012 0.020 0.038 0.099 0.319 0.437 0.053 
Pi 


Calculate and interpret the expected value of X. 


SOLUTION: 


E(X)=»X=(0)(0.001)+(1)(0.006)+...+(10)(0.053)=8.128 
E(X) = px = (0)(0.001) + (1)(0.006) +... + (10)(0.053) = 8.128 


F(X )=x=y Kipie ~) =e = > Xip; 


If many, many newborns are randomly selected, their average Apgar 
score will be about 8.128. 


FOR PRACTICE, TRY EXERCISE 7 


Notice that the mean Apgar score, 8.128, is not a possible value of the 
random variable X because it is not a whole number between 0 and 10. The 
non-integer value of the mean shouldn’t bother you if you think of the 
mean (expected value) as a long-run average over many repetitions. 


AP® EXAM TIP 


If the mean of a random variable has a non-integer value but you report 
it as an integer, your answer will not get full credit. 


How can we find the median of a discrete random variable? In Chapter 
1, we defined the median as “the midpoint of a distribution, the number 
such that about half the observations are smaller and about half are larger.’ 
The median of a discrete random variable is the 50th percentile of its 
probability distribution. We can find the median by adding a cumulative 
probability row to the probability distribution table, and then locating the 
smallest value for which the cumulative probability equals or exceeds 0.5. 
For the distribution of amount of money collected on Pete’s Jeep Tours, 


> 
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we see that the median is $600. 


Total collected 300 
Probability 0.15 0.25 
Cumulative probability 0.15 


0.20 0.05 


Measuring Variability: The Standard Deviation (and 
Variance) of a Discrete Random Variable 


With the mean as our measure of center for a discrete random variable, it 
shouldn’t surprise you that we’ll use the standard deviation as our measure 
of variability. In Chapter 1, we defined the standard deviation sx* of a 
distribution of quantitative data as the typical distance of the values in the 
data set from the mean. To get the standard deviation, we started by 


“averaging” the squared deviations from the mean (xi-x )2(2 if) to 
get the sample variance 5x28 and then took the Square root: 
sx=(xl-x )2+(x2-x )2+...+(xn-x )2n-1 
| (a4 -z)? +(a9 -z)? +...+(2n ~)? 
sz = || —————— 
\ n—l 
We can modify this approach to calculate the standard deviation of a 
discrete random variable X. Start by finding a weighted average of the 
9 
squared deviations (xi-px)2(© i x)” of the values of the variable X 
from its mean pX#* . The probability distribution gives the appropriate 
weight for each squared deviation. We call this weighted average the 
variance of X, denoted by oX27X. Then take the square root to get the 
standard deviation oX?X . 


DEFINITION Standard deviation of a discrete random 
variable, Variance 


The standard deviation of a discrete random variable measures 
how much the values of the variable typically vary from the mean. 


Suppose that X is a discrete random variable with probability 
distribution 
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Value Xt XD x3 


Probability Pir po" ps 


and that p, is the mean of X. The variance of X is 


Var(X)=o0 X2=(x1-pX) 2p 1+ (x 2-p"X) 2p 2+ (x 3-pX) 2p3+...=y, 
(xi- UX)2pi 


Var (X)=o07%, = (2; — px)? Pit (t2—px) *p2+ (t3 —px) 7p3t... 


= So (« i — Ux) ° Di 


The standard deviation of X is the square root of the variance: 


oX=0X2 =(x1-pX)2p1+(x2-pX)2p2+(x3-pX)2p3+...=¥ (xi 
——X)2pi 


j 


| p72 
ox = 4/Fx 


/(a1 — wx)’pi + (v2 — px)’ pe + (x3 — wx)” ps +... 
\ 


= fei - ux) ux) pi 


Let’s return to the random variable C = the total amount of money that 
Pete collects on a randomly selected jeep tour. The left two columns of the 
following table give the probability distribution. Recall that the mean of C 
is Uc = 562.50. The third column of the table shows the squared deviation 


of each value from the mean. The fourth column gives the weighted 


squared deviations. 


Total Probability Squared deviation from the Weighted squared 
collected ¢; Pj mean (c;-H,)” deviation (c—,)"p; 
300 0.15 (300 — 562.50) (300 — 562.50)2 (0.15) = 
10335.94 
450 0.25 (450 — 562.50)2 (450 — 562.50)2 (0.25) = 
3164.06 
600 0.35 (600 — 562.50) (600 — 562.50)? (0.35) = 
492.19 
750 0.20 (750 — 562.50)2 (750 — 562.50) (0.20) = 
7031.25 
900 0.05 (900 — 562.50) (900 — 562.50)? (0.05) = 
5695.31 


800 


Sum = 26,718.75 


Adding the weighted average of the squared deviations in the fourth 
column gives the variance of C: 

Var(C)=0C2=> (ci-pC)2pi=(300-562.50)2(0.15)+ 
(450-562.50)2(0.25)+---+(900-562.50)2(0.05)=10335.94+3164.06+--- 
+5695.31=26,718.75 (squared dollars) 

Var(C) = 08 =>) (G —Hc)’p: 
= (300 — 562.50)” (0.15) + (450 — 562.50)° (0.25) + --- + (900 — 562.50)” (0.05) 
= 10335.94 + 3164.06 + --- + 5695.31 
= 26,718.75 (squared dollars) 


The standard deviation of C is the square root of the variance: 


6C=26,718.75=$163. 4600 mental” = ORO 


The amount of money that Pete collects on a randomly selected trip 
typically varies from the mean of $562.50 by about $163.46. 


EXAMPLE |How much do Apgar scores vary? » 


Finding and interpreting the standard deviation 


PROBLEM: Earlier, we defined the random variable X to be the Apgar 
score of a randomly selected newborn baby. The table gives the 
probability distribution of X once again. In the last example, we 
calculated the mean Apgar score of a randomly chosen newborn to be 1, 


= 8.128. 
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Value x; 0 1 2 3 4 5 6 7 8 9 10 
Probability 0.001 0.006 0.007 0.008 0.012 0.020 0.038 0.099 0.319 0.437 0.053 


Pj 


Calculate and interpret the standard deviation of X. 


SOLUTION: 


oX2=(0-8.128)2(0.001)+(1-8.128)2(0.006)+---+ 
(10-8.128)2(0.053)=2.066 


2 


o% = (0 — 8.128) (0.001) + (1 — 8.128)” (0.006) + --- + (10 — 8.128)” (0.053) = 2.066 
oX = 2.066 = 1.4377x = Vv 2.066 = 1.437 


A randomly selected newborn baby’s Apgar score will typically vary 
from the mean (8.128) by about 1.437 units. 


Start by calculating the variance oX2=)' (xi-pX)2pi 


oe = Do (ti — wx) Pi 


FOR PRACTICE, TRY EXERCISE 13 


You can use your calculator to graph the probability distribution of a 
discrete random variable and to calculate measures of center and 
variability, as the following Technology Corner illustrates. 


12. Technology Corner | ANALYZING DISCRETE 
RANDOM VARIABLES |p) 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


Let’s explore what the TI-83/84 can do using the random variable X = 
Apgar score of a randomly selected newborn. 


1. Enter the values of the random variable in list L1 and the 
corresponding probabilities in list L2. 
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Shes a Das a ts sd 


Lacv=O 


2. To graph a histogram of the probability distribution: 


e In the statistics plot menu, define Plot 1 to be a histogram with 
Xlist: L1 and Freq: L2. 


e Adjust your window settings as follows: Xmin = -1, Xmax = 11, 
Xscl = 1, Ymin = —-0.1, Ymax = 0.5, Yscl = 0.1. 


e Press |IGRAPH. 


Ploti:LasL2 


= 


min=3 
max<ie n=.437 


3. To calculate the mean and standard deviation of the random variable, 
use one-variable statistics with the values in L1 and the probabilities 
(relative frequencies) in L2. Press arrow to the CALC menu, 
and choose 1-Var Stats. 


OS 2.55 or later: In the dialog box, specify List: L1 and FreqList: 
L2. Then choose Calculate. 


Older OS: Execute the command 1-Var Stats L1,L2. 


=-Var Stats) 
x=8.128 
=x=8.128 
2x?=68.13 


Sx= 
ox=1.437225104 
n=1 
minx=0 

4Q1=8 
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Note: If you leave Freq: L2 and try to calculate summary statistics for a 
quantitative data set that does not include frequencies, you will likely 
get an error message. Be sure to clear Freq when you are done with 
calculations to avoid this issue. 


AP® EXAM TIP 


If you are asked to calculate the mean or standard deviation of a 
discrete random variable on a free response question, you must show 
numerical values substituted into the appropriate formula, as in the 
previous two examples. Feel free to use ellipses (...) if there are many 
terms in the summation, as we did. You may then use the method 
described in Technology Corner 12 to perform the calculation with 1- 
Var Stats. Writing only 1-Var Stats L1, L2 and then giving the correct 
values of the mean and standard deviation will not earn credit for 
showing work. 


CHECK YOUR UNDERSTANDING 


( 


y Indiana University Bloomington posts the grade distributions for its 
courses online.2 Suppose we choose a student at random from a recent 
semester of this university’s Business Statistics course. The student’s 
grade on a 4-point scale (with A = 4) is a random variable X with this 
probability distribution: 


Value 0 il D 3 4 
Probability 0.011 0.032 0.138 0.362 0.457 


1. Calculate and interpret the mean of X. 


2. Calculate and interpret the standard deviation of X. 
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Continuous Random Variables 


When we use Table D of random digits to select an integer from 0 to 9, the 
result is a discrete random variable (call it X). The probability distribution 
assigns probability 1/10 to each of the 10 equally likely values of X. 


Suppose we want to choose a number at random between 0 and 9, 
allowing any number between 0 and 9 as the outcome (like 0.84522 or 
7.1111119). Calculator and computer random number generators will do 
this. The sample space of this chance process is the entire interval of 
values between 0 and 9 on the number line. If we define Y = randomly 
generated number between 0 and 9, then Y is a continuous random 
variable. 


DEFINITION Continuous random variable 


A continuous random variable can take any value in an interval on 
the number line. 


Most discrete random variables result from counting something, like 
the number of siblings that a randomly selected student has. Continuous 
random variables typically result from measuring something, like the 
height of a randomly selected student or the time it takes that student to 
run a mile. 


How can we find the probability P(3<Y<7)P (3 < Y <7) that the 
random number generator produces a value between 3 and 7? As in the 
case of selecting a random digit, we would like all possible outcomes to be 
equally likely. But we cannot assign probabilities to each individual value 
of Y and then add them, because there are infinitely many possible values. 


The probability distribution of a continuous random variable is 
described by a density curve. Recall from Chapter 2 that any density curve 
has area exactly 1 underneath it, corresponding to a total probability of 1. 
We use areas under the density curve to assign probabilities to events. 


For the continuous random variable Y = randomly generated number 
between 0 and 9, its probability distribution is a uniform density curve 
with constant height 1/9 on the interval from 0 to 9. Note that this 
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probability distribution is valid because the total area under the density 
curve is 


Area=basexheight=9x1/9-147¢4 = base x height = 9 x 1/9-1 


Figure 6.2 shows the probability distribution of Y with the area of 
interest shaded. The area under the density curve between 3 and 7 is 


Area=basexheight=4*1/9=4/94rea = base x height = 4 x 1/9 = 4/9 
So P(3<= Y <= 7) = 4/9 = 0.444. 


es 


cr 


Height = 7 < 


0 1 2 2 & 3 6 7 8 9 
Y = Randomly generated number 


FIGURE 6.2 The probability distribution of the continuous random variable Y = 
randomly generated number between 0 and 9. The shaded area represents P(3 < Y< 
7). 


HOW TO FIND PROBABILITIES FOR A CONTINUOUS 
RANDOM VARIABLE 


The probability of any event involving a continuous random variable is 
the area under the density curve and directly above the values on the 
horizontal axis that make up the event. 


Consider a specific outcome from the random number generator 
setting, such as P(Y = 7). The probability of this event is the area under the 
density curve that’s above the point 7.0000 ... on the horizontal axis. But 
this vertical line segment has no width, so the area is 0. In fact, all 
continuous probability distributions assign probability 0 to every 
individual outcome. For that reason, 


P(3<Y<7)=P(3<Y<7)=P(3<Y<7)=P(3<Y<7)=0.444 
P(3<Y<D)D=P3<Y<7)=P(3<Y <7)=P(3<Y <7) =0.444 
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Remember: The probability distribution for a continuous random variable 
assigns probabilities to intervals of outcomes rather than to individual 
outcomes. 


EXAMPLE | Will it be quicker to walk to work? (») 


Continuous random variables 


PROBLEM: Selena works at a bookstore in the Denver International 
Airport. She takes the airport train from the main terminal to get to work 
each day. The airport just opened a new walkway that would allow 
Selena to get from the main terminal to the bookstore in 4 minutes. She 
wonders if it will be faster to walk or take the train to work. Let Y = 
Selena’s journey time to work (in minutes) by train on a randomly 
selected day. The probability distribution of Y can be modeled by a 
uniform density curve on the interval from 2 to 5 minutes. Find the 
probability that it will be quicker for Selena to take the train than to 
walk that day. 


SOLUTION: 


| Density curve | 


20 425 30 35 40 45 $0 
Y = Journey time (min) 


Start by drawing the density curve with the area of interest shaded. The 


height of the curve needs to be 1/3 so that 
Area = base x height = 3 x 1/3 =1 


Shaded area = base x height = 2 x 1/3 = 2/3P(Y < 4) = 2/3 = 0.667 
Shaded area = base x height = 2 x 1/3=2/3 


P(Y < 4) = 2/3 = 0.667 


There is a 66.7% chance that it will be quicker for Selena to take the 
train to work on a randomly selected day. 
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FOR PRACTICE, TRY EXERCISE 23 


The density curves that are most familiar to us are the Normal 
distributions from Chapter 2. Normal curves can be probability 
distributions as well as models for data. 


EXAMPLE | Young women’s heights » 


Normal probability distributions 


Alamy Images 


PROBLEM: The heights of young women can be modeled by a 
Normal distribution with mean p = 64 inches and standard deviation 
o=2.70 = 2.7 inches. Suppose we choose a young woman at random 
and let Y = her height (in inches). Find P(68<Y<70)P (68 < Y < 70)) 
Interpret this value. 


SOLUTION: 


55.9 58.6 61.3 64 66.7 62 69.4 70 F421 


Y = Young woman's height (in.) 


1. Draw a Normal distribution. 


2. Perform calculations—show your work! 
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(i) Standardize and use Table A or technology; or 


(ii) Use technology without standardizing. 
Be sure to answer the question that was asked. 


, — 68-64 _ 
i, 2= 68-642,7 = 1.48" 27 = 1-48 


70—64 


Z = 70-642.7 =2.22° 27 
Using Table A: 0.9868 — 0.9306 = 0.0562 


Using technology: normalcdf(lower:1.48, upper:2.22, mean:0, SD:1) 
= 0.0562 


P(68 < Y< 70) = P (1.48 < Z< 2.22) 


ii. normalcdf(lower:68, upper:70, mean:64, SD:2.7) = 0.0561 


The probability that a randomly selected young woman has a height 
between 68 and 70 inches is about 0.06. 


= 2.22 


FOR PRACTICE, TRY EXERCISE 27 


AP® EXAM TIP 


Students often do not get full credit on the AP® Statistics exam because 
they only use option (ii) with “calculator-speak” to show their work on 


Normal calculation questions— for example, normalcdf(68,70,64,2.7). 
This is not considered clear communication. To get full credit, follow 
the two-step process above, making sure to carefully label each of the 
inputs in the calculator command if you use technology in Step 2: 
normalcdf(lower:68, upper:70, mean:64, SD: 2.7). 


The calculation in the preceding example is the same as those we did 
in Chapter 2. Only the language of probability is new. 


What about the mean and standard deviation for continuous random 
variables? The probability distribution of a continuous random variable X 
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is described by a density curve. Chapter 2 showed us how to find the mean 
of the distribution: it is the point at which the area under the density curve 
would balance if it were made out of solid material. The mean lies at the 
center of symmetric density curves such as Normal curves. We can locate 
the standard deviation of a Normal distribution from its inflection points. 
Exact calculation of the mean and standard deviation for most continuous 


random variables requires advanced mathematics.2 
a a3 
~ |, 
/ \ 
_— J = oe 
A 
je 


Section 6.1 Summary 


e A random variable takes numerical values determined by the outcome 
of a chance process. The probability distribution of a random variable 
gives its possible values and their probabilities. There are two types of 
random variables: discrete and continuous. 


e A discrete random variable has a fixed set of possible values with 
gaps between them. 


. A valid probability distribution assigns each of these values a 
probability between 0 and 1 such that the sum of all the probabilities is 
exactly 1. 


» The probability of any event is the sum of the probabilities of all the 
values that make up the event. 


= We can display the probability distribution as a histogram, with the 
values of the variable on the horizontal axis and the probabilities on 
the vertical axis. 


e A continuous random variable can take any value in an interval on the 
number line. 


: A valid probability distribution for a continuous random variable is 
described by a density curve with area 1 undemeath. 
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= The probability of any event is the area under the density curve 
directly above the values on the horizontal axis that make up the event. 


e We can describe the shape of a probability distribution histogram or 
density curve in the same way as we did a distribution of quantitative 
data—by identifying symmetry or skewness and any major peaks. 


e Use the mean to summarize the center of a probability distribution. The 
mean of a random variable /y is the balance point of the probability 
distribution histogram or density curve. 

: The mean is the long-run average value of the variable after many 


repetitions of the chance process. It is also known as the expected 
value of the random variable, E(X). 


= If X is a discrete random variable, the mean is the average of the 
values of X, each weighted by its probability: 
UX=E(X)=>. xipi=x1p1+x2p2+x3p3+::: 
px = E(X) = S- zip, = Z1pPi + L2p2 + T3p3 + °° 


e Use the standard deviation to summarize the variability of a probability 
distribution. The standard deviation of a random variable 0X7 
measures how much the values of the variable typically vary from the 
mean. 


= If X is a discrete random variable, the variance of X is the “average” 
squared deviation of the values of the variable from their mean: 


oX2=)> (xi-pX)2pi=(x1-pX)2p1+(x2-uX)2p2+(x3-uX)2p3+... 
0% = >) (ai — ux)’pi = (a1 — wx)’ Pr + (@2 — wx)" po + (23 — wx)’ Ds +... 


The standard deviation of X is the square root of the variance. 


6.1 Technology Corner (>) 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 
12. Analyzing discrete random variables Page 370 
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Section 6.1 Exercises 


1, 


pg 363 |») Kids and toys In an experiment on the behavior of 
young children, each subject is placed in an area with five toys. Past 
experiments have shown that the probability distribution of the 
number X of toys played with by a randomly selected subject is as 
follows: 


Number of toys x; 0 1 2 3 4 5 
Probability p; 0.03 016 030 0.23 0.17 ~~ ??? 


a. Write the event “child plays with 5 toys” in terms of X. Then find 


its probability. 


b. What’s the probability that a randomly selected subject plays with 


at most 3 toys? 


Spell-checking Spell-checking software catches “nonword errors,” 
which result in a string of letters that is not a word, as when “the” is 
typed as “teh.” When undergraduates are asked to write a 250-word 
essay (without spell-checking), the number Y of nonword errors in a 
randomly selected essay has the following probability distribution. 


Value y; 0 1 2 3 4 
Probability p; 0.1 gee 0.3 0.3 0.1 


a. Write the event “one nonword error” in terms of Y. Then find its 


probability. 


b. What’s the probability that a randomly selected essay has at least 


two nonword errors? 


Get on the boat! A small ferry runs every half hour from one side 
of a large river to the other. The probability distribution for the 
random variable Y = money collected (in dollars) on a randomly 
selected ferry trip is shown here. 


Money collected 0 5 10 15 20 25 
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Probability 0.02 0.05 0.08 0.16 0.27 0.42 


a. Find P(X>20)?(X > 20). Mmterpret this result. 


b. Express the event “at least $20 is collected” in terms of Y. What is 
the probability of this event? 


4. Skee Ball Ana is a dedicated Skee Ball player (see photo) who 
always rolls for the 50-point slot. The probability distribution of 
Ana’s score X on a randomly selected roll of the ball is shown here. 


Score 10 20 30 40 =~ = 50 
Probability 0.32 0.27 0.19 0.15 0.07 
a. Find P(Y<20)?(¥ < 29). Interpret this result. 


b. Express the event “Anna scores at most 20” in terms of X. What is 
the probability of this event? 


Stan Rohrer/Getty Images 


5. Dg 365 |p) Get on the boat! Refer to Exercise 3. Make a histogram 
of the probability distribution. Describe its shape. 


6. Skee Ball Refer to Exercise 4. Make a histogram of the probability 
distribution. Describe its shape. 


7. pg 367 |p) Get on the boat! Refer to Exercise 3. Find the mean of 


Y. Interpret this value. 


Money collected 0 a 10 15 20 25 
Probability 0.02 0.05 008 016 £0.27 & 0.42 


8. Skee Ball Refer to Exercise 4. Find the mean of X. Interpret this 
value. 
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Score 10 20 30 40 50 
Probability 0.32 0.27 0.19 0.15 0.07 


9. Benford’s law Faked numbers in tax returns, invoices, or expense 
account claims often display patterns that aren’t present in legitimate 
records. Some patterns, like too many round numbers, are obvious 
and easily avoided by a clever crook. Others are more subtle. It is a 
striking fact that the first digits of numbers in legitimate records often 
follow a model known as Benford’s law. Call the first digit of a 
randomly chosen legitimate record X for short. The probability 
distribution for X is shown here (note that a first digit cannot be 0). 


First digit x; 1 2 3 4 fs) 6 Z 8 a 
Probability p; 0.301 0.176 0.125 0.097 0.079 0.067 0.058 0.051 0.046 


a. A histogram of the probability distribution is shown. Describe its 
shape. 


b. Calculate and interpret the expected value of X. 


0.30 + 


0.25 4 
0.20 4 
0.15 4 


Probability 


0.10 - 
0.05 5 


0.00 


0" 2bh2S& 35S & F & S 
X = First digit 


10. Working out Choose a person aged 19 to 25 years at random and 
ask, “In the past seven days, how many times did you go to an 
exercise or fitness center or work out?” Call the response Y for short. 
Based on a large sample survey, here is the probability distribution of 


Y.2 


Days y; 0 1 2 3 4 5 6 7 
Probability p; 0.68 0.05 0.07 0.08 0.05 0.04 0.01 0.02 


a. A histogram of the probability distribution is shown. Describe its 
shape. 
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b. Calculate and interpret the expected value of Y. 


a 
> A Dn ~~) 


Probability 


0123 4 5 6 7 
Y = Number of days working out 


11. pg 369 |») Get on the boat! A small ferry runs every half hour 


from one side of a large river to the other. The probability distribution 
for the random variable Y = money collected on a randomly selected 
ferry trip is shown here. From Exercise 7, pY=$19.35.4Y =$19.35. 


Money collected 0 5 10 15 20 23 
Probability 0.02 0.005 008 0.16 0.27 # 0.42 


a. Find the median of Y. 
b. Compare the mean and median. Explain why this relationship 
makes sense based on the probability distribution. 


12. Skee Ball Ana is a dedicated Skee Ball player (see photo in Exercise 
4) who always rolls for the 50-point slot. The probability distribution 
of Ana’s score X on a randomly selected roll of the ball is shown 
here. From Exercise 8, uX=23.8HX = 23.8. 


Score 10 20 30 40 50 
Probability 0.32 G27 0.19 0.15 0.07 


a. Find the median of X. 


b. Compare the mean and median. Explain why this relationship 
makes sense based on the probability distribution. 


13. pg 369 |») Get on the boat! A small ferry runs every half hour 


from one side of a large river to the other. The probability distribution 
for the random variable Y = money collected on a randomly selected 
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14. 


15. 


16. 


17, 


ferry trip is shown here. From Exercise 7, pY=$19.35.4Y =$19.35. 


Calculate and interpret the standard deviation of Y. 


Money collected 0 5 10 15 20 25 
Probability 0.02 005 0.08 016 O27 &# 0.42 


Skee Ball Ana is a dedicated Skee Ball player (see photo in Exercise 
4) who always rolls for the 50-point slot. The probability distribution 
of Ana’s score X on a randomly selected roll of the ball is shown 
here. From Exercise 8, uX=23.8HX — 23.8 Calculate and interpret 
the standard deviation of X. 


Score 10 20 30 40 50 
Probability 0.32 0.27 0.19 0.15 0.07 


Benford’s law Exercise 9 described how the first digits of numbers in 
legitimate records often follow a model known as Benford’s law. Call 
the first digit of a randomly chosen legitimate record X for short. The 
probability distribution for X is shown here (note that a first digit 
can’t be 0). From Exercise 9, E(X) = 3.441. Find the standard 
deviation of X. Interpret this value. 


First digit x; 1 2 3 4 fs) 6 7 8 7g 
Probability p; 0.301 0.176 0.125 0.097 0.079 0.067 0.058 0.051 0.046 


Working out Exercise 10 described a large sample survey that asked 
a sample of people aged 19 to 25 years, “In the past seven days, how 
many times did you go to an exercise or fitness center or work out?” 
The response Y for a randomly selected survey respondent has the 
probability distribution shown here. From Exercise 10, E(Y) = 1.03. 
Find the standard deviation of Y. Interpret this value. 


Days y; 0 1 2 3 4 5 6 7 
Probability p; 0.68 0.05 0.07 0.08 0.05 0.04 0.01 0.02 


Life insurance A life insurance company sells a term insurance 
policy to 21-year-old males that pays $100,000 if the insured dies 
within the next 5 years. The probability that a randomly chosen male 
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will die each year can be found in mortality tables. The company 
collects a premium of $250 each year as payment for the insurance. 
The amount Y that the company earns on a randomly selected policy 
of this type is $250 per year, less the $100,000 that it must pay if the 
insured dies. Here is the probability distribution of Y: 


Age at death ea 22 23 
Profit y; —$99,750 —$99,500 —$99,250 
Probability p; 0.00183 0.00186 0.00189 
Age at death 24 25 26 or more 
Profit y; —$99,000 —$98,750 $1250 
Probability p; 0.00191 0.00193 0.99058 


a. Explain why the company suffers a loss of $98,750 on such a 
policy if a client dies at age 25. 


b. Calculate the expected value of Y. Explain what this result means 
for the insurance company. 


c. Calculate the standard deviation of Y. Explain what this result 
means for the insurance company. 


18. Fire insurance Suppose a homeowner spends $300 for a home 


19. 


insurance policy that will pay out $200,000 if the home is destroyed 
by fire in a given year. Let P = the profit made by the company on a 
single policy. From previous data, the probability that a home in this 
area will be destroyed by fire is 0.0002. 


a. Make a table that shows the probability distribution of P. 


b. Calculate the expected value of P. Explain what this result means 
for the insurance company. 


c. Calculate the standard deviation of P. Explain what this result 
means for the insurance company. 


Size of American households In government data, a household 
consists of all occupants of a dwelling unit, while a family consists of 
two or more persons who live together and are related by blood or 
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marriage. So all families form households, but some households are 
not families. Here are the distributions of household size and family 
size in the United States: 


Number of aie 


Household probability 


Family probability 


Let H = the number of people in a randomly selected U.S. household 
and F = the number of people in a randomly chosen U.S. family. 


a. Here are histograms comparing the probability distributions of H 
and F. Describe any differences that you observe. 


b. Find the expected value of each random variable. Explain why this 
difference makes sense. 


c. The standard deviations of the two random variables are oH=1.421 
oH = 1.421 and oF=1.2497F = 1.249. Explain why this difference 
makes sense. 


0.4 


0.3 


Probability 
Probability 


0.0 


0123 45 6 F 0123 4 5 6 7 
H = Number of persons F = Number of persons 
in household in family 


20. Housing in San José How do rented housing units differ from units 
occupied by their owners? Here are the distributions of the number of 
rooms for owner-occupied units and renter-occupied units in San 


José, California:® 


Number of rooms 


10 


1 2 3 4 5 6 7 8 9 


Rented 0.008 | 0.027 | 0.287 | 0.363 | 0.164 | 0.093 | 0.039 | 0.013 | 0.003 | 0.003 


Owned 
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21. 


Let X = the number of rooms in a randomly selected owner-occupied 
unit and Y = the number of rooms in a randomly chosen renter- 
occupied unit. 


a. Here are histograms comparing the probability distributions of X 
and Y. Describe any differences you observe. 


b. Find the expected number of rooms for both types of housing unit. 


Explain why this difference makes sense. 


c. The standard deviations of the two random variables are oX=1.640 
ox = 1.640 and cY=1.3087Y = 1.308. Explain why this difference 
makes sense. 


0.4 


203 
z 
& 0.2 
) 
ih 
3 
01 
0.0 
Cow2232° 4 5 6 FF & F410 
X = Number of rooms in 
owner-occupied units 
0.4 7 
+ 03-4 
2 
2 0.2 4 
= 
oe 
0.14 


OG dk 2 345 6 4 8 940 
Y = Number of rooms in 


renter-occupied units 
Exercises 21 and 22 examine how Benford’s law (Exercise 9) can be 
used to detect fraud. 


Benford’s law and fraud A not-so-clever employee decided to fake 
his monthly expense report. He believed that the first digits of his 
expense amounts should be equally likely to be any of the numbers 
from 1 to 9. In that case, the first digit Y of a randomly selected 
expense amount would have the probability distribution shown in the 
histogram. 
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a. What’s P(Y<6)P (Y <6) According to Benford’s law (see 
Exercise 9), what proportion of first digits in the employee’s 
expense amounts should be greater than 6? How could this 
information be used to detect a fake expense report? 


b. Explain why the mean of the random variable Y is located at the 
solid red line in the figure. 


c. According to Benford’s law, the expected value of the first digit is 
UX=3.441HX = 3.441. Explain how this information could be used 
to detect a fake expense report. 


S 
io 


Probability for 
each value = 1/9 


Probability 
—) 
N 


S 
Fa 


= 
o 


0 1 2 3 4 3 6 7 8 9 
Outcomes 


22. Benford’s law and fraud 


a. Using the graph from Exercise 21, calculate the standard deviation 
oY°Y . This gives us an idea of how much variation we’d expect in 
the employee’s expense records if he assumed that first digits from 
1 to 9 were equally likely. 


b. The standard deviation of the first digits of randomly selected 
expense amounts that follow Benford’s law is oX=2.467x = 2-46, 
Would using standard deviations be a good way to detect fraud? 
Explain your answer. 


23. pg 372 |) Still waiting for the server? How does your web 


browser get a file from the Internet? Your computer sends a request 
for the file to a web server, and the web server sends back a response. 
Let Y = the amount of time (in seconds) after the start of an hour at 
which a randomly selected request is received by a particular web 
server. The probability distribution of Y can be modeled by a uniform 
density curve on the interval from 0 to 3600 seconds. Find the 
probability that the request is received by this server within the first 5 
minutes (300 seconds) after the hour. 
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24. 


25. 


Where’s the bus? Sally takes the same bus to work every morning. 
Let X = the amount of time (in minutes) that she has to wait for the 
bus on a randomly selected day. The probability distribution of X can 
be modeled by a uniform density curve on the interval from 0 minutes 
to 8 minutes. Find the probability that Sally has to wait between 2 and 
5 minutes for the bus. 


Class is over! Mr. Shrager does not always let his statistics class out 
on time. In fact, he seems to end class according to his own “internal 
clock.” The density curve here models the distribution of Y, the 
amount of time after class ends (in minutes) when Mr. Shrager 
dismisses the class on a randomly selected day. (A negative value 
indicates he ended class early.) 


a. Find and interpret P(-1<Y<1)P (ter st) 
b. What is pY/Y ? Explain your answer. 


c. Find the value of k that makes this statement true: P(Y>k)=0.25 
P(Y > k) =0.25 


—] 0 4 
Y = Dismissal time (min. after class ends) 


26. Quick, click! An Internet reaction time test asks subjects to click 


their mouse button as soon as a light flashes on the screen. The light 
is programmed to go on at a randomly selected time after the subject 
clicks “Start.” The density curve models the amount of time Y (in 
seconds) that the subject has to wait for the light to flash. 


a. Find and interpret P(Y>3.75)/ (¥ > 3.75), 
b. What is pY#Y ? Explain your answer. 


c. Find the value of k that makes this statement true: P(Y<k)=0.38 
P(Y <k) =0.38 
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24s 


28. 


20. 


30. 


hf hk a a 
0 1 2 3 4 #5 


Y = Time until light 
flashes (sec) 


pg 373 |») Running a mile A study of 12,000 able-bodied male 


students at the University of Illinois found that their times for the mile 
run were approximately Normal with mean 7.11 minutes and standard 
deviation 0.74 minute.2 Choose a student at random from this group 
and call his time for the mile Y. Find P(Y<6)P (Y <6). Interpret this 
value. 


Give me some sugar! Machines that fill bags with powdered sugar 
are supposed to dispense 32 ounces of powdered sugar into each bag. 
Let X = the weight (in ounces) of the powdered sugar dispensed into a 
randomly selected bag. Suppose that X can be modeled by a Normal 
distribution with mean 32 ounces and standard deviation 0.6 ounce. 


Find P(X<31)P (X < 31), Interpret this value. 


Horse pregnancies Bigger animals tend to carry their young longer 
before birth. The length of horse pregnancies from conception to birth 
varies according to a roughly Normal distribution with mean 336 days 
and standard deviation 6 days. Let X = the length of a randomly 
selected horse pregnancy. 


a. Write the event “pregnancy lasts between 325 and 345 days” in 
terms of X. Then find this probability. 


b. Find the value of c such that P(X>c)=0.20" (X > c) = 0.20. 


Ace! Professional tennis player Novak Djokovic hits the ball 
extremely hard. His first-serve speeds follow an approximately 
Normal distribution with mean 115 miles per hour (mph) and 
standard deviation 6 mph. Choose one of Djokovic’s first serves at 
random. Let Y = its speed, measured in miles per hour. 


a. Write the event “speed is between 100 and 120 miles per hour” in 
terms of Y. Then find this probability. 
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b. Find the value of c such that P(Y<c)=0.15P(¥ Sc) = 0.15 


Multiple Choice Select the best answer for Exercises 31—34. 


Exercises 31—33 refer to the following setting. Choose an American 
household at random and let the random variable X be the number of cars 
(including SUVs and light trucks) they own. Here is the probability 
distribution if we ignore the few households that own more than 5 cars: 


Number of cars 0 1 2 3 4 5 
Probability 0.09 0.36 0.35 0.13 0.05 0.02 


31. What’s the expected number of cars in a randomly selected American 
household? 


a. 1.00 
b. 1.75 
c. 1.84 
d. 2.00 


e. 2.50 


32. The standard deviation of X is oX=1.087X = 1-08. If many 
households were selected at random, which of the following would be 
the best interpretation of the value 1.08? 


a. The mean number of cars would be about 1.08. 
b. The number of cars would typically be about 1.08 from the mean. 
c. The number of cars would be at most 1.08 from the mean. 


d. The number of cars would be within 1.08 from the mean about 
68% of the time. 


e. The mean number of cars would be about 1.08 from the expected 
value. 


33. About what percentage of households have a number of cars within 2 
standard deviations of the mean? 
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a. 68% 
b. 71% 
c. 93% 
d. 95% 
e. 98% 


34. A deck of cards contains 52 cards, of which 4 are aces. You are 
offered the following wager: Draw one card at random from the deck. 
You win $10 if the card drawn is an ace. Otherwise, you lose $1. If 
you make this wager very many times, what will be the mean amount 
you win? 


a. About —$1, because you will lose most of the time. 

b. About $9, because you win $10 but lose only $1. 

c. About —$0.15; that is, on average, you lose about 15 cents. 
d. About $0.77; that is, on average, you win about 77 cents. 


e. About $0, because the random draw gives you a fair bet. 


Recycle and Review 


Exercises 35 and 36 refer to the following setting. Many chess masters and 
chess advocates believe that chess play develops general intelligence, 
analytical skill, and the ability to concentrate. According to such beliefs, 
improved reading skills should result from study to improve chess-playing 
skills. To investigate this belief, researchers conducted a study. All the 
subjects in the study participated in a comprehensive chess program, and 
their reading performances were measured before and after the program. 
The graphs and numerical summaries that follow provide information on 
the subjects’ pretest scores, posttest scores, and the difference (Post — Pre) 
between these two scores. 


Descriptive Statistics: Pretest, Posttest, Post — Pre 
Variable N Mean Median  StDev Min Max Qi Q3 
Pretest 53 57.70 58.00 17.84 23.00 99.00 44.50 70.50 
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Posttest 53 «63.08 64.00 18.70 28.00 99.00 48.00 76.00 
Post — Pre 5935.38 3.00 13.02 -19.00 42.00 -3.50 14.00 


lee 


T T T 
Pretest Posttest Post — Pre 


Reading performance 


35. Better readers? (1.3, 4.3) 


a. Did students tend to have higher reading scores after participating 
in the chess program? Justify your answer. 


b. If the study found a statistically significant improvement in the 
average reading score, could you conclude that playing chess 
causes an increase in reading skills? Justify your answer. 

Some graphical and numerical information about the relationship 
between pretest and posttest scores is provided here. 


100 = ° ° 


90 4 e ee 


Posttest 
os 
e 
ec 


oI T T l T l T l 
20 30 40 50 60 70 80 90 100 


Pretest 


Regression Analysis: Posttest Versus Pretest 


Predictor Coef SE Coef a8 P 
Constant 17.897 5.889 3.04 0.004 
Pretest 0.78301 0.09758 8.02 0.000 
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S = 12.55 R-Sq = 55.8% R-Sq(adj)= 54.9% 


Residual 


Fitted value 


36. Predicting posttest scores (3.2) 


a. What is the equation of the least-squares regression line relating 
posttest and pretest scores? Define any variables used. 


b. Is a linear model appropriate for describing this relationship? 
Justify your answer. 


c. If we use the least-squares regression line to predict students’ 
posttest scores from their pretest scores, how far off will our 
predictions typically be? 
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SECTION 6.2 Transforming and Combining 
Random Variables 


LEARNING TARGETS By the end of the section, you should be able to: 


e Describe the effect of adding or subtracting a constant or 
multiplying or dividing by a constant on the probability distribution 
of a random variable. 


e Calculate the mean and standard deviation of the sum or difference 
of random variables. 


e Find probabilities involving the sum or difference of independent 
Normal random variables. 


In Section 6.1, we looked at several examples of random variables and 
their probability distributions. We also saw that the mean py and standard 


deviation oy give us important information about a random variable. 


LS 
ay 
{i324 - 


sjharmon/Getty Images 


Consider this new setting. An American roulette wheel has 38 slots 
numbered 1 through 36, plus O and 00. Half of the slots from 1 to 36 are 
red; the other half are black. Both the 0 and 00 slots are green. Suppose 
that a player places a $1 bet on red. If the ball lands in a red slot, the player 
gets the original dollar back, plus an extra dollar for winning the bet. If the 
ball lands in a different-colored slot, the player loses the $1 bet. Let X = 
the net gain on a single $1 bet on red. Because there is an 18/38 chance 
that the ball lands in a red slot, the probability distribution of X is as shown 
in the table. 
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Value x; — $1 $1 
Probability pj 20/38 18/38 


The mean of X is 
uX=(—1)(2038)+(1)(1838)=—-$0.05 


20 18 — 
fx = (-1) (=) + (1) (=) = —$0.05 
That is, a player can expect to lose an average of 5 cents per $1 bet if he 
plays many, many games. You can verify that the standard deviation is dy 
= $1.00. If the player only plays a few games, his actual net gain could be 
much better or worse than this expected value. 

Would the player be better off playing one game of roulette with a $2 
bet on red or playing two games and betting $1 on red each time? To find 
out, we need to compare the probability distributions of the random 
variables Y = gain from a $2 bet and T = total gain from two $1 bets. 
Which random variable (if either) has the higher expected gain in the long 
run? Which has the larger variability? By the end of this section, you’Il be 
able to answer questions like these. 


Transforming a Random Variable 


In Chapter 2, we studied the effects of transformations on the shape, 
center, and variability of a distribution of quantitative data. Here’s what 
we discovered: 


.. Adding (or subtracting) a constant: Adding the same positive number a 
to (subtracting a from) each observation: 


e Adds a to (subtracts a from) measures of center and location (mean, 
median, quartiles, percentiles). 


e Does not change measures of variability (range, IQR, standard 
deviation). 


e Does not change the shape of the distribution. 


’, Multiplying or dividing by a constant: Multiplying (or dividing) each 
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observation by the same positive number b: 


e Multiplies (divides) measures of center and location (mean, median, 
quartiles, percentiles) by b. 


e Multiplies (divides) measures of variability (range, IQR, standard 
deviation) by b. 


e Does not change the shape of the distribution. 


How are the probability distributions of random variables affected by 
similar transformations? 


EFFECT OF ADDING OR SUBTRACTING A CONSTANT Let’s 
return to a familiar setting from Section 6.1. Pete’s Jeep Tours offers a 
popular day trip in a tourist area. There must be at least 2 passengers for 
the trip to run, and the vehicle will hold up to 6 passengers. Pete charges 
$150 per passenger. Let C = the total amount of money that Pete collects 
on a randomly selected trip. The probability distribution of C is shown in 
the table and the histogram. 


0 150 300 450 600 750 900 
C = Money collected ($) 


Total 300 450 600 750 900 
collected c; 


Probability pj 0.15 0.25 0.35 0.20 0.05 
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Earlier, we calculated the mean of C as pC=$562.50HC = $562.50 and the 
standard deviation of C as oC=$163.467¢ = $163.46 We can describe the 
probability distribution of C as follows: 


Shape: Roughly symmetric with a single peak 
Center: }C=$562.50HC = $562.50 
Variability: oC=$163.467c = $163.46 


It costs Pete $100 to buy permits, gas, and a ferry pass for each day 
trip. The amount of profit V that Pete makes on a randomly selected trip is 
the total amount of money C that he collects from passengers minus $100. 
That is, V= C — 100. The probability distribution of V is 


Profit v; 200 350 500 650 800 
Probability p; 0.15 0.25 0.35 0.20 0.05 


A histogram of this probability distribution is shown here. 


0.4 


S 
w 


Probability 
oS 
Nm 


0.1 


0 200 350 500 650 800 
V = Profit ($) 


We can see that the probability distribution of V has the same shape as 
the probability distribution of C. The mean of V is 


uv=(200)(0.15) + (350)(0.25) + (500)(0.35) + (650)(0.20) + (800) 
(0.05)=$462.50 


(200)(0.15) + (350)(0.25) + (500)(0.35) + (650)(0.20) + (800)(0.05) 
$462.50 


I| 


fly 


II 


On average, Pete will make a profit of $462.50 from the trip. That’s 


$100 less than pC¥C , his mean amount of money collected per trip. The 
standard deviation of V is 


oC=(200-462.50)2(0.15)+(350-462.50)2(0.25)+++-+ 
(800-462.50)2(0.05)=$163.46 
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oc = 4/(200 — 462.50)? (0.15) + (350 — 462.50)” (0.25) + --- + (800 — 462.50)? (0.05) 
= $163.46 
That’s the same as the standard deviation of C. 


It’s fairly clear that subtracting 100 from the values of the random 
variable C just shifts the probability distribution to the left by 100. This 
transformation decreases the mean by 100 (from $562.50 to $462.50), but 
it doesn’t change the standard deviation ($163.46) or the shape. These 
results can be generalized for any random variable. 


THE EFFECT OF ADDING OR SUBTRACTING A 
CONSTANT ON A PROBABILITY DISTRIBUTION 


Adding the same positive number a to (subtracting a from) each value 
of a random variable: 


e Adds a to (subtracts a from) measures of center and location (mean, 
median, quartiles, percentiles). 


e Does not change measures of variability (range, JQR, standard 
deviation). 


e Does not change the shape of the probability distribution. 


Note that adding or subtracting a constant affects the distribution of a 
quantitative variable and the probability distribution of a random variable 
in exactly the same way. 


EXAMPLE | Scaling test scores » 


Effect of adding/subtracting a constant 
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PROBLEM: In a large introductory statistics class, the score X of a 
randomly selected student on a test worth 50 points can be modeled by a 
Normal distribution with mean 35 and standard deviation 5. Due to a 
difficult question on the test, the professor decides to add 5 points to 
each student’s score. Let Y be the scaled test score of the randomly 
selected student. Describe the shape, center, and variability of the 
probability distribution of Y. 


SOLUTION: 

Shape: Approximately Normal 

Center: pY = pX+5 = 35+5 = 40HY ~ HX +5 = 35+5= 40 
Variability: oY = 0X =50Y =%x =5 


Notice that Y= X + 5. Adding a constant doesn’t affect the shape or the 


standard deviation of the probability distribution. 


FOR PRACTICE, TRY EXERCISE 37 


EFFECT OF MULTIPLYING OR DIVIDING BY A CONSTANT 
The professor in the preceding example decides to convert his students’ 
scaled test scores Y to percentages. Because the test was scored out of 50 
points, the professor multiplies each student’s scaled score by 2 to convert 
to a percent score W. That is, W = 2Y. Figure 6.3 displays the probability 
distributions of the random variables Y and W. From the graphs, we can 
see that the measures of center, location, and variability have all doubled 
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—just like the individual students’ scores. But the shape of the two 
distributions is the same. 


i \ 
Ke 
~*~ 
20 30 40 50 60 70 80 90 100 110 
Y = scaled test score (out of 50) W = test score as a percent 
Shape: Approximately Normal Shape: Approximately Normal 
Center: y = 40 Center: w = 80 
(@) Variability: y = 5 (D) Variability: w = 10 


FIGURE 6.3 Probability distribution of (a) Y= a randomly selected statistics 
student’s scaled test score out of 50 and (b) W = the student’s scaled test score as a 
percent. 


THE EFFECT OF MULTIPLYING OR DIVIDING BY A 
CONSTANT ON A PROBABILITY DISTRIBUTION 


Multiplying (or dividing) each value of a random variable by the 
same positive number b: 


Multiplies (divides) measures of center and location (mean, median, 
quartiles, percentiles) by b. 


Multiplies (divides) measures of variability (range, JQR, standard 
deviation) by b. 


Does not change the shape of the distribution. 


Once again, multiplying or dividing by a constant has the same effect 
on the probability distribution of a random variable as it does on a 
distribution of quantitative data. 


EXAMPLE |How much does college cost? » 
Effect of multiplying/dividing by a constant 
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PROBLEM: El Dorado Community College considers a student to be 
full-time if he or she is taking between 12 and 18 units. The number of 
units X that a randomly selected E] Dorado Community College full- 
time student is taking in the fall semester has the following distribution. 


Number of 12 13 14 15 16 17 18 
units 


Probability 0.25 0.10 0.05 0.30 0.10 0.05 0.15 


At right is a histogram of the probability distribution. The mean is 


uX=14.654x = 14.65 and the standard deviation is oX=2.056 
ox = 2.056 


At El Dorado Community College, the tuition for full-time students is 
$50 per unit. That is, if T = tuition charge for a randomly selected full- 
time student, T = 50X. 


a. What shape does the probability distribution of T have? 


Multiplying by a constant doesn’t change the shape. 


b. Find the mean of T. 


c. Calculate the standard deviation of T. 


0.30 4 


0.25 4 


Probability 
i) 
8 
n 


12 13 14 15 16 17 18 
X = Number of units 


SOLUTION: 


a. The same shape as the probability distribution of X: roughly 
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symmetric with three peaks. 
b. pT=50pX = 50(14.65) = $732.50HT = 50ux = 50 (14.65) = $732.50 


c. oT =500X = 50(2.056) = $102.807F = 50x = 50 (2.056) = $102.80 


FOR PRACTICE, TRY EXERCISE 41 


It is not common to multiply (or divide) a random variable by a 
negative number b. Doing so would multiply (or divide) the measures of 
variability by |b]. We can’t have a negative amount of variability! 
Multiplying or dividing by a negative number would also affect the shape 
of the probability distribution, as all values would be reflected over the y 
axis. 


Think About It 


HOW DOES MULTIPLYING BY A CONSTANT AFFECT THE 
VARIANCE? For El Dorado Community College, the variance of the 
number of units that a randomly selected full-time student takes is 


oX2=4,22757x = 4.2275 The variance of the tuition charge for such a 
student is oT2=10,568.7577 ~ 10,568.75. That’s (2500)(4.2275). So 
oT2=25000X27r = 25000% Where did 2500 come from? It’s just 
(50)2. In other words, oT2=(50)20X2°r = (50)°o% Multiplying a 
random variable by a constant b multiplies the variance by b2. 


CHECK YOUR UNDERSTANDING 


(0 


y A large auto dealership keeps track of sales made during each hour 
of the day. Let X = the number of cars sold during the first hour of 
business on a randomly selected Friday. Based on previous records, the 
probability distribution of X is as follows: 
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Cars sold 0 1 2 2 
Probability 03 04 O02 O.1 


The random variable X has mean pX=1.14X ~ 1.1 and standard 

deviation oX=0.9437x = 0.943, 

Suppose the dealership’s manager receives a $500 bonus from the 
company for each car sold. Let Y = the bonus received from car sales 
during the first hour on a randomly selected Friday. 

1. Sketch a graph of the probability distribution of X and a separate 
graph of the probability distribution of Y. How do their shapes 
compare? 

2. Find the mean of Y. 


3. Calculate and interpret the standard deviation of Y. 


The manager spends $75 to provide coffee and doughnuts to 
prospective customers each morning. So the manager’s net profit T 
during the first hour on a randomly selected Friday is $75 less than 
the bonus earned. 


4. Describe the shape, center, and variability of the probability 
distribution of T. 


PUTTING IT ALL TOGETHER: ADDING/SUBTRACTING AND 
MULTIPLYING/DIVIDING What happens if we transform a random 
variable by both adding or subtracting a constant and multiplying or 
dividing by a constant? We just use the order of operations and the facts 
about transforming data that we’ve already established. 


EXAMPLE |The baby and the bathwater >» 


Analyzing the effect of transformations 
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PROBLEM: One brand of baby bathtub comes with a dial to set the 
water temperature. When the “babysafe” setting is selected and the tub 
is filled, the temperature X of the water in a randomly selected bath 
follows a Normal distribution with a mean of 34°C and a standard 
deviation of 2°C. Let Y be the water temperature in degrees Fahrenheit 
pct 
for the randomly selected bath. Recall that F=95 eae yt $2] 
a. Find the mean of Y. 


b. Calculate and interpret the standard deviation of Y. 


SOLUTION: 


a. WY = 95pX+32 = 95 (34)+32 = 93.2F° 
py = 2px +32 = 2 (34) + 32 = 93.2°F 


= 2g, =» (2) =3.6° 
peo = ooo <= ose ocne ’ 5 OX 5 (7) S40 2 


When the dial is set on “babysafe,” the temperature of a randomly 
selected bath typically varies about 3.6°F from the mean of 93.2°F. 


Multiplying each value of X by 9/5 multiplies the standard deviation 
by 9/5. However, adding 32 to each value doesn’t affect measures of 
variability. 
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FOR PRACTICE, TRY EXERCISE 47 


The probability distribution of Y = water temperature on a randomly 
selected day when the “babysafe” setting is used is Normal because the 
original distribution is Normal, and adding a constant and multiplying by a 
constant don’t affect shape. We can use this Normal distribution to find 
probabilities as we did in Section 6.1. For instance, according to Babies R 
Us, the temperature of a baby’s bathwater should be between 90°F and 
100°F. What’s P(90<Y<100)? (99 < ¥ < 100)? Figure 6.4 shows the 
desired probability as an area under a Normal curve. 


Y = Water temperature (°F) 
FIGURE 6.4 The Normal probability distribution of the random variable Y = the 
temperature (in degrees Fahrenheit) of the bathwater when the dial is set on 


“babysafe.” The shaded area is the probability that the water temperature is 
between 90°F and 100°F. 


To find the probability, we can either (i) standardize the boundary 
values and use Table A or technology; or (ii) use technology without 
standardizing. 


(i) z=90-93.23.6=—0.89 2=100-93.23.6=1.89 
2a OS = 08900 z= = 189 


Using Table A: 0.9706-0.1867=0.78390-9706 — 0.1867 = 0.7839 


Using technology: normalcdf(lower:—0.89, upper:1.89, mean:0, SD:1) 
= 0.7839 


(ii) normalcdf(lower:90, upper:100, mean:93.2, SD:3.6) = 0.7835 


When set on “babysafe” mode, there’s about a 78% probability that the 
water temperature meets the recommendation for a randomly selected 
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bath. 


Combining Random Variables 


So far, we have looked at settings that involved a single random variable. 
Many interesting statistics problems require us to combine two or more 
random variables. 

Let’s return to the familiar setting of Pete’s Jeep Tours. Earlier, we 
focused on the amount of money C that Pete collects on a randomly 
selected day trip. This time we’ll consider a different but related random 
variable: X = the number of passengers on a randomly selected trip. Here is 
its probability distribution: 


Number of 2 3 4 5 6 
passengers x; 


Probability p; 0.15 0.25 0.35 0.20 0.05 


You can use what you learned earlier to confirm that pX=3.754X = 3.75 
passengers and oX=1.08979X = 1.0897 passengers. 

Pete’s sister Erin runs jeep tours in another part of the country on the 
same days as Pete in her slightly smaller vehicle, under the name Erin’s 
Adventures. The number of passengers Y on a randomly selected trip has 
the following probability distribution. You can confirm that pY=3.10 
py = 3.10 passengers and oY=0.9437y = 9.943 passengers. 


Number of passengers y; 2 3 4 °5 
Probability p; 0.3 0.4 0.2 0.1 


Here are two questions that we would like to answer based on this 
scenario: 


e What is the sum S = X + Y of the number of passengers Pete and Erin 
will have on their tours on a randomly selected day? 


e What is the difference D = X — Y in the number of passengers Pete and 
Erin will have on a randomly selected day? 
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As this setting suggests, we want to investigate what happens when we add 
or subtract random variables. 


MEAN (EXPECTED VALUE) OF THE SUM OR DIFFERENCE OF 
TWO RANDOM VARIABLES How many total passengers S can Pete 
and Erin expect to have on their tours on a randomly selected day? 
Because Pete averages pX=3.754X = 3.75 passengers per trip and Erin 
averages pY=3.10HY = 3.10 passengers per trip, they will average a total 
of pS=3.75+3.10=6.8545 = 3.75 + 3.10 = 6.85 passengers per day. We 
can generalize this result for any two random variables. 


MEAN (EXPECTED VALUE) OF A SUM OF RANDOM 
VARIABLES 


For any two random variables X and Y, if S = X Y, the mean (expected 


value) of S is 
uS=pX+Y=pX+pYHs = Mx+y = Mx + My 


In other words, the mean of the sum of two random variables is equal to 
the sum of their means. 


What’s the mean of the difference D = X — Y in the number of 
passengers that Pete and Erin have on their tours on a randomly selected 
day? Because Pete averages pX=3.754X = 3.75 passengers per trip and 
Erin averages pY=3.10HY — 3.10 passengers per trip, the mean difference 
is }D=3.75-3.10=0.654D = 3-75 — 3.10 = 0.65 passengers. That is, Pete 
averages 0.65 more passengers per day than Erin does. Once again, we can 
generalize this result for any two random variables. 


MEAN (EXPECTED VALUE) OF A DIFFERENCE OF 
RANDOM VARIABLES 


For any two random variables X and Y, if D = X — Y, the mean 
(expected value) of D is 


uD=pX-Y=yX-pYHp = Hx-y = Mx — by 
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In other words, the mean of the difference of two random variables is 
equal to the difference of their means. 


@ The order of subtraction is important. If we had defined D = Y 
— X, then pD=pY-pX=3.10-3.75=—-0.65 
Hp = by — Ux = 3.10 — 3.75 = —0.65— 3 10 - 3.75 = -0.65. In other 
words, Erin averages 0.65 fewer passengers than Pete does on a randomly 
chosen day. 


EXAMPLE |How much do Pete and Erin make? » 


Mean of a sum or difference of random variables 


moodboard/Superstock 


PROBLEM: Pete charges $150 per passenger and Erin charges $175 
per passenger for a jeep tour. Let C = the amount of money that Pete 
collects and E = the amount of money that Erin collects on a randomly 
selected day. From our earlier work, we know that pC=562.50 

tc = 562.50 and it is easy to show that pE=542.50He = 542-50. Define 
S=C +E. Calculate and interpret the mean of S. 


SOLUTION: 


Us=pct+pE=562.50+542.50=$1105.00 
lls = fe + fe = 562.50 + 542.50 = $1105.00 


Pete and Erin expect to collect a total of $1105 per day, on average, over 
many randomly selected days. 


FOR PRACTICE, TRY EXERCISE 49 
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How did we calculate uC¥C and pEM£ in the example? Earlier, we 
defined X = the number of passengers that Pete has and Y = the number of 
passengers that Erin has on a randomly selected day trip. Recall that 
UX=3.75HX = 3.75 and wY=3.10HY = 3.10 Because Pete charges $150 
per passenger, the amount of money that he collects on a randomly 
selected day is C =150X. Multiplying a random variable by a constant 
multiplies the value of the mean by the same constant: 


Because Erin charges $175 per passenger, E = 175Y and 
pE=175pY=175(3.10)=$542.50He = 175my = 175 (3.10) = $542.50 
What’s the mean of the difference D = C — E in the amounts that Pete 

and Erin collect on a randomly chosen day? It’s 


uD=pC-pE=562.50-542.50=$20.00 
Lip = Uc — br = 562.50 — 542.50 = $20.00 


On average, Pete collects $20 more per day than Erin does. 


Standard Deviation of the Sum or Difference of Two 
Random Variables 


How much variation is there in the total number of passengers S = X + Y 
who go on Pete’s and Erin’s tours on a randomly chosen day? Here are the 
probability distributions of X and Y once again. Let’s think about the 
possible values of S. The number of passengers X on Pete’s tour is between 
2 and 6, and the number of passengers Y on Erin’s tour is between 2 and 5. 
So the total number of passengers S is between 4 and 11. That is, there’s 
more variability in the values of S than in the values of X or Y alone. This 
makes sense, because the variation in X and the variation in Y both 
contribute to the variation in S. 


Pete’s Jeeps 


Number of 2 2 4 5 6 
passengers x; 


0.15 0.25 0.35 0.20 0.05 
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Probability p; 


px=3.75  ox=1.0897H2 = 3.75 oo, = 1.0897 


Erin’s Adventures 
Number of passengers y; 2 3 4 5 
Probability p; 0.3 0.4 0.2 0.1 


uY=3.10 oY=0.943HY =3.10 cy = 0.943 


What’s the standard deviation of S = X + Y? If we had the probability 
distribution of S, then we could calculate oS7S. Let’s try to construct this 
probability distribution starting with the smallest possible value, S = 4. The 
only way to get a total of 4 passengers is if Pete has X = 2 passengers and 
Erin has Y = 2 passengers. We know that P(X = 2) = 0.15 and that P(Y = 2) 
= 0.3. If the events X = 2 and Y = 2 are independent, we can use the 
multiplication rule for independent events to find P(X = 2 and Y= 2). 
Otherwise, we’re stuck. In fact, we can’t calculate the probability for any 
value of S unless X and Y are independent random variables. 


DEFINITION Independent random variables 


If knowing the value of X does not help us predict the value of Y, then 
X and Y are independent random variables. 


It’s reasonable to treat the random variables X = number of passengers 
on Pete’s trip and Y = number of passengers on Erin’s trip on a randomly 
chosen day as independent, because the siblings operate their trips in 
different parts of the country. Because X and Y are independent, 

P(S=4)=P(X=2 and Y=2)=(0.15)(0.3)=0.045 
ra=4) = Piss —Zaend? =2) 
= (0.15)(0.3) = 0.045 


There are two ways to get a total of S = 5 passengers on a randomly 
selected day: X = 3, Y=2 or X = 2, Y=3. So 


P(S=5)=P(X=2 and Y=3)+P(X=3 and Y=2)=(0.15)(0.4)+(0.25) 
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(0.3)=0.06+0.075=0.135 
Pis=6) = PiA—ZendY =3)+ P(X —SandyY = 72) 


= (0.15)(0.4) + (0.25)(0.3) 
= 0.06 + 0.075 = 0.135 


We can construct the probability distribution by listing all combinations of 
X and Y that yield each possible value of S and adding the corresponding 
probabilities. Here is the result: 


Sum s; 4 5 6 7 8 9 10 11 
Probability p; 0.045 0.135 0.235 0.265 0.190 0.095 0.030 0.005 


The mean of S is 


uS=¥. sipi=(4)(0.045)+(5)(0.135)+-+(11)(0.005)=6.85 
us = S~ sip; = (4) (0.045) + (5) (0.135) +--+ + (11) (0.005) = 6.85 


Our calculation confirms that 


pS=pX+pY=3.75+3.10=6.85Hs = Mx + My = 3.75 + 3.10 = 6.85 


The variance of S is 


0S2=Y (si-uS)2 pi=(4—6.85)2(0.045)+(5-6.85)2(0.135)+-+:+ 
(11-6.85)2(0.005)=2.0775 
oO = > (s pus)” Pi 
= (4—6.85)° (0.045) + (5 — 6.85)” (0.135) + --- + (11 — 6.85)” (0.005) 
= 2.0775 


The variances of X and Y are 0oX2=(1.0897)2=1.1875 
o% = (1.0897)" = 1.1875 ang gy2=(0.943)2=0.897F = (0-943)" = 0.89 
Notice that 


oX2+0Y2=1.1875+0.89=2.0775=082 
o% +02 = 1.1875 + 0.89 = 2.0775 = 02 


In other words, the variance of a sum of two independent random variables 
is the sum of their variances. To find the standard deviation of S, take the 
square root of the variance: 
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0S=2.0775=1.4419s = V 2.0775 = 1.441 


The total number of passengers on Pete’s and Erin’s trips on a randomly 
selected day typically varies by about 1.441 passengers from the mean of 
6.85 passengers. 


STANDARD DEVIATION OF THE SUM OF TWO 
INDEPENDENT RANDOM VARIABLES 


For any two independent random variables X and Y, if S = X + Y, the 
variance of S is 


a — | 2 
oS2=0X+Y2=0X2+0Y275 ~ Txiy = 7x Tt Fy 


To get the standard deviation of S, take the square root of the variance: 


Os = Oxsy = Ox = ox. 
OS=0X+Y=0X2+0Y2 


o.=-07,+02. ; 
The formula oS2=o0X2+o0Y2° S$ X Y is sometimes referred to as the 


“Pythagorean theorem of statistics.” It certainly looks similar to c2=a2+b2 
2 — 72 2 
c* = a* + b*) Just as the real Pythagorean theorem only applies to right triangles, 


the f = o=or.+02% a i 
e formula oS2=0X2+o0Y2° § X Y only applies if X and Y are independent 


random variables. 


You might be wondering whether there’s a formula for computing the 
variance or standard deviation of the sum of two random variables that are 
not independent. There is, but it’s beyond the scope of this course. 


When we add two independent random variables, their variances add. 
oo Standard deviations do not add. For Pete’s and Erin’s passenger 


totals, 


oX+o0Y=1.0897+0.943=2.03279x + oy = 1.0897 + 0.943 = 2.0327 


This is very different from oS=1.441.7s = 1.441. 


Can you guess what the variance of the difference of two independent 
random variables will be? If you were thinking something like “the 
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difference of their variances,” think again! Here are the probability 
distributions of X and Y from the jeep tours scenario once again: 


Pete’s Jeeps 
Number of passengers x; 2 3 4 5 6 
Probability p; 0.15 0.25 035 0.20 0.05 


UX = 3.754X = 3.75 
oX = 1.08979X = 1.0897 
Erin’s Adventures 


Number of passengers y; y 5 4 5 
Probability p; 0.3 0.4 0.2 0.1 


pY =3.10#¥ = 3.10 
oY =0.943¢Y = 0.943 
By following the process we used earlier with the random variable S = 


X + Y, you can build the probability distribution of D = X — Y. 


Value d; -3 -2 -1 0 1 2 3 4 
Probability p; 0.015 0.055 0.145 0.235 0.260 0.195 0.080 0.015 


You can use the probability distribution to confirm that: 
|. pD=0.65=3.75-3.10=pX—pYHD = 0.65 = 3.75 — 3.10 = x — py 
», OD2=2.0775=1.1875+0.89=0X2+0Y2 
o2, = 2.0775 = 1.1875 + 0.89 = 0% +02, 
§. 0D=2.0775=1.4419D = V2.0775 = 1.441 


Result 2 shows that, just as with addition, when we subtract two 
independent random variables, variances add. There’s more variability in 
the values of the difference D than in the values of X or Y alone. This 
should make sense, because the variation in X and the variation in Y both 
contribute to the variation in D. 
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STANDARD DEVIATION OF THE DIFFERENCE OF 
TWO INDEPENDENT RANDOM VARIABLES 


For any two independent random variables X and Y, if D = X — Y, the 
variance of D is 


) _ 9 2 
oD2=0X-Y2=0X2+0Y 27D — oy — Oy i Oy 


To get the standard deviation of D, take the square root of the variance: 


On=dr_y = fteodl 2 
oD=0X-Y=0X2+0 22 > OXY = Vx 1% 


Let’s put this new rule to use in a familiar setting. 


EXAMPLE |How much do Pete’s and Erin’s earnings 
vary? |» 


SD of a sum or difference of random variables 


PROBLEM: Pete charges $150 per passenger and Erin charges $175 
per passenger for a jeep tour. Let C = the amount of money that Pete 
collects and E = the amount of money that Erin collects on a randomly 
selected day. From our earlier work, it is easy to show that oC=$163.46 
oc = $163.46 and oE=$165.0372 = $165.03, you may assume that 
these two random variables are independent. Define D = C — E. Earlier, 


we found that pD=$2042 ~ $20 Calculate and interpret the standard 
deviation of D. 


SOLUTION: 


D=C~—E. Because C and E are independent random variables, 
oD2 = oC2+0E2 = (163.46)2+(165.03)2 = 53,954.07 
o?, = 02, + 03, = (163.46)” + (165.03)” = 53, 954.07 


OD = 53,954.07 = $232.287D = V/93, 954.07 = $232.28 


The difference (Pete — Erin) in the amount collected on a randomly 
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selected day typically varies by about $232.28 from the mean difference 
of $20. 


Note that variances add when you are dealing with the sum or difference 


of independent random variables. 


FOR PRACTICE, TRY EXERCISE 57 


How did we calculate oC7C and oE?£ in the example? Earlier, we 
defined X = the number of passengers that Pete has and Y = the number of 
passengers that Erin has on a randomly selected day trip. Recall that 
oX=1.08977x = 1.0897 and cY=0.9437Y = 9.943, Because Pete charges 
$150 per passenger, the amount of money that he collects on a randomly 
selected day is C =150X. Multiplying a random variable by a constant 
multiplies the value of the standard deviation by the same constant: 


Because Erin charges $175 per passenger, E = 175Y and 
OE=1750Y=175(0.943)=$165.0378 = 175ey = 175 (0.943) = $165.03 


COMBINING VERSUS TRANSFORMING RANDOM VARIABLES 
We can extend our rules for combining random variables to situations 
involving repeated observations of the same chance process. Let’s return 
to the gambler we met at the beginning of this section. Suppose he plays 
two games of roulette, each time placing a $1 bet on red. What can we say 
about his total gain (or loss) from playing two games? Earlier, we showed 
that if X = the amount gained on a single $1 bet on red, then pX=—$0.05 
px = —$0.05 and oX=-$1.007x = —$1.00, Because we’re interested in 
the player’s total gain over two games, we’|l define X; as the amount he 
gains from the first game and X> as the amount he gains from the second 
game. Then his total gain T= X, + X. Both X; and X> have the same 
probability distribution as X and, therefore, the same mean (—$0.05) and 
standard deviation ($1.00). The player’s expected gain in two games is 


uT=yX1+pX2=(-$0.05)+(-$0.05)=—-$0.10 
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lr = wx, + wx, = (—$0.05) + (—$0.05) = —$0.10 


Because knowing the result of one game tells the player nothing about the 
result of the other game, X, and X> are independent random variables. As a 


result, 


6T2=0X12+0X22=(1.00)2+(1.00)2=2.00 
Op = 0% + 0%, = (1.00)” + (1.00)” = 2.00 


and the standard deviation of the player’s total gain is 
oT=.00=$1.4199T =—v¥ .00 = $1.41 


At the beginning of the section, we asked whether a roulette player 
would be better off placing two separate $1 bets on red or a single $2 bet 
on red. We just showed that the expected total gain from two $1 bets is 
uT=—$0.104T = ~$0.10 With a standard deviation of oT=$1.4197r =$1.41, 
Now think about what happens if the gambler places a $2 bet on red ina 
single game of roulette. Because the random variable X represents a 
player’s gain from a $1 bet, the random variable Y = 2X represents his gain 
from a $2 bet. 


What’s the player’s expected gain from a single $2 bet on red? It’s 


WY =2uX=2(-$0.05)=—-$0.10HY = 2ux = 2(—$0.05) = —$0.10 
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That’s the same as his expected gain from playing two games of roulette 
with a $1 bet each time. But the standard deviation of the player’s gain 
from a single $2 bet is 


0 Y=20X=2($1.00)=$2.00°¥ = 20x =2 ($1.00) = $2.00 
Compare this result to oT=$1.419r =$1.41, There’s more variability 
in the gain from a single $2 bet than in the total gain from two $1 bets. oOo 
Bottom line: X, + Xz is not the same as 2X. 
Let’s take this one step further. Would it be better for the player to 
place a single $100 bet on red or to play 100 games and bet $1 each time 
on red? With the single $100 bet, the player will either win big or lose big 


(lots of variation). But with 100 $1 bets, there will be some wins and some 
losses, and the end result will be much less variable. 


CHECK YOUR UNDERSTANDING 


A 


\ 


y A large auto dealership keeps track of sales and lease agreements 
made during each hour of the day. Let X = the number of cars sold and Y 
= the number of cars leased during the first hour of business on a 
randomly selected Friday. Based on previous records, the probability 
distributions of X and Y are as follows: 


Cars sold x; 0 1 2 3 
Probability p; 03 04 0.2 0.1 


Mean: tX=1.1Standard deviation: oX=0.943 
Mean: px = 1.1 Standard deviation: cx = 0.943 


Cars leased y; 0 i 2 
Probability p; 04 05 O.1 


Mean: pY=0.7Standard deviation: oY =0.64 
Mean: py = 0.7 Standard deviation: cy = 0.64 
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Define T = X + Y. Assume that X and Y are independent. 
1. Find and interpret pTH7. 
2. Calculate and interpret oT?7. 


3. The dealership’s manager receives a $500 bonus for each car sold and 
a $300 bonus for each car leased. Find the mean and standard 
deviation of the manager’s total bonus B. 


Combining Normal Random Variables 


So far, we have concentrated on developing rules for means and variances 
of random variables. If a random variable is Normally distributed, we can 
use its mean and standard deviation to compute probabilities. What 
happens if we combine two independent Normal random variables? 

We used software to simulate separate random samples of size 1000 
for each of two independent, Normally distributed random variables, X and 
Y. Their means and standard deviations are as follows: 


UX=34X = e oX=0.97x = 9.9 uY=1HY = 1 oY=1.20Y = 1.2 


Figure 6.5(a) shows the results. What do we know about the sum and 
difference of these two random variables? The histograms in Figure 6.5(b) 
came from adding and subtracting the corresponding values of X and Y for 
the 1000 randomly generated observations from each probability 
distribution. 


300 4 300 
y m- sum 200 
' 100 4 
" _ 100 
S 0 5 0 
> 30-4 Es 300 
E p 
200 4 = 200 
Y diff 
100 4 100 
i 
(a) 4 2 0 2 4 6 8 10 (b) 42024 6 8 10 


FIGURE 6.5 (a) Histograms showing the results of randomly selecting 1000 
values of two independent, Normal random variables X and Y. (b) Histograms of 
the sum and difference of the 1000 randomly selected values of X and Y. 
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As the simulation illustrates, any sum or difference of independent 
Normal random variables is also Normally distributed. The mean and 
standard deviation of the resulting Normal distribution can be found using 
the appropriate rules for means and standard deviations: 


Sum X + Y Difference X — Y 
Mean pX+Y=pX+pY=3+1=4 UX-Y=pX-pY=3-1=2 
Bxyy = px t+py =3+1=4 Hx-y = px —-py =3-1=2 
SD OX-Y2=0X2+0Y2=0.92+1.22=2.250X 


OX+Y2=0X2+0Y2=0.9241.22=2.250X+ Y=2 V52V5=1.5 
Oey = 0% +0; = 0.97 + 1.2? = 2.20% _) = 0%, + 0% = 0.97 +1.2? = 2.25 


Oxiy = /2.25 =: ich ox-Y = V'2.25 =1.5 


EXAMPLE | Will the lid fit? (») 


Combining normal random variables 


PROBLEM: The diameter C of the top of a randomly selected large 
drink cup at a fast-food restaurant follows a Normal distribution with a 
mean of 3.96 inches and a standard deviation of 0.01 inch. The diameter 
L of arandomly selected large lid at this restaurant follows a Normal 
distribution with mean 3.98 inches and standard deviation 0.02 inch. 
Assume that L and C are independent random variables. Let the random 
variable D = L — C be the difference between the lid’s diameter and the 
cup’s diameter. 


a. Describe the distribution of D. 


b. For a lid to fit on a cup, the value of L has to be bigger than the value 
of C, but not by more than 0.06 inch. Find the probability that a 
randomly selected lid will fit on a randomly selected cup. Interpret 
this value. 
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karandaev/ 
Getty Images 


SOLUTION: 


a. Shape: Normal 
Center:1)D=3.98-3.96=0.02 inchCenter > Pp = 3.98 — 3.96 = 0.02 inch 


Variability:oD=(0.02)2+(0.01)2=0.0224 inch 
nn 
Variability : op = 4/ (0.02)* + (0.01)* = 0.0224 inch 


D is the difference of two independent Normal random variables. 


uD=pL-pCHP = Mit — he 


aD=,/o7 +0 
oD=0L2+0C2 V er ate 


b. The lid will fit if 0<L-c<0.069 < L — C < 0.06 that is, if 0<D<0.06 


0.0472 -0.0248 -0.0024 0.02 0.0424 0.0648 0.0872 
D = Difference (Lid — Cup) in diameter (in.) 
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1. Draw a Normal distribution. 


2. Perform calculations—show your work! 


(i) Standardize and use Table A or technology; or 
(ii) Use technology without standardizing. 
Be sure to answer the question that was asked. 


i. z=0-0.020.0224=-0.89 z=0.06-0.020.0224=1.79 
Pees es ee 


0.022 0.0224 


co 


Using Table A: 0.9633 — 0.1867 = 0.7766 


Using technology: normalcdf(lower: —0.89, upper:1.79, mean:0, 
SD:1) = 0.7765 


P (0 < D< 0.06) = P (-0.89 < Z < 1.79) 


P(0<D < 0.06) =P (-0.89<Z < 1.79) 


ii. normalcdf(lower:0, upper:0.06, mean:0.02, SD:0.0224) = 0.7770 


There’s about a 78% chance that a randomly selected lid will fit 
on a randomly selected cup. 


FOR PRACTICE, TRY EXERCISE 65 


ee 


domin_domin/Getty Images 
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We can extend what we have learned about combining independent 
Normal random variables to settings that involve repeated observations 
from the same probability distribution. Consider this scenario. Mr. Starnes 
likes sugar in his hot tea. From experience, he needs between 8.5 and 9 
grams of sugar in a cup of tea for the drink to taste right. While making his 
tea one moming, Mr. Starnes adds four randomly selected packets of 
sugar. Suppose the amount of sugar in these packets follows a Normal 
distribution with mean 2.17 grams and standard deviation 0.08 gram. 
What’s the probability that Mr. Starnes’s tea tastes right? 


Let X = the amount of sugar in a randomly selected packet. Then X; = 
amount of sugar in Packet 1, X> = amount of sugar in Packet 2, X3 = 
amount of sugar in Packet 3, and X4 = amount of sugar in Packet 4. Each 


of these random variables has a Normal distribution with mean 2.17 grams 
and standard deviation 0.08 grams. We’re interested in the total amount of 
sugar that Mr. Starnes puts in his tea: T= X; + Xp + X3 + Xq. 


The random variable T is a sum of four independent Normal random 
variables. So T follows a Normal distribution with mean 


wT=pX1+pX2+uX3+pX4=.17+.17+.17+.17=8.68 grams 
er = bx, + Ux, + Mx, + ux, = 17+ .17+ .17+ .17 = 8.68 grams 


and variance 
o0T2=0X12+0X22+0X32+0X42=(0.08)2+(0.08)2+(0.08)2+ 
(0.08)2=0.0256 


Op = 0%, + 0%, + 0%, +0, = (0.08) + (0.08)* + (0.08)? + (0.08)” = 0.0256 


The standard deviation of T is 


oT=0.0256=0.16 gram?r = V 0.0256 = 0.16 gram 


We want to find the probability that the total amount of sugar in Mr. 
Starnes’s tea is between 8.5 and 9 grams. Figure 6.6 shows this probability 
as the area under a Normal curve. 


To find this area, we can use either of our two familiar methods: 
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=—g50°° 
T = Total amount of sugar (g) 


FIGURE 6.6 Normal distribution of the total amount of sugar in Mr. Starnes’s tea. 


(i) Standardize the boundary values and use Table A or technology: 


z=8.5—8.680.16=—1.13andz=9-8.680.16=2.00 


8.5—8.68 9—8.68 
Fe —_—_——"- — -—-l. d | —— 
z 0.16 1.13 and z 0.16 


Using Table A: P(-1.13 < Z < 2.00) = 0.9772 — 0.1292 = 0.8480 


Using technology: normalcdf(lower:—1.13, upper:2.00, mean:0, SD:1) = 
0.8480 


(ii) Use technology to find the desired area without standardizing. 


normalcdf(lower:8.5, upper:9, mean:8.68, SD:0.16) = 0.8470 


There’s about an 85% probability that Mr. Starnes’s tea will taste right. 


Section 6.2 Summary 


e Adding a positive constant a to (subtracting a from) a random variable 
increases (decreases) measures of center and location by a, but does not 
affect measures of variability (range, JQR, standard deviation) or the 
shape of its probability distribution. 


e Multiplying (dividing) a random variable by a positive constant b 
multiplies (divides) measures of center and location by b and multiplies 
(divides) measures of variability (range, IQR, standard deviation) by b, 
but does not change the shape of its probability distribution. 


e If X and Y are any two random variables, 
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uxXt+Y=yX+pYex+y = Hx + HY: The mean of the sum of two 
random variables is the sum of their means. 

uxX—-Y="X-pYHx-y = Hx ~ FY: The mean of the difference of two 
random variables is the difference of their means. 


e If X and Y are independent random variables, then knowing the value 
of one variable tells you nothing about the value of the other. In that 
case, variances add: 


2 2 2 
OX+Y2=0X2+0Y27X+¥ ~ 7X * PY: The variance of the sum of two 
independent random variables is the sum of their variances. 


2 2 2 
oX-Y2=0X2+0Y2°xX-Y ~ 7x * °Y: The variance of the difference 
of two independent random variables is the sum of their variances. 


e To get the standard deviation of the sum or difference of two 
independent random variables, calculate the variance and then take the 
square root: 


ae ae 
y= oxy-y — V/ Oy 1 Oy 


rr 
oX+Y=o0X-Y=o0X2+0Y2 * 


e The sum or difference of independent Normal random variables is a 
Normal random variable. 


Section 6.2 Exercises 


37. 


38. 


pg 384 |») Driving to work The time X it takes Hattan to drive to 


work on a randomly selected day follows a distribution that is 
approximately Normal with mean 15 minutes and standard deviation 
6.5 minutes. Once he parks his car in his reserved space, it takes 5 
more minutes for him to walk to his office. Let T = the total time it 
takes Hattan to reach his office on a randomly selected day, so T= X 
+ 5. Describe the shape, center, and variability of the probability 
distribution of T. 


Toy shop sales Total gross profits G on a randomly selected day at 
Tim’s Toys follow a distribution that is approximately Normal with 
mean $560 and standard deviation $185. The cost of renting and 
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maintaining the shop is $65 per day. Let P = profit on a randomly 
selected day, so P = G — 65. Describe the shape, center, and 
variability of the probability distribution of P. 


. Airline overbooking Airlines typically accept more reservations for a 
flight than the number of seats on the plane. Suppose that for a certain 
route, an airline accepts 40 reservations on a plane that carries 38 
passengers. Based on experience, the probability distribution of Y = 
the number of passengers who actually show up for a randomly 
selected flight is given in the following table. You can check that pY 
HY = 37.4 and oY%Y = 1.24. 


Number of passengers y; 35 36 37 38 39 40 
Probability p; 0.10 0.10 0.30 0.35 0.10 0.05 


There is also a crew of two flight attendants and two pilots on each 
flight. Let X = the total number of people (passengers plus crew) on a 
randomly selected flight. 


a. Make a graph of the probability distribution of X. Describe its 
shape. 


b. Find and interpret pX/* . 


c. Calculate and interpret oX°7% . 


. City parking Victoria parks her car at the same garage every time 
she goes to work. Because she stays at work for different lengths of 
time each day, the fee the parking garage charges on a randomly 
selected day is a random variable, G. The table gives the probability 
distribution of G. You can check that uGH@G = $14 and oG7G= $2.74. 


Garage fee g; $10 $13 $15 $20 
Probability p; 0.20 0.25 0.45 0.10 


In addition to the garage’s fee, the city charges a $3 use tax each time 
Victoria parks her car. Let T = the total amount of money she pays on 
a randomly selected day. 


a. Make a graph of the probability distribution of T. Describe its 
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shape. 
b. Find and interpret pTH7. 
c. Calculate and interpret oT 77. 


41. pg 385 |») Get on the boat! A small ferry runs every half hour 


from one side of a large river to the other. The number of cars X ona 
randomly chosen ferry trip has the probability distribution shown here 


with mean pX=3.87HX — 3.87 and standard deviation oX=1.29 
ox = 1.29 


0 1 Zz 3 4 5 
X = Number of cars 


The cost for the ferry trip is $5. Define M = money collected on a 
randomly selected ferry trip. 


a. What shape does the probability distribution of M have? 
b. Find the mean of M. 


c. Calculate the standard deviation of M. 


42. Skee Ball Ana is a dedicated Skee Ball player who always rolls for 
the 50-point slot. Ana’s score X on a randomly selected roll of the ball 
has the probability distribution shown here with mean pX*X = 23.8 
and standard deviation oX9X = 12.63. 


0.35 
0.30 
0.25 
0.20 
0.15 
0.10 
0.05 
0.00 


Probability 
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43. 


44. 


A player receives one ticket from the game for every 10 points 
scored. Define T = number of tickets Ana gets on a randomly selected 
roll. 


a. What shape does the probability distribution of T have? 
b. Find the mean of T. 


c. Calculate the standard deviation of T. 


Still waiting for the server? How does your web browser get a file 
from the Internet? Your computer sends a request for the file to a web 
server, and the web server sends back a response. Let Y = the amount 
of time (in seconds) after the start of an hour at which a randomly 
selected request is received by a particular web server. The 
probability distribution of Y can be modeled by a uniform density 
curve on the interval from 0 to 3600 seconds. Define the random 
variable W = Y/60. 


a. Explain what W represents. 


b. What probability distribution does W have? 


Where’s the bus? Sally takes the same bus to work every morning. 
Let X = the amount of time (in minutes) that she has to wait for the 
bus on a randomly selected day. The probability distribution of X can 
be modeled by a uniform density curve on the interval from 0 minutes 
to 8 minutes. Define the random variable V = 60X. 


a. Explain what V represents. 


b. What probability distribution does V have? 


Exercises 45 and 46 refer to the following setting. Ms. Hall gave her 
class a 10-question multiple-choice quiz. Let X = the number of 
questions that a randomly selected student in the class answered 
correctly. The computer output gives information about the 
probability distribution of X. To determine each student’s grade on 
the quiz (out of 100), Ms. Hall will multiply his or her number of 
correct answers by 5 and then add 50. Let G = the grade of a 
randomly chosen student in the class. 
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45. 


46. 


47. 


48. 


Mean Median StDev Min Max Q, Q3 
7.6 8.5 1,32 4 10 8 9 


Easy quiz 
a. Find the median of G. 


b. Find the interquartile range (IQR) of G. 


More easy quiz 
a. Find the mean of G. 


b. Find the range of G. 


pg 387 |») Too cool at the cabin? During the winter months, the 


temperatures at the Starneses’ Colorado cabin can stay well below 
freezing (32°F or 0°C) for weeks at a time. To prevent the pipes from 
freezing, Mrs. Starnes sets the thermostat at 50°F. She also buys a 
digital thermometer that records the indoor temperature each night at 
midnight. Unfortunately, the thermometer is programmed to measure 
the temperature in degrees Celsius. Based on several years’ worth of 
data, the temperature T in the cabin at midnight on a randomly 
selected night can be modeled by a Normal distribution with mean 
8.5°C and standard deviation 2.25°C. Let Y = the temperature in the 
cabin at midnight on a randomly selected night in degrees Fahrenheit 
(recall that F = (9/5)C + 32). 


a. Find the mean of Y. 
b. Calculate and interpret the standard deviation of Y. 


c. Find the probability that the midnight temperature in the cabin is 
less than 40°F. 


How much cereal? A company’s single-serving cereal boxes 
advertise 1.63 ounces of cereal. In fact, the amount of cereal X ina 
randomly selected box can be modeled by a Normal distribution with 
a mean of 1.70 ounces and a standard deviation of 0.03 ounce. Let Y 
= the excess amount of cereal beyond what’s advertised in a randomly 
selected box, measured in grams (1 ounce = 28.35 grams). 
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a. Find the mean of Y. 


b. Calculate and interpret the standard deviation of Y. 


c. Find the probability of getting at least 1 gram more cereal than 
advertised. 


49. pg 389 |») Community College costs E] Dorado Community 


50. 


ol. 


2. 


College has a main campus in the suburbs and a downtown campus. 
The amount X spent on tuition by a randomly selected student at the 
main campus has mean $732.50 and standard deviation $103. The 
amount Y spent on tuition by a randomly selected student at the 
downtown campus has mean $825 and standard deviation $126.50. 
Suppose we randomly select one full-time student from each of the 
two campuses. Calculate and interpret the mean of the sum S = X + Y. 


Essay errors Typographical and spelling errors can be either 
“nonword errors” or “word errors.” A nonword error is not a real 
word, as when “the” is typed as “teh.” A word error is a real word, 
but not the right word, as when “lose” is typed as “loose.” When 
students are asked to write a 250-word essay (without spell- 
checking), the number of nonword errors X in a randomly selected 
essay has mean 2.1 and standard deviation 1.136. The number of 
word errors Y in the essay has mean 1.0 and standard deviation 1.0. 
Calculate and interpret the mean of the sum S = X + Y. 


Study habits The Survey of Study Habits and Attitudes (SSHA) is a 
psychological test that measures academic motivation and study 
habits. The SSHA score F of a randomly selected female student at a 
large university has mean 120 and standard deviation 28, and the 
SSHA score M of a randomly selected male student at the university 
has mean 105 and standard deviation 35. Suppose we select one male 
student and one female student at random from this university and 
give them the SSHA test. Calculate and interpret the mean of the 
difference D = F — M in their scores. 


Commuting to work Sulé’s job is just a few bus stops away from his 
house. While it can be faster to take the bus to work than to walk, the 
travel time is more variable due to traffic. The commute time B if 
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Sulé takes the bus to work on a randomly selected day has mean 12 
minutes and standard deviation 4 minutes. The commute time W if 
Sulé walks to work on a randomly selected day has mean 16 minutes 
and standard deviation 1 minute. Calculate and interpret the mean of 
the difference D = B — W in the time it would take Sulé to get to work 
on a randomly selected day. 


Community college costs Refer to Exercise 49. At the main campus, 
full-time students pay $50 per unit. At the downtown campus, full- 
time students pay $55 per unit. Find the mean of the difference D 
(Main — Downtown) in the number of units that the two randomly 
selected students take. 


Essay scores Refer to Exercise 50. An English professor deducts 3 
points from a student’s essay score for each nonword error and 2 
points for each word error. Find the mean of the total score 
deductions T for a randomly selected essay. 


Rainy days Imagine that we randomly select a day from the past 10 
years. Let X be the recorded rainfall on this date at the airport in 
Orlando, Florida, and Y be the recorded rainfall on this date at Disney 
World just outside Orlando. Suppose that you know the means p XX 


and YY and the variances oX27x and oY2°Y of both variables. 


a. Can we calculate the mean of the total rainfall X + Y to be uX+pY 
Hx + HY? Explain your answer. 


b. Can we calculate the variance of the total rainfall to be oX2+0Y2 


: ‘ 
?x * °y? Explain your answer. 


His and her earnings Researchers randomly select a married couple 
in which both spouses are employed. Let X be the income of the 
husband and Y be the income of the wife. Suppose that you know the 


means pXX and pYFY and the variances oX27x and oY2°¥ of 
both variables. 


a. Can we calculate the mean of the total income X + Y to be py + 


Ly? Explain your answer. 
b. Can we calculate the variance of the total income to be oX2+o0 Y2 
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2 9 
7x * °y? Explain your answer. 


57. pg 392 |») Community college costs Refer to Exercise 49. Note 


38. 
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that X and Y are independent random variables because the two 
students are randomly selected from each of the campuses. Calculate 
and interpret the standard deviation of the sum S = X + Y. 


Essay errors Refer to Exercise 50. Assume that the number of non- 
word errors X and word errors Y in a randomly selected essay are 
independent random variables. Calculate and interpret the standard 
deviation of the sum S = X + Y. 


Study habits Refer to Exercise 51. 


a. Assume that F and M are independent random variables. Explain 
what this means in context. 


b. Calculate and interpret the standard deviation of the difference D = 


F — M in their scores. 


c. From the information given, can you find the probability that the 
randomly selected female student has a higher SSHA score than 
the randomly selected male student? Explain why or why not. 


Commuting to work Refer to Exercise 52. 


a. Assume that B and W are independent random variables. Explain 
what this means in context. 


b. Calculate and interpret the standard deviation of the difference D 


(Bus — Walk) in the time it would take Sulé to get to work on a 
randomly selected day. 


c. From the information given, can you find the probability that it 
will take Sulé longer to get to work on the bus than if he walks on 
a randomly selected day? Explain why or why not. 


Community college costs Refer to Exercise 49. Note that X and Y are 
independent random variables because the two students are randomly 
selected from each of the campuses. At the main campus, full-time 
students pay $50 per unit. At the downtown campus, full-time 
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students pay $55 per unit. Suppose we randomly select one full-time 
student from each of the two campuses. Find the standard deviation of 
the difference D (Main — Downtown) in the number of units that the 
two randomly selected students take. 


Essay scores Refer to Exercise 50. Assume that the number of 
nonword errors X and word errors Y in a randomly selected essay are 
independent random variables. An English professor deducts 3 points 
from a student’s essay score for each nonword error and 2 points for 
each word error. Find the standard deviation of the total score 
deductions T for a randomly selected essay. 


Exercises 63 and 64 refer to the following setting. In Exercise 17 of 
Section 6.1, we examined the probability distribution of the random 
variable X = the amount a life insurance company earns on a randomly 
chosen 5-year term life policy. Calculations reveal that uX=$303.35 
Hx =$303.35 and 6X=$9707.57.7x =$9707.57. 


63. 


64. 


Life insurance The risk of insuring one person’s life is reduced if we 
insure many people. Suppose that we randomly select two insured 21- 
year-old males, and that their ages at death are independent. If X; and 


Xz are the insurer’s income from the two insurance policies, the 
insurer’s average income W on the two policies is 
_ X4+Xo 
W=X1+X22.—o2 
Find the mean and standard deviation of W. (You see that the mean 


income is the same as for a single policy, but the standard deviation is 
less.) 


Life insurance If we randomly select four insured 21-year-old men, 
the insurer’s average income is 
X14 +X9+X3+KXyq 
V=X1+X2+X3+X44— (OK 
where X; is the income from insuring one man. Assuming that the 


amount of income earned on individual policies is independent, find 
the mean and standard deviation of V. (If you compare with the 
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results of Exercise 63, you should see that averaging over more 
insured individuals reduces risk.) 


pg 395 |») Time and motion A time-and-motion study measures 


the time required for an assembly-line worker to perform a repetitive 
task. The data show that the time X required to bring a part from a bin 
to its position on an automobile chassis follows a Normal distribution 
with mean 11 seconds and standard deviation 2 seconds. The time Y 
required to attach the part to the chassis follows a Normal distribution 
with mean 20 seconds and standard deviation 4 seconds. The study 
finds that the times required for the two steps are independent. 


a. Describe the distribution of the total time required for the entire 
operation of positioning and attaching a randomly selected part. 


b. Management’s goal is for the entire process to take less than 30 


seconds. Find the probability that this goal will be met for a 
randomly selected part. 


Ohm-my! The design of an electronic circuit for a toaster calls for a 
100-ohm resistor and a 250-ohm resistor connected in series so that 
their resistances add. The resistance X of a 100-ohm resistor in a 
randomly selected toaster follows a Normal distribution with mean 
100 ohms and standard deviation 2.5 ohms. The resistance Y of a 250- 
ohm resistor in a randomly selected toaster follows a Normal 
distribution with mean 250 ohms and standard deviation 2.8 ohms. 
The resistances X and Y are independent. 


a. Describe the distribution of the total resistance of the two 
components in series for a randomly selected toaster. 


b. Find the probability that the total resistance for a randomly 


selected toaster lies between 345 and 355 ohms. 


Yard work Lamar and Hareesh run a two-person lawn-care service. 
They have been caring for Mr. Johnson’s very large lawn for several 
years, and they have found that the time L it takes Lamar to mow the 
lawn on a randomly selected day is approximately Normally 
distributed with a mean of 105 minutes and a standard deviation of 10 
minutes. The time H it takes Hareesh to use the edger and string 


866 


68. 


69. 


70. 


trimmer on a randomly selected day is approximately Normally 
distributed with a mean of 98 minutes and a standard deviation of 15 
minutes. Assume that L and H are independent random variables. 
Find the probability that Lamar and Hareesh finish their jobs within 5 
minutes of each other on a randomly selected day. 


Hit the track Andrea and Barsha are middle-distance runners for 
their school’s track team. Andrea’s time A in the 400-meter race on a 
randomly selected day is approximately Normally distributed with a 
mean of 62 seconds and a standard deviation of 0.8 second. Barsha’s 
time B in the 400-meter race on a randomly selected day is 
approximately Normally distributed with a mean of 62.8 seconds and 
a standard deviation of 1 second. Assume that A and B are 
independent random variables. Find the probability that Barsha beats 
Ashley in the 400-meter race on a randomly selected day. 


Swim team Hanover High School has the best women’s swimming 
team in the region. The 400-meter freestyle relay team is undefeated 
this year. In the 400-meter freestyle relay, each swimmer swims 100 
meters. The times, in seconds, for the four swimmers this season are 
approximately Normally distributed with means and standard 
deviations as shown. Assume that the swimmer’s individual times are 
independent. Find the probability that the total team time in the 400- 
meter freestyle relay for a randomly selected race is less than 220 


seconds. 
Swimmer Mean StDev 
Wendy B52 2.8 
Jill 58.0 3.0 
Carmen 56.3 2.6 


Latrice 54.7 2.7 


Toothpaste Ken is traveling for his business. He has a new 0.85- 
ounce tube of toothpaste that’s supposed to last him the whole trip. 
The amount of toothpaste Ken squeezes out of the tube each time he 
brushes is independent, and can be modeled by a Normal distribution 
with mean 0.13 ounce and standard deviation 0.02 ounce. If Ken 
brushes his teeth six times on a randomly selected trip, what’s the 
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probability that he’ll use all the toothpaste in the tube? 


Auto emissions The amount of nitrogen oxides (NOX) present in the 
exhaust of a particular model of old car varies from car to car 
according to a Normal distribution with mean 1.4 grams per mile 
(g/mi) and standard deviation 0.3 g/mi. Two randomly selected cars 
of this model are tested. One has 1.1 g/mi of NOX; the other has 1.9 
g/mi. The test station attendant finds this difference in emissions 
between two similar cars surprising. If the NOX levels for two 
randomly chosen cars of this type are independent, find the 
probability that the difference is greater than 0.8 or less than —0.8. 


Loser buys the pizza Leona and Fred are friendly competitors in 
high school. Both are about to take the ACT college entrance 
examination. They agree that if one of them scores 5 or more points 
better than the other, the loser will buy the winner a pizza. Suppose 
that, in fact, Fred and Leona have equal ability so that each score on a 
randomly selected test varies Normally with mean 24 and standard 
deviation 2. (The variation is due to luck in guessing and the accident 
of the specific questions being familiar to the student.) The two scores 
are independent. What is the probability that the scores differ by 5 or 
more points in either direction? 


Multiple Choice Select the best answer for Exercises 73 and 74, which 
refer to the following setting. 


The number of calories in a 1-ounce serving of a certain breakfast cereal is 
a random variable with mean 110 and standard deviation 10. The number 
of calories in a cup of whole milk is a random variable with mean 140 and 
standard deviation 12. For breakfast, you eat 1 ounce of the cereal with 1/2 
cup of whole milk. Let T be the random variable that represents the total 
number of calories in this breakfast. 


73. The mean of T is 


a. 110. 
b. 140. 
c. 180. 


868 


d. 195. 


e. 


250. 


74. The standard deviation of T is 


a. 
b. 
C. 
d. 


e. 


22, 


Recycle and Review 


75. Fluoride varnish (4.2) In an experiment to measure the effect of 
fluoride “varnish” on the incidence of tooth cavities, thirty-four 10- 
year-old girls whose parents volunteered them for the study were 
randomly assigned to two groups. One group was given fluoride 
varnish annually for 4 years, along with standard dental hygiene; the 
other group followed only the standard dental hygiene regimen. The 
mean number of cavities in the two groups was compared at the end 
of the 4 years. 


76. 


a. 


Cc. 


Are the participants in this experiment subject to the placebo 
effect? Explain. 


Describe how you could alter this experiment to make it double- 
blind. 


Explain the purpose of the random assignment in this experiment. 


Buying stock (5.3, 6.1) You purchase a hot stock for $1000. The 
stock either gains 30% or loses 25% each day, each with probability 
0.5. Its returns on consecutive days are independent of each other. 
You plan to sell the stock after two days. 


a. 


What are the possible values of the stock after two days, and what 
is the probability for each value? What is the probability that the 
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stock is worth more after two days than the $1000 you paid for it? 


b. What is the mean value of the stock after two days? 


Comment: You see that these two criteria give different answers to 
the question “Should I invest?” 
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SECTION 6.3 Binomial and Geometric Random 
Variables 


LEARNING TARGETS By the end of the section, you should be able to: 
Determine whether the conditions for a binomial setting are met. 


Calculate and interpret probabilities involving binomial 
distributions. 


Calculate the mean and standard deviation of a binomial random 
variable. Interpret these values. 


When appropriate, use the Normal approximation to the binomial 
distribution to calculate probabilities.* 


Find probabilities involving geometric random variables. 


When the same chance process is repeated several times, we are often 
interested in whether a particular outcome does or doesn’t happen on each 
trial. Here are some examples: 


e To test whether someone has extrasensory perception (ESP), choose one 
of four cards at random—a star, wave, cross, or circle. Ask the person to 
identify the card without seeing it. Do this a total of 50 times and see 
how many cards the person identifies correctly. Chance process: choose 
a card at random. Outcome of interest: person identifies card correctly. 
Random variable: X = number of correct identifications. 


e A shipping company claims that 90% of its shipments arrive on time. To 
test this claim, take a random sample of 100 shipments made by the 
company last month and see how many arrived on time. Chance 
process: randomly select a shipment and check when it arrived. 
Outcome of interest: arrived on time. Random variable: Y = number of 
on-time shipments. 


e In the game of Pass the Pigs, a player rolls a pair of pig-shaped dice. On 
each roll, the player earns points according to how the pigs land. If the 
player gets a “pig out,” in which the two pigs land on opposite sides, she 
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loses all points earned in that round and must pass the pigs to the next 
player. A player can choose to stop rolling at any point during her turn 
and to keep the points that she has earned before passing the pigs. 
Chance process: roll the pig dice. Outcome of interest: pig out. Random 
variable: T = number of rolls it takes the player to pig out. 


Alex Clark/Alamy 


Some random variables, like X and Y in the first two bullets, count the 
number of times the outcome of interest occurs in a fixed number of trials. 
They are called binomial random variables. Other random variables, like T 
in the Pass the Pigs setting, count the number of trials of the chance 
process it takes for the outcome of interest to occur. They are known as 
geometric random variables. These two special types of discrete random 
variables are the focus of this section. 


Binomial Settings and Binomial Random Variables 


Let’s start with an activity that involves repeating a chance process several 
times. 


ACTIVITY Pop quiz! 
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It’s time for a pop quiz! We hope you are ready. The quiz consists of 10 
multiple-choice questions. Each question has five answer choices, 
labeled A through E. Now for the bad news: you will not get to see the 
questions. You just have to guess the answer for each one! 


1. Get out a blank sheet of paper. Write your name at the top. Number 
your paper from 1 to 10. Then guess the answer to each question: A, 
B, C, D, or E. Do not look at anyone else’s paper! You have 2 
minutes. 


2. Now it’s time to grade the quizzes. Exchange papers with a 
classmate. Your teacher will display the answer key. The correct 
answer for each of the 10 questions was determined randomly so that 
A, B, C, D, or E was equally likely to be chosen. 


3. How did you do on your quiz? Make a class dotplot that shows the 
number of correct answers for each student in your class. As a class, 
describe what you see. 


In the “Pop quiz” activity, each student is performing repeated trials of 
the same chance process: guessing the answer to a multiple-choice 
question. We’re interested in the number of times that a specific event 
occurs: getting a correct answer (which we’ll call a “success”). Knowing 
the outcome of one question (right or wrong guess) tells us nothing about 
the outcome of any other question. That is, the trials are independent. The 
number of trials is fixed in advance: n = 10. And a student’s probability of 
getting a “success” is the same on each trial: p = 0.2. When these 
conditions are met, we have a binomial setting. 


DEFINITION Binomial setting 


A binomial setting arises when we perform n independent trials of 
the same chance process and count the number of times that a 
particular outcome (called a “success”) occurs. 


The four conditions for a binomial setting are 


e Binary? The possible outcomes of each trial can be classified as 
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“success” or “failure.” 


e Independent? Trials must be independent. That is, knowing the 
outcome of one trial must not tell us anything about the outcome 
of any other trial. 


e Number? The number of trials n of the chance process must be 
fixed in advance. 


e Same probability? There is the same probability of success p on 
each trial. 


The boldface letters in the definition box give you a helpful way to 
remember the conditions for a binomial setting: just check the BINS! 

When checking the binary condition, note that there can be more than 
two possible outcomes per trial—in the “Pop Quiz” Activity, each 
question (trial) had five possible answer choices: A, B, C, D, or E. If we 
define “success” as guessing the correct answer to a question, then 
“failure” occurs when the student guesses any of the four incorrect answer 
choices. 


EXAMPLE | From blood types to aces >» 
Identifying binomial settings 


PROBLEM: Determine whether the given scenario describes a 
binomial setting. Justify your answer. 


a. Genetics says that the genes children receive from their parents are 
independent from one child to another. Each child of a particular set 
of parents has probability 0.25 of having type O blood. Suppose these 
parents have 5 children. Count the number of children with type O 
blood. 


b. Shuffle a standard deck of 52 playing cards. Turn over the first 10 
cards, one at a time. Record the number of aces you observe. 


c. Shuffle a deck of cards. Turn over the top card. Put the card back in 
the deck, and shuffle again. Repeat this process until you get an ace. 
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Count the number of cards you had to turn over. 
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SOLUTION: 
a. * Binary? “Success” = has type O blood. “Failure” = doesn’t have 
type O blood. 


¢ Independent? Knowing one child’s blood type tells you nothing 
about another child’s because they inherit genes independently 
from their parents. 


¢ Number? n= 5 
¢ Same probability? p = 0.25 


This is a binomial setting. 


Check the BINS! A trial consists of observing the blood type for one 
of these parents’ children. 


All the conditions are met and we are counting the number of 
successes (children with type O blood). 


b. ¢ Binary? “Success” = get an ace. “Failure” = don’t get an ace. 


* Independent? No. If the first card you turn over is an ace, then the 
next card is less likely to be an ace because you’re not replacing 
the top card in the deck. If the first card isn’t an ace, the second 
card is more likely to be an ace. 

This is not a binomial setting because the independent condition is 

not met. 
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Check the BINS! A trial consists of turning over a card from the 
deck and observing what’s on the card. 


To check for independence, you could also write P(2nd card ace | 1st 
card ace) = 3/51 and P(2nd card ace | 1st card not ace) = 4/51 
Because the two probabilities are not equal, the trials are not 
independent. 


c. ¢ Binary? “Success” = get an ace. “Failure” = don’t get an ace. 


* Independent? Yes. Because you are replacing the card in the deck 
and shuffling each time, the result of one trial doesn’t tell you 
anything about the outcome of any other trial. 

¢ Number? No. The number of trials is not fixed in advance. 

Because there is no fixed number of trials, this is not a binomial 

setting. 


Check the BINS! A trial consists of turning over a card from the 
shuffled deck of 52 cards and observing what’s on the card. 


There’s another clue that this is not a binomial setting: you’re 


counting the number of trials to get a success and not the number of 


successes in a fixed number of trials. 


FOR PRACTICE, TRY EXERCISE 77 


The Independent condition involves conditional probabilities. In part 
(b) of the example, 


P(2nd card ace | 1st card ace) = 3/51 # P(Q2nd card ace | 1st card not ace) = 
4/51 


so the trials are not independent. The Same probability of success 
condition is about unconditional probabilities. Because 
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P(kth card in a shuffled deck is an ace)=4/52 

P(kth card in a shuffled deck is an ace) = 4/52 
this condition is met in part (b) of the example. Be sure you understand the 
difference between these two conditions. When sampling is done without 
replacement, the Independent condition is violated. 

The blood type scenario in part (a) of the example is a binomial setting. 

If we let X = the number of children with type O blood, then X is a 
binomial random variable. The probability distribution of X is called a 
binomial distribution. 


DEFINITION Binomial random variable, Binomial 
distribution 


The count of successes X in a binomial setting is a binomial random 
variable. The possible values of X are 0, 1, 2, ..., n. 


The probability distribution of X is a binomial distribution. Any 
binomial distribution is completely specified by two numbers: the 
number of trials n of the chance process and the probability p of 
success on each trial. 


In the “Pop quiz” activity at the beginning of the lesson, X = the 
number of correct answers is a binomial random variable with n = 10 and 
p= 0.2. 


CHECK YOUR UNDERSTANDING 


‘w 


y For each of the following situations, determine whether or not the 
given random variable has a binomial distribution. Justify your answer. 


1. Shuffle a deck of cards. Turn over the top card. Put the card back in 
the deck, and shuffle again. Repeat this process 10 times. Let X = the 
number of aces you observe. 
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2. Choose 5 students at random from your class. Let Y = the number 
who are over 6 feet tall. 


3. Rolla fair die 100 times. Sometime during the 100 rolls, one corner 
of the die chips off. Let W = the number of 5s you roll. 


Calculating Binomial Probabilities 


How can we calculate probabilities involving binomial random variables? 
Let’s return to the scenario from part (a) of the preceding example: 


Genetics says that the genes children receive from their parents are 
independent from one child to another. Each child of a particular set of 
parents has probability 0.25 of having type O blood. Suppose these 
parents have 5 children. Count the number of children with type O 
blood. 


In this binomial setting, a child with type O blood is a “success” (S) and a 
child with another blood type is a “failure” (F). The count X of children 
with type O blood is a binomial random variable with n = 5 trials and 
probability p = 0.25 of success on each trial. 


e What’s P(X = 0)? That is, what’s the probability that none of the 5 
children has type O blood? The probability that any one of this couple’s 
children doesn’t have type O blood is 1 — 0.25 = 0.75 (complement 
rule). By the multiplication rule for independent events (Section 5.3), 


P(X = 0) = P(FFFFF) = (0.75)(0.75)(0.75)(0.75)(0.75) = (0.75)5 = 0.2373 
P(X = 0) = P(FFFFF) = (0.75) (0.75) (0.75) (0.75) (0.75) = (0.75)° = 0.2373 
e How about P(X = 1)? There are several different ways in which exactly 
1 of the 5 children could have type O blood. For instance, the first child 
born might have type O blood, while the remaining 4 children don’t 


have type O blood. The probability that this happens is 


P(SFFFF)=(0.25)(0.75)(0.75)(0.75)(0.75)=(0.25)1(0.75)4 
P (SFFFF) = (0.25) (0.75) (0.75) (0.75) (0.75) = (0.25)'(0.75)* 


Alternatively, Child 2 could be the one that has type O blood. The 
corresponding probability is 
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P(FSFFF)=(0.75)(0.25)(0.75)(0.75)(0.75)=(0.25)1(0.75)4 
P (FSFFF) = (0.75) (0.25) (0.75) (0.75) (0.75) = (0.25)' (0.75)* 


There are three more possibilities to consider—those in which Child 3, 
Child 4, and Child 5 are the only ones to inherit type O blood. Of course, 
the probability will be the same for each of those cases. In all, there are 
five different ways in which exactly 1 child would have type O blood, each 
with the same probability of occurring. As a result, 
P(X=1)=P(exactly 1 child with type O blood) 
P(X = 1) = P(exactly 1 child with type O blood) 
__55(0.25)(0.75)$ = 0.3955 


(4 children don’t have | 
type O blood 


( 4 child with ) 
| type O blood 
S 


(Number of ways to get 1 child) 
| out of 5 with type O blood 


The pattern of this calculation works for any binomial probability: 


P(X = k) = (# of ways to get k successes in n trials) (success probability)* 
(failure probability)‘ 


To use this formula, we must count the number of arrangements of k 
successes in n trials. This number is called the binomial coefficient. We 
use the following fact to do the counting without actually listing all the 
arrangements. 


DEFINITION Binomial coefficient 


The number of ways to arrange k successes among n trials is given by 
the binomial coefficient 


n n! 
(nk)=n!k!(n-k)! ( k kl(n—k)! 
for k= 0, 1, 2,...,n where n! (read as "n factorial") is given by 
n !=n(n—1)(n-2) -~ -(3)(2)(1)™! = n(n —1)(n — 2)---+ - (3) (2) (1) 
and 0! = 1. 


The larger of the two factorials in the denominator of a binomial 
coefficient will cancel much of the n! in the numerator. For example, the 
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binomial coefficient we need to find the probability that exactly 2 of the 
couple’s 5 children inherit type O blood is 
(52)=S!2!3!=S)()3)(DYOYDAOYVSV(Z)D=(S)(A(2))=10 
aL (GM(Z)(Z)()_ wa) _ 
2 


23! aan(Z)(Z)(Z) QO 


) 
oOo The binomial coefficient (52) ( 2/ is not related to the fraction 


522. A helpful way to remember its meaning is to read it as “5 choose 
2”—as in, how many ways are there to choose which 2 children have type 
O blood in a family with 5 children? Binomial coefficients have many 
uses, but we are interested in them only as an aid to finding binomial 
probabilities. If you need to compute a binomial coefficient, use your 
calculator. 


5 


Some people prefer the notation <C> instead of (52) ( - for the binomial 


13. Technology Corner | CALCULATING BINOMIAL 
COEFFICIENTS |» 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 
5 


To calculate a binomial coefficient like (52) ( . on the TI-83/84, 
proceed as follows: 


e Type 5, press MATH, arrow over to PROB, choose nCr, and press 


ENTER|. Then type 2 and press |ENTER| again to execute the 


command 5 nCr 2 (which displays as 5C> on devices with pretty 


print). 
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The binomial coefficient (nk) ( k counts the number of different 
ways in which k successes can be arranged among n trials. The binomial 
probability P(X = k) is this count multiplied by the probability of any one 
specific arrangement of the k successes. 


BINOMIAL PROBABILITY FORMULA 


Suppose that X is a binomial random variable with n trials and 
probability p of success on each trial. The probability of getting exactly 
k successes in n trials (k = 0, 1, 2, ..., n) is 


P(X 
P(X=k)=(nk)pk(1-p)n-k 


where 


(nk)=n!k!(n—-k)! ( k 


With our formula in hand, we can now calculate any binomial 
probability. 


EXAMPLE | Inheriting blood type (>) 


Calculating a binomial probability 
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PROBLEM: Genetics says that the genes children receive from their 
parents are independent from one child to another. Each child of a 
particular set of parents has probability 0.25 of having type O blood. 
Suppose these parents have 5 children. Let X = the number of children 
with type O blood. Find P(X = 3). Interpret this value. 


SOLUTION: 


X is a binomial random variable with n = 5 and p = 0.25. 
P(X = 3)=(35) (0.25)3(0.75)2=10 (0.25)3(0.75)2=0.08789 
P(X=8) = (8) (0.25)°(0.75)° 
10 (0.25)° (0.75)? 
= 0.08789 


There is about a 9% probability that exactly 3 of the 5 children have 
type O blood. 


FOR PRACTICE, TRY EXERCISE 81 


There are times when we want to calculate a probability involving 
more than one value of a binomial random variable. As the following 
example illustrates, we can just use the binomial probability formula for 
each value of interest. 


EXAMPLE | Inheriting blood type (>) 


Calculating a cumulative binomial probability 


PROBLEM: The preceding example tells us that each child of a 
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particular set of parents has probability 0.25 of having type O blood. 
Suppose these parents have 5 children. Should the parents be surprised 
if more than 3 of their children have type O blood? Calculate an 
appropriate probability to support your answer. 


SOLUTION: 


Let X = the number of children with type O blood. X has a binomial 
distribution with n = 5 and p = 0.25. 
P(X>3)=P(X=4)+P(X=5)=(45)(0.25)4(0.75)1+(55) 
(0.25)5(0.75)0=5 (0. 25)4 (0.75)1 + 1(0.25)5 (0.75)0=0.01465+0.00098=0.01563 
P(X>3) = P(X =4)+P(X=5) 

= (5) (0.25)*(0.75)' + (2) (0.25)? (0.75)° 

= 5(0.25)* (0.75)! + 1(0.25)° (0.75)° 

= 0.01465 + 0.00098 

= 0.01563 


Because there’s only about a 1.5% probability of having more than 3 
children with type O blood, the parents should definitely be surprised if 
this happens. 


FOR PRACTICE, TRY EXERCISE 85 
We can also use the calculator’s binompdf and binomcdf commands to 


perform the calculations in the previous two examples. The following 
Technology Corner shows how to do it. 


14. Technology Corner |CALCULATING BINOMIAL 
PROBABILITIES |p) 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/ps6e. 


There are two handy commands on the TI-83/84 for finding binomial 
probabilities: binompdf and binomcdf. The inputs for both commands 
are the number of trials n, the success probability p, and the values of 
interest for the binomial random variable X. 
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binompdf (n,p,k) computesP(X=k)binomcdf (n,p,k) computesP(X<k) 
binompdf (n,p,k) computesP(X=k) 


binomcdf (n,p,k) computesP(X < k) 


Let’s use these commands to confirm our answers in the previous two 
examples. 


1. Find P(X = 3). 


e Press (DISTR) and choose binompdf(. 


OS 2.55 or later: In the dialog box, enter these values: trials:5, 
p:0.25, x value:3, choose Paste, and then press [ENTER|. 


Older OS: Complete the command binompdf (5,0.25,3) and press 


[ENTER] 


These results agree with our previous answer using the binomial 
probability formula: 0.08789. 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


binompdf(5,.25,3) 
@.987890625 


2. Should the parents be surprised if more than 3 of their children have 
type O blood? To find P(X = 3), use the complement rule: 


P(X>3)=1-P(X<3)=1-binomcdf(5,0.25,3) 
P(X > 3) =1-— P(X < 3) = 1 — binomcdf (5,0.25,3) 


e Press (DISTR) and choose binomcdf(. 


OS 2.55 or later: In the dialog box, enter these values: trials:5, 
p:0.25, x value:3, choose Paste, and then press ENTER. Subtract 
this result from 1 to get the answer. 


Older OS: Complete the command binomcdf(5,0.25,3) and press 
[ENTER |. Subtract this result from 1 to get the answer. 
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binomedf(S,.25,3) 
@.984375 


This result agrees with our previous answer using the binomial 
probability formula: 0.01563. 


We could also have done the calculation for part (b) as P(X = 3) = P(X 
= 4) + P(X = 5) = binompdf(5,0.25,4) + binompdf(5,0.25,5) = 0.01465 
+ 0.00098 = 0.01563. 


Note the use of the complement rule to find P(X = 3) in the 
Technology Corner: P(X = 3) = 1 — P(X < 3). This is necessary because the 
calculator’s binomcdf(n,p,k) command only computes the probability of 
getting k or fewer successes in n trials. Students often have trouble 
identifying the correct third input for the binomcdf command when a 
question asks them to find the probability of getting less than, more than, 
or at least so many successes. 


Here’s a helpful tip to avoid making such a mistake: write out the 
possible values of the variable, circle the ones you want to find the 
probability of, and cross out the rest. In the preceding example, X can take 
values from 0 to 5 and we want to find P(X = 3): 


0% 23245 


Crossing out the values from 0 to 3 shows why the correct calculation is 
1-P(X<3),1 — P(X < 3)., 


AP® EXAM TIP 


Don’t rely on “calculator speak” when showing your work on free 
response questions. Writing binompdf(5,0.25,3) = 0.08789 will not earn 
you full credit for a binomial probability calculation. At the very least, 
you must indicate what each of those calculator inputs represents. For 
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example, “binompdf(trials:5,p:0.25,x value:3) = 0.08789.” 


Take another look at the solutions in the two blood-type examples. The 
structure is much like the one we used when doing Normal calculations. 
Here is a summary box that describes the process. 


HOW TO FIND BINOMIAL PROBABILITIES 


Step 1: State the distribution and the values of interest. Specify a 
binomial distribution with the number of trials n, success probability p, 
and the values of the variable clearly identified. 

Step 2: Perform calculations—show your work! Do one of the 
following: 

i. Use the binomial probability formula to find the desired probability; 


or 
ii. Use the binompdf or binomcdf command and label each of the 
inputs. 
Be sure to answer the question that was asked. 


Here’s an example that shows the method at work. 


EXAMPLE | Free lunch? (») 


Calculating binomial probabilities 


eldadcarin/Getty Images 


PROBLEM: A local fast-food restaurant is running a “Draw a three, 
get it free” lunch promotion. After each customer orders, a touchscreen 
display shows the message “Press here to win a free lunch.” A computer 
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program then simulates one card being drawn from a standard deck. If 
the chosen card is a 3, the customer’s order is free. Otherwise, the 
customer must pay the bill. 


a. On the first day of the promotion, 250 customers place lunch orders. 
Find the probability that fewer than 10 of them win a free lunch. 


b. In fact, only 9 customers won a free lunch. Does this result give 
convincing evidence that the computer program is flawed? 


SOLUTION: 


a. Let Y = the number of customers who win a free lunch. Y has a 
binomial distribution with n = 250 and p = 4/52. 
P (Y < 10)=P (Y<9)= binomcdf (trials: 250, p:4/52, x value: 9)= 0.00613 
Py <10)=F t¥ = 9) 
= binomcedf (trials: 250, p:4/52, x value: 9) 
= 0.00613 


b. There is only a 0.006 probability that fewer than 10 customers would 
win a free lunch if the computer program is working properly. 
Because only 9 customers won a free lunch on this day, we have 
convincing evidence that the computer program is flawed. 


Step 1: State the distribution and the values of interest. 


The values of Y that interest us are 

Step 2: Perform calculations—show your work! 

i. Use the binomial probability formula to find the desired probability; 
or 

ii. use the binompdf or binomcdf command and label each of the inputs. 

To use the binomial formula, you would have to add the probabilities for 

Y=0,1,... , 9. That’s too much work! 


FOR PRACTICE, TRY EXERCISE 89 


CHECK YOUR UNDERSTANDING 
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y To introduce his class to binomial distributions, Mr. Miller does the 
“Pop quiz” activity at the beginning of this section (page 403). Each 
student in the class guesses an answer from A through E on each of the 
10 multiple-choice questions. Mr. Miller determines the “correct” 
answer for each of the 10 questions randomly so that A, B, C, D, or E 
was equally likely to be chosen. Hannah is one of the students in this 
class. Let X = the number of questions that Hannah answers correctly. 


1. What probability distribution does X have? Justify your answer. 


2. Use the binomial probability formula to find P(X = 3). Interpret this 
result. 

3. To get a passing score on the quiz, a student must answer at least 6 
questions correctly. Would you be surprised if Hannah earned a 
passing score? Calculate an appropriate probability to support your 
answer. 


Describing a Binomial Distribution: Shape, Center, and 
Variability 


What does the probability distribution of a binomial random variable look 
like? The table shows the possible values and corresponding probabilities 
for X = the number of children with type O blood from two previous 
examples. This is a binomial random variable with n = 5 and p = 0.25. 


Value x ; 0 1 2 3 4 5 
Probability 0.23730 0.39551 0.26367 0.08789 0.01465 0.00098 
Pi 


Figure 6.7 shows a histogram of the probability distribution. This 
binomial distribution with n = 5 and p = 0.25 has a clear right-skewed 
shape. Why? Because the probability that any one of the couple’s children 
inherits type O blood is 0.25, it’s quite likely that 0, 1, or 2 of the children 
will have type O blood. Larger values of X are much less likely. 
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FIGURE 6.7 Histogram showing the probability distribution of the binomial 
random variable X = number of children with type O blood in a family with 5 
children. 


You can use technology to graph a binomial probability distribution 
like the one shown in Figure 6.7. 


15. Technology Corner |GRAPHING BINOMIAL 
PROBABILITY DISTRIBUTIONS (>) 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 
To graph the binomial probability distribution for n = 5 and p = 0.25: 


e Type the possible values of the random variable X into list Lj: 0, 1, 2, 
3, 4, and 5. 

e Highlight Ly with your cursor. Enter the command binompdf(5,0.25) 
and press [ENTER]. 

e Make a histogram of the probability distribution using the method 
shown in Technology Corner 12 (page 370). 
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Ploti:LasL2 


Lav=, 2373046875 


889 


Po 


The binomial distribution with n = 5 and p = 0.25 is skewed to the 
right. Figure 6.8 shows two more binomial distributions with different 
shapes. The binomial distribution with n = 5 and p = 0.51 in Figure 6.8(a) 
is roughly symmetric. The binomial distribution with n = 5 and p = 0.8 in 
Figure 6.8(b) is skewed to the left. In general, when n is small, the 
probability distribution of a binomial random variable will be roughly 
symmetric if p is close to 0.5, right-skewed if p is much less than 0.5, and 
left-skewed if p is much greater than 0.5. 


n=5,p=0.51 n=5,p =0.8 

0.40 7 0.40 
& 0.30 4 = 0.30 
= = 
BS 02075 = 0.20 
2 2 
7 i 

0.10 7 0.10 

0.00 0.00 

0) l 2 3 4 5 0 1 2 3 4 5 

(a) xX (b) x 


FIGURE 6.8 (a) Probability histogram for the binomial random variable X with n 
= 5 and p = 0.51. This binomial distribution is roughly symmetric. (b) Probability 
histogram for the binomial random variable X with n = 5 and p = 0.8. This 
binomial distribution has a left-skewed shape. 


EXAMPLE | Bottled water versus tap water » 


Describing a binomial distribution 


PROBLEM: Mr. Hogarth’s AP® Statistics class did the activity on 
page 360. There were 21 students in the class. If we assume that the 
students in his class could not tell tap water from bottled water, then 
each one is guessing, with a 1/3 probability of being correct. Let X = the 
number of students who correctly identify the cup containing bottled 
water. Here is a histogram of the probability distribution of X: 


a. What probability distribution does X have? Justify your answer. 


b. Describe the shape of the probability distribution. 
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SOLUTION: 


a. A trial consists of a student in the class trying to guess which of three 
cups contained bottled water. 


Check the BINS! 


¢ Binary? Success = correct guess; failure = incorrect guess 


¢ Independent? Knowing whether one student guessed correctly does 
not help us predict whether another student guessed correctly. 


¢ Number? n = 21 


¢ Same probability? p = 1/3 


X has a binomial distribution with n = 21 and p = 1/3. 


b. The probability distribution of X looks roughly symmetric with a 
single peak at X = 7. 


You could also say that the graph is slightly right-skewed due to the 


long tail that extends out to X = 21. 


FOR PRACTICE, TRY EXERCISE 91 


MEAN AND STANDARD DEVIATION OF A BINOMIAL 
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RANDOM VARIABLE The random variable X = the number of children 
with type O blood from the previous two examples has a binomial 
distribution with n= 5 and p = 0.25. Its probability distribution is shown in 
the table. 


Value x; 0 1 2 3 4 5 
Probability p; 0.23730 0.39551 0.26367 0.08789 0.01465 0.00098 


Because X is a discrete random variable, we can calculate its mean 
using the formula 
UX=E(X)=> xipi= x1p1+x2p2+x3p3+... 
ux = E(X) =) ap; = 21p1 + z2p2 + taps +... 


from Section 6.1. We get 


uX=(0)(0.23730)+(1)(0.39551)+-+:+(5)(0.00098)=1.25 
wx = (0) (0.23730) + (1) (0.39551) + - + (5) (0.00098) = 1.25 


So the expected number of children with type O blood in families like this 
one with 5 children is 1.25. 

Did you think about why the mean is pty = 1.25? Because each child 
has a 0.25 chance of inheriting type O blood, we’d expect one-fourth of 
the 5 children to have this blood type. In other words, 


uX=5(0.25)=1.254x = 5(0.25) = 1.25 


This method can be used to find the mean of any binomial random 
variable. 


MEAN (EXPEXCTED VALUE) OF A BINOMIAL 
RANDOM VARIABLE 


If a count X of successes has a binomial distribution with n\umber of 
trials n and probability of success p, the mean (expected value) of X is 


UX=E(X)=npHx = £(X) = np 


To calculate the standard deviation of X, we start by finding the 
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variance. 


oX2=Y (xi-pX)2pi=(0-1.25)2(0.23730)+(1-1.25)2(0.39551)+++++ 
(5-1.25)2(0.00098)=0.9375 
a = > (ai —4x)’pi 
= (0 — 1.25) (0.23730) + (1 — 1.25)? (0.39551) + --- + (5 — 1.25)? (0.00098) 
= 0.9375 


So the standard deviation of X is 


o0X=0.9375=0.9687x = V0.9375 = 0.968 


The number of children with type O blood will typically vary by about 
0.968 from the mean of 1.25 in families like this one with 5 children. 


There is a simple formula for the standard deviation of a binomial 
random variable, but it isn’t easy to explain (see the Think About It on 
page 417). For our family with n = 5 children and p = 0.25 of type O 
blood, the variance of X is 


5(0.25)(0.75)=0.9375° (0.25) (0.75) = 0.9375 


To get the standard deviation, we just take the square root: 


oX=5(0.25)(0.75)=0.9375=0.968 
ox = ,/5(0.25) (0.75) = 0.9375 = 0.968 


This method works for any binomial random variable. 


STANDARD DEVIATION OF A BINOMIAL RANDOM 
VARIABLE 


If a count X of successes has a binomial distribution with number of 
trials n and probability of success p, the standard deviation of X is 


oX=np(1-p)?* — mp (1 _ p) 


oo Remember that these formulas for the mean and standard 


deviation work only for binomial distributions. 
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EXAMPLE | Bottled water versus tap water » 


Describing a binomial distribution 


PROBLEM: Assume that each of the 21 students in Mr. Hogarth’s 
AP® Statistics class who did the bottled water versus tap water activity 
was just guessing, so there was a 1/3 chance of each student identifying 
the cup containing bottled water correctly. Let X = the number of 
students who make a correct identification. At right is a histogram of the 
probability distribution of X. 


a. Calculate and interpret the mean of X. 


b. Calculate and interpret the standard deviation of X. 


—re 


Gerald Bernard/Shutterstock 


0.2 


Probability 
S 


0.0 
0123 45 67 8 9 101112131415 161718192021 
X = Number of correct guesses 


SOLUTION: 
a. Hy = np = 21(1/3) =7 
If all the students in Mr. Hogarth’s class were just guessing and 


repeated the activity many times, the average number of students 
who guess correctly would be about 7. 
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b. oX = np(1 - p) = 21(1/3)(2/3) = 2.16 
ox = y/np(1 — p) = /21(1/3) (2/3) = 2.16 


If all the students in Mr. Hogarth’s class were just guessing and 
repeated the activity many times, the number of students who guess 
correctly would typically vary by about 2.16 from the mean of 7. 


FOR PRACTICE, TRY EXERCISE 95 


Of the 21 students in Mr. Hogarth’s class, 13 made correct 
identifications. Are you convinced that some of Mr. Hogarth’s students 
could tell bottled water from tap water? The class’s result corresponds to X 
= 13, a value that’s nearly 3 standard deviations above the mean. How 
likely is it that 13 or more of Mr. Hogarth’s students would guess 
correctly? It’s 
P(X>13)=1-P(X<12)=1-binomcdf(trials:21, p:1/3, xvalue:12)=1-—0.9932=( 

P(X>13) = 1-P(X <12) 
= 1-— binomcdf(trials:21, p:1/3, xvalue:12) 
1 — 0.9932 
0.0068 


lI 


The students had less than a 1% chance of getting so many correct 
identifications if they were all just guessing. This result gives convincing 
evidence that some of the students in the class could tell bottled water 
from tap water. 


Think About It 


WHERE DO THE BINOMIAL MEAN AND VARIANCE 
FORMULAS COME FROM? We can derive the formulas for the 
mean and variance of a binomial distribution using what we learned 
about combining random variables in Section 6.2. Let’s start with the 
random variable B that’s described by the following probability 
distribution. 
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Value b; 0 1 
Probability p; 1p p 


You can think of B as representing the result of a single trial of some 
chance process. If a success occurs (probability p), then B = 1. Ifa 
failure occurs, then B = 0. Notice that the mean of B is 


uB=Y bipi=(0)(1-p)+(1)(p)=p"2 = S > bipi = (0) (1— p) + (1) (p) =P 
and that the variance of B is 
OB 2-2 (OE UB) cD AO Bet Bn Dp neh B Dep Bt LD 
a = >> (bi —Hs)’pi =(0—p)’ (1—p)+(1—p)’p 

= p’(1—p)+(1-p)’p 
= p(1—p)[p+(1—p)| 

(1-p) =p(-p) = PP) 

Now consider the random variable X = B, + By + ... + B,. We can 


think of X as counting the number of successes in n independent trials 
of this chance process, with each trial having success probability p. In 
other words, X is a binomial random variable. By the rules from Section 
6.2, the mean of X is 


UX="B1+pB2+...+uBn=p+pt...+p=np 
Mx = Ma, + HB, +... tbe, =~Ptpr...tp=np 


and the variance of X is 


0X2=0B12+0B22+...+o0Bn2=p(1- p)+p(l- p)+...+p(1-p)=np(1-p) 


x = a; +o%, +.- + 0% 
= p(l—p)+p(l1—p)+-.-.+p(1—p) 
= np(1-p) 


The standard deviation of X is therefore 


oX=np(1-p)?* — np (1 7 p) 


CHECK YOUR UNDERSTANDING 


896 


fy 


y To introduce his class to binomial distributions, Mr. Miller does the 
“Pop quiz” activity at the beginning of this section (page 403). Each 
student in the class guesses an answer from A through E on each of the 
10 multiple-choice questions. Mr. Miller determines the “correct” 
answer for each of the 10 questions randomly so that A, B, C, D, or E 
was equally likely to be chosen. Hannah is one of the students in this 
class. Let Y = the number of questions that Hannah answers incorrectly. 


1. Use technology to make a histogram of the probability distribution of 
Y. Describe its shape. 

2. Calculate and interpret the mean of Y. 

3. Calculate and interpret the standard deviation of Y. 

4. On page 412, we defined X = the number of correct answers that 
Hannah got on the quiz. How do the shape, center, and variability of 
the probability distribution of X compare to your answers for 
Questions 1 to 3? 


Binomial Distributions in Statistical Sampling 


The binomial distributions are important in statistics when we wish to 
make inferences about the proportion p of successes in a population. For 
instance, suppose that a supplier inspects a random sample of 10 flash 
drives from a shipment of 10,000 flash drives in which 200 are defective 
(bad). Let X = the number of bad flash drives in the sample. 
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This is not quite a binomial setting. Because we are sampling without 
replacement, the independence condition is violated. The conditional 
probability that the second flash drive chosen is bad changes when we 
know whether the first is good or bad: P(second is bad | first is good) = 
200/9999 = 0.0200 but P(second is bad | first is bad) = 199/9999 = 0.0199. 
These probabilities are very close because removing 1 flash drive from a 
shipment of 10,000 changes the makeup of the remaining 9999 flash drives 
very little. The distribution of X is very close to the binomial distribution 
with n = 10 and p = 0.02. 


To illustrate this, let’s compute the probability that none of the 10 flash 
drives is defective. Using the binomial distribution, it’s 


P(X=0)=(100)(0.02)0(0.98)10=0.8171 


10 
P(X =0) = ( é ) (0.02)°(0.98)'° = 0.8171 


The actual probability of getting no defective flash drives is 


P(no defectives)=980010,000x97999999x97989798x:-x97919991=0.8170 
P (no defectives) = aici x zie x lie ee la 0.8170 
10,000 9999 9798 9991 


Those two probabilities are quite close! 

Almost all real-world sampling, such as taking an SRS from a 
population of interest, is done without replacement. As the preceding 
example illustrates, sampling without replacement leads to a violation of 
the Independent condition. 

However, the flash drives context shows how we can use binomial 
distributions in the statistical setting of selecting a random sample. When 
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the population is much larger than the sample, a count of successes in an 
SRS of size n has approximately the binomial distribution with n equal to 
the sample size and p equal to the proportion of successes in the 
population. What counts as “much larger”? In practice, the binomial 
distribution gives a good approximation as long as we sample less than 
10% of the population. We refer to this as the 10% condition. 


DEFINITION 10% condition 


When taking a random sample of size n from a population of size N, 
we can use a binomial distribution to model the count of successes in 
the sample as long as n < 0.10N. 


Here’s a scenario that shows why it’s important to check the 10% 
condition before calculating a binomial probability. You might recognize 
the setting from the first activity in the book (page 6). 


An airline has just finished training 25 pilots—15 male and 10 
female—to become captains. Unfortunately, only 8 captain positions 
are available right now. Airline managers announce that they will use 
a lottery to determine which pilots will fill the available positions. 
One day later, managers reveal the results of the lottery: Of the 8 
captains chosen, 5 are female and 3 are male. Some of the male pilots 
who weren’t selected suspect that the lottery was not carried out 
fairly. 


What’s the probability of choosing 5 female pilots in a fair lottery? Let X = 
the number of female pilots selected in a random sample of size n = 8 from 
the population of N = 25 pilots. Notice that the sample size is almost 1/3 of 
the population size. If we ignore this fact and use a binomial probability 
calculation, we get 

P(X=5)=(85)(0.40)5(0.60)3=0.124 


P(X =5) = (5) (0.40)°(0.60)* = 0.124 


The correct probability, however, is 0.106. You can see that the binomial 
probability is off by about 17% (0.018/0.106) from the correct answer. 
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EXAMPLE | Teens and debit cards (») 


Binomial distributions and sampling 


PROBLEM: Ina survey of 500 U.S. teenagers aged 14 to 18, subjects 
were asked a variety of questions about personal finance.® One question 
asked whether teens had a debit card. Suppose that exactly 12% of teens 
aged 14 to 18 have debit cards. Let X = the number of teens in a random 
sample of size 500 who have a debit card. 


a. Explain why X can be modeled by a binomial distribution even 
though the sample was selected without replacement. 


b. Use a binomial distribution to estimate the probability that 50 or 
fewer teens in the sample have debit cards. 


SOLUTION: 


a. 500 is less than 10% of all U.S. teenagers aged 14 to 18. 


Check the 10% condition: n < 0.10N 


b. X is approximately binomial with n = 500 and p = 0.12. 


p(X<50) = binomcdf(trials:500, p:0.12, x value: 50)= 0.0932 
p(X <50) = binomedf(trials : 500, p: 0.12, 2 value: 50) 


= 0.0932 


FOR PRACTICE, TRY EXERCISE 99 


The Normal Approximation to Binomial Distributions! 


pePLe> 


¢€ As you saw earlier, the shape of a binomial distribution can be 
skewed to the right, skewed to the left, or roughly symmetric. Something 
interesting happens to the shape as the number of trials n increases. You 
can investigate the relationship between n and p yourself using the Normal 
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Approximation to Binomial Distributions applet at the book’s website, 
highschool.bfwpub.com/tps6e. 

Figure 6.9 shows histograms of binomial distributions for different 
values of n and p. As the number of observations n becomes larger, the 
binomial distribution gets close to a Normal distribution. 


0.148 


oe 
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FIGURE 6.9 Histograms of binomial distributions with (a) n = 10 and p = 0.8, (b) 
n= 20 and p = 0.8, and (c) n= 50 and p = 0.8. As n increases, the shape of the 
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probability distribution gets closer and closer to Normal. 


When n is large, we can use Normal probability calculations to 
approximate binomial probabilities. To see if n is large enough, check the 


Large Counts condition. 


DEFINITION Large Counts condition 


Suppose that a count X of successes has the binomial distribution with 
n trials and success probability p. The Large Counts condition says 
that the probability distribution of X is approximately Normal if 


np>10andn(1—p)>10 "P >10 and n(l—p)>10 


That is, the expected numbers (counts) of successes and failures are 
both at least 10. 


This condition is called “large counts” because np is the expected 
(mean) count of successes and n(1 — p) is the expected (mean) count of 
failures in a binomial setting. Why do we require that both these values be 
at least 10? Look back at Figure 6.9. It is clear that a Normal distribution 
does not approximate the probability distributions in parts (a) or (b) very 
well. This isn’t surprising because the Large Counts condition is not met in 
either case. For graph (a), both np = 10(0.8) = 8 and n(1 — p) = 10(0.2) = 2 
are less than 10. For graph (b), np = 20(0.8) = 16 = 10, but n(1 - p) = 
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20(0.2) = 4 is less than 10. The Normal curve in graph (c) appears to 
model the binomial probability distribution well. This time, the Large 
Counts condition is met: np = 50(0.8) = 40 = 10 and n(1 — p) = 50(0.2) = 
10 => 10. 

The accuracy of the Normal approximation improves as the sample 
size n increases. It is most accurate for any fixed n when p is close to 1/2 
and least accurate when p is near 0 or 1. This is why the Large Counts 
condition depends on p as well as n. 


EXAMPLE | Teens and debit cards (>) 


Normal approximation to a binomial distribution 


PROBLEM: Ina survey of 500 U.S. teenagers aged 14 to 18, subjects 
were asked a variety of questions about personal finance. One question 
asked whether teens had a debit card. Suppose that exactly 12% of teens 
aged 14 to 18 have debit cards. Let X = the number of teens in a random 
sample of size 500 who have a debit card. 


a. Justify why X can be approximated by a Normal distribution. 


b. Use a Normal distribution to estimate the probability that 50 or fewer 
teens in the sample have debit cards. 


SOLUTION: 


a. X is approximately binomial with n = 500 and p = 0.12. Because np 
= 500(0.12) = 60 = 10 and n(1 — p) = 500(0.88) = 440 = 10, we can 
approximate X with a Normal distribution. 

b. px = np = 500(0.12) = 60 and oX = np(1 — p) = 500(0.12)(0.88) = 7.266 
ox = \/np(1 —p) = ,/500 (0.12) (0.88) = 7.266 


Start by calculating the mean and standard deviation of the binomial 


random variable X. 
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38202 45468 eo 60 67266 674.532 81.798 


X = Number of teens with a debit card 


1. Draw a Normal distribution. 


2. Perform calculations—show your work! 


(i) Standardize and use Table A or technology; or 
(ii) Use technology without standardizing. 
Be sure to answer the question that was asked. 


i. z=50-607.266=-1.38° 7.266 
Using Table A: 0.0838 


Using technology: normalcdf(lower: —1000, upper: —1.38, mean:0, 
SD:1) = 0.0838 


ii. normalcdf(lower:0, upper:50, mean:60, SD:7.266) = 0.0844 


P(X <50)5P(Z<-1.38)F (X < 50)5P(Z <- 1.38) 


FOR PRACTICE, TRY EXERCISE 103 


The probability from the Normal approximation in this example, 
0.0844, misses the exact binomial probability of 0.0932 from the example 
by about 0.0088. 


Geometric Random Variables 


In a binomial setting, the number of trials n is fixed in advance, and the 
binomial random variable X counts the number of successes. The possible 
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values of X are 0, 1, 2, ... , n. In other situations, the goal is to repeat a 
chance process until a success occurs: 


e Roll a pair of dice until you get doubles. 
e In basketball, attempt a 3-point shot until you make one. 


e Keep placing a $1 bet on the number 15 in roulette until you win. 


These are all examples of a geometric setting. 


DEFINITION Geometric setting 


A geometric setting arises when we perform independent trials of the 
same chance process and record the number of trials it takes to get 
one success. On each trial, the probability p of success must be the 
same. 


Here’s an activity your class can try that involves a geometric setting. 


ACTIVITY Is this your lucky day? 


Your teacher is planning to give you 10 problems for homework. As an 
alternative, you can agree to play the Lucky Day game. Here’s how it 
works. A student will be selected at random from your class and asked 
to pick a day of the week (e.g., Thursday). Then your teacher will use 
technology to randomly choose a day of the week as the “lucky day.” If 
the student picks the correct day, the class will have only one homework 
problem. If the student picks the wrong day, your teacher will select 
another student from the class at random. The chosen student will pick a 
day of the week and your teacher will use technology to choose a “lucky 
day.” If this student gets it right, the class will have two homework 
problems. The game continues until a student correctly picks the lucky 
day. Your teacher will assign a number of homework problems that is 
equal to the total number of picks made by members of your class. Are 
you ready to play the Lucky Day game? 


1. Decide as a class whether to “gamble” on the number of homework 
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problems you will receive. You have 30 seconds. 


2. Play the Lucky Day game and see what happens! 


In a geometric setting, if we define the random variable Y to be the 
number of trials needed to get the first success, then Y is called a 
geometric random variable. The probability distribution of Y is a 


geometric distribution. 


DEFINITION Geometric random variable, Geometric 
distribution 


The number of trials Y that it takes to get a success in a geometric 
setting is a geometric random variable. The probability distribution 
of Y is a geometric distribution with probability p of success on any 
trial. The possible values of Y are 1, 2, 3,.... 


As with binomial random variables, it’s important to be able to 
distinguish situations in which a geometric distribution does and doesn’t 
apply. Let’s consider the Lucky Day game. The random variable of 
interest in this game is Y = the number of picks it takes to correctly match 
the lucky day. Each pick is one trial of the chance process. Knowing the 
result of one student’s pick tells us nothing about the result of any other 
pick. On each trial, the probability of a correct pick is 1/7. This is a 
geometric setting. Because Y counts the number of trials to get the first 
success, it is a geometric random variable with p = 1/7. 

What is the probability that the first student picks correctly and wins 
the Lucky Day game? It’s P(Y = 1) = 1/7. That’s also the class’s chance of 
having only one homework problem assigned. For the class to have two 
homework problems assigned, the first student selected must pick an 
incorrect day of the week and the second student must pick the lucky day 
correctly. The probability that this happens is 


P(Y=2)=(6/7)(1/7)=0.1224P(Y = 2) = (6/7) (1/7) = 0.1224 


Likewise, 
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P(Y=3)=(6/7)(6/7)(1/7)=0.1050/ (Y = 3) = (6/7) (6/7) (1/7) = 0.1050 


In general, the probability that the first correct pick comes on the kth trial 
is 


jyk-1 64 | 
P(Y=k)=(6/7)k-1(1/7)P(Y = k) = (6/7)"~ (1/7) 
Let’s summarize what we’ve learned about calculating a geometric 


probability. 


GEOMETRIC PROBABILITY FORMULA 


If Y has the geometric distribution with probability p of success on each 


trial, the possible values of Y are 1, 2, 3, ... . If k is any one of these 
values, 


P(Y=k)=(1-p)k-1p? (¥ =k) = 1p)" 'p 


With the geometric probability formula in hand, we can now compute 
any geometric probability. 


EXAMPLE |The Lucky Day game (>) 


Calculating geometric probabilities 


PROBLEM: Mr. Lochel’s class decides to play the Lucky Day game. 
Let Y = the number of homework problems that the class receives. 


a. Find the probability that the class receives exactly 10 homework 
problems as a result of playing the Lucky Day game. 


b. Find P(Y < 10) and interpret this value. 


SOLUTION: 


Y has a geometric distribution with p = 1/7. 
(a) P(Y = 10) = (6/7)? (1/7) = 0.0357 
(b) P(Y < 10) = P(Y= 1) + P(Y=2) + P(Y=3)+...+ P(Y=9) 
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= 1/7 + (6/7)(1/7) + (6/7)2(1/7) + ... + (6/7)8(1/7) 
= 0.7503 


There’s about a 75% probability that the class will get fewer than 10 
homework problems by playing the Lucky Day game. 


FOR PRACTICE, TRY EXERCISE 107 


There’s a clever alternative approach to finding the probability in part 
(b) of the example. By the complement rule, P(Y < 10) = 1 — P(Y = 10). 
What’s the probability that it will take at least 10 picks for Mr. Lochel’s 
class to win the Lucky Day game? It’s the chance that the first 9 picks are 


(¢) = 0.250 
all incorrect: (67)9=0.250 \ ' . So the probability that the class 


will win the Lucky Day game in fewer than 10 picks (and therefore have 
fewer than 10 homework problems assigned) is 
P(Y<10)=1-P(Y=>10)=1-0.250=0.750 
P(Y < 10) =1—-P(Y > 10) = 1 — 0.250 = 0.750 
As you probably guessed, we can use technology to calculate 
geometric probabilities. The following Technology Corner shows how to 
do it. 


16. Technology Corner |CALCULATING GEOMETRIC 
PROBABILITIES |») 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


There are two handy commands on the TI-83/84 for finding geometric 
probabilities: geometpdf and geometcdf. The inputs for both commands 
are the success probability p and the value(s) of interest for the 
geometric random variable Y. 


geometpdf (p,k) computes P(Y = k) 


geometcdf (p,k) computes P(Y < k) 
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Let’s use these commands to confirm our answers in the previous 
example. 


a. Find the probability that the class receives exactly 10 homework 
problems as a result of playing the Lucky Day game. 


e Press (DISTR) and choose geometpdf(. 
OS 2.55 or later: In the dialog box, enter these values: p:1/ 7, x 


value:10, choose Paste, and then press [ENTER |. 
Older OS: Complete the command geometpdf(1/ 7,10) and press 


[ENTER|. 


These results agree with our previous answer using the geometric 
probability formula: 0.0357. 


NORMAL FLOAT AUTO REAL RADIAN MP 


geometpdf (177,10) 
@.9356763859 


b. Find P(Y<< 10) and interpret this value. To find P(Y<10), use the 
geometcdf command: 


P(Y<10)=P(Y<9)=geometcdf(1/7,9) 
P(Y < 10) = P(Y < 9) = geometcdf(1/7,9) 


e Press (DISTR) and choose geometcdf. 
OS 2.55 or later: In the dialog box, enter these values: p:1/ 7, x 


value:9, choose Paste, and then press [ENTER\. 
Older OS: Complete the command geometcdf(1/ 7,9) and press 


[ENTER] 


These results agree with our previous answer using the geometric 
probability formula: 0.7503. 
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NORMAL FLOAT AUTO REAL RADIAN MP fl 


geometcdf (177.9) 
@. 7502652985 


The table shows part of the probability distribution of Y = the number 
of picks it takes to match the lucky day. We can’t show the entire 
distribution because the number of trials it takes to get the first success 
could be a very large number. 


Value y ; 1 2 3 4 5 6 7 8 g 
Probability p; 0.143 0.122 0.105 0.090 0.077 0.066 0.057 0.049 0.042 


Figure 6.10 is a histogram of the probability distribution for values of 
Y from 1 to 26. Let’s describe what we see. 
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FIGURE 6.10 Histogram showing the probability distribution of the geometric 
random variable Y = number of trials needed for students to pick correctly in the 
Lucky Day game. 


Shape: Skewed to the right. Every geometric distribution has this shape. 
That’s because the most likely value of a geometric random variable is 1. 
The probability of each successive value decreases by a factor of (1 — p). 
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Center: The mean (expected value) of Y is py = 7. (Due to the infinite 
number of possible values of Y, the calculation of the mean is beyond the 
scope of this text.) If the class played the Lucky Day game many times, 
they would receive an average of 7 homework problems. It’s no 
coincidence that p = 1/7 and py = 7. With probability of success 1/7 on 
each trial, we’d expect it to take an average of 7 trials to get the first 
success. 


Variability: The standard deviation of Y is oy = 6.48. (Due to the infinite 
number of possible values of Y, the calculation of the standard deviation is 
beyond the scope of this text.) If the class played the Lucky Day game 
many times, the number of homework problems they receive would 
typically vary by about 6.5 problems from the mean of 7. That could mean 
a lot of homework! 

We can generalize the result for the mean of a geometric random 
variable. 


MEAN (EXPECTED VALUE) OF A GEOMETRIC 
RANDOM VARIABLE 


If Y is a geometric random variable with probability of success p on 
each trial, then its mean (expected value) is phY=E(Y)=1p 
py = E(Y)= 


] 
p, That is, the expected number of trials required to get 
the first success is 1/p. 


CHECK YOUR UNDERSTANDING 


Suppose you roll a pair of fair, six-sided dice until you get doubles. Let 
T = the number of rolls it takes. Note that the probability of getting 
doubles on any roll is 6/36 = 1/6. 
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1. Show that T is a geometric random variable. 
2. Find P(T = 3). Interpret this result. 


3. In the game of Monopoly, a player can get out of jail free by rolling 
doubles within 3 turns. Find the probability that this happens. 


Section 6.3 Summary 


e A binomial setting arises when we perform n independent trials of the 
same chance process and count the number of times that a particular 
outcome (a “success”) occurs. The conditions for a binomial setting are: 


» Binary? The possible outcomes of each trial can be classified as 
“success” or “failure.” 


« Independent? Trials must be independent. That is, knowing the result 
of one trial must not tell us anything about the result of any other trial. 


« Number? The number of trials n of the chance process must be fixed in 
advance. 


« Same probability? There is the same probability of success p on each 
trial. 


Remember to check the BINS! 


e The count of successes X in a binomial setting is a special type of 
discrete random variable known as a binomial random variable. Its 
probability distribution is a binomial distribution. Any binomial 
distribution is completely specified by two numbers: the number of 
trials n of the chance process and the probability of success p on any 
trial. The possible values of X are the whole numbers 0, 1, 2, ... , n. 


e Use the binomial probability formula to calculate the probability of 
getting exactly k successes in n trials: 


Pix=h = (*) (1 — py 
P(X=k)=(nk)pk(1—-p)n-k k 


: The binomial coefficient 
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( n ) n! 
(nk)=n!k!(n-k)! \F 7 Bi(n—k)! 
counts the number of ways k successes can be arranged among n trials. 


» The factorial of n is 
n!=n(n-1)(n-2) (3)(2)(1)™ =n(n—1)(n—2)--:-- (3) (2) (1) 
for positive whole numbers n, and 0! = 1. 


You can also use technology to calculate binomial probabilities. The TI- 
83/84 command binompdf(n,p,k) computes P(X = k). The TI-83/84 
command binomcdf(n,p,k) computes P(X < k). 


A binomial distribution can have a shape that is roughly symmetric, 
skewed to the right, or skewed to the left. 


The mean and standard deviation of a binomial random variable X are 
uX=npandoX=np(1-p) 4X = "P and ox = ,/np(1—p) 


The binomial distribution with n trials and probability p of success gives 
a good approximation to the count of successes in a random sample of 
size n from a large population containing proportion p of successes. This 
is true as long as the sample size n is less than 10% of the population 
size N (the 10% condition). 


The Normal approximation to the binomial distribution* says that if X is 
a count of successes having the binomial distribution with n trials and 
success probability p, then when n is large, X is approximately Normally 
distributed. You can use this approximation when np = 10 and n(1 — p) 
> 10 (the Large Counts condition). 


A geometric setting consists of repeated trials of the same chance 
process in which the probability p of success is the same on each trial, 
and the goal is to count the number of trials it takes to get one success. 
If Y = the number of trials required to obtain the first success, then Y is a 
geometric random variable. Its probability distribution is called a 
geometric distribution. 


If Y has the geometric distribution with probability of success p, the 
possible values of Y are the positive integers 1, 2, 3, ... . The probability 
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that it takes exactly k trials to get the first success is given by 
k-1 
P(Y=k)=(1-p)k-1pP (Y =*) =(1—p)" p 


e The mean (expected value) of a geometric random variable Y is pry = 
1/p. 


6.3 Technology Corner >» 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 

13. Calculating binomial coefficients Page 408 
14. Calculating binomial probabilities Page 410 
15. Graphing binomial probability distributions Page 413 
16. Calculating geometric probabilities Page 424 


Section 6.3 Exercises 


In Exercises 77-80, determine whether the given scenario describes a 
binomial setting. Justify your answer. 


77. pg 405 |) Baby elk Biologists estimate that a randomly selected 


baby elk has a 44% chance of surviving to adulthood. Assume this 
estimate is correct. Suppose researchers choose 7 baby elk at random 
to monitor. Let X = the number that survive to adulthood. 


78. Long or short? Put the names of all the students in your statistics 
class in a hat. Mix up the names, and draw 4 without looking. Let X = 
the number whose last names have more than six letters. 


79. Bull’s-eye! Lawrence likes to shoot a bow and arrow in his free time. 
On any shot, he has about a 10% chance of hitting the bull’s-eye. As a 
challenge one day, Lawrence decides to keep shooting until he gets a 
bull’s-eye. Let Y = the number of shots he takes. 
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80. 


81. 


82. 


83. 


84. 


Taking the train According to New Jersey Transit, the 8:00 A.M. 
weekday train from Princeton to New York City has a 90% chance of 
arriving on time on a randomly selected day. Suppose this claim is 
true. Choose 6 days at random. Let Y = the number of days on which 
the train arrives on time. 


pg 409 \p) Baby elk Refer to Exercise 77. Use the binomial 
probability formula to find P(X = 4). Interpret this value. 


Taking the train Refer to Exercise 80. Use the binomial probability 
formula to find P(Y = 4). Interpret this value. 


Take a spin An online spinner has two colored regions—blue and 
yellow. According to the website, the probability that the spinner 
lands in the blue region on any spin is 0.80. Assume for now that this 
claim is correct. Suppose we spin the spinner 12 times and let X = the 
number of times it lands in the blue region. 


a. Explain why X is a binomial random variable. 


b. Find the probability that exactly 8 spins land in the blue region. 


Red light! Pedro drives the same route to work on Monday through 
Friday. His route includes one traffic light. According to the local 
traffic department, there is a 55% chance that the light will be red on 
a randomly selected work day. Suppose we choose 10 of Pedro’s 
work days at random and let Y = the number of times that the light is 
red. 


a. Explain why Y is a binomial random variable. 


b. Find the probability that the light is red on exactly 7 days. 
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85. 


86. 


87. 
88. 
89. 


90. 


pg 409 |») Baby elk Refer to Exercise 77. How surprising would it 


be for more than 4 elk in the sample to survive to adulthood? 
Calculate an appropriate probability to support your answer. 


Taking the train Refer to Exercise 80. Would you be surprised if the 
train arrived on time on fewer than 4 days? Calculate an appropriate 
probability to support your answer. 


Take a spin Refer to Exercise 83. Calculate and interpret P(X < 7). 
Red light! Refer to Exercise 84. Calculate and interpret P(Y = 7). 


pg 411 |») The last kiss Do people have a preference for the last 


thing they taste? Researchers at the University of Michigan designed 
a study to find out. The researchers gave 22 students five different 
Hershey’s Kisses (milk chocolate, dark chocolate, creme, caramel, 
and almond) in random order and asked the student to rate each one. 
Participants were not told how many Kisses they would be tasting. 
However, when the 5th and final Kiss was presented, participants 
were told that it would be their last one.2 Assume that the participants 
in the study don’t have a special preference for the last thing they 
taste. That is, assume that the probability a person would prefer the 
last Kiss tasted is p = 0.20. 


a. Find the probability that 14 or more students would prefer the last 
Kiss tasted. 


b. Of the 22 students, 14 gave the final Kiss the highest rating. Does 


this give convincing evidence that the participants have a 
preference for the last thing they taste? 


1 in 6 wins As a special promotion for its 20-ounce bottles of soda, a 
soft drink company printed a message on the inside of each bottle 
cap. Some of the caps said, “Please try again!” while others said, 
“You’re a winner!” The company advertised the promotion with the 
slogan “1 in 6 wins a prize.” Grayson’s statistics class wonders if the 
company’s claim holds true at a nearby convenience store. To find 
out, all 30 students in the class go to the store and each buys one 20- 
ounce bottle of the soda. 
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a. Find the probability that two or fewer students would win a prize if 
the company’s claim is true. 


b. Two of the students in Grayson’s class got caps that say, “You’re a 
winner!” Does this result give convincing evidence that the 
company’s 1-in-6 claim is false? 


91. pg 414 |») Bag check Thousands of travelers pass through the 


92. 


airport in Guadalajara, Mexico, each day. Before leaving the airport, 
each passenger must go through the customs inspection area. Customs 
agents want to be sure that passengers do not bring illegal items into 
the country. But they do not have time to search every traveler’s 
luggage. Instead, they require each person to press a button. Either a 
red or a green bulb lights up. If the red light flashes, the passenger 
will be searched by customs agents. A green light means “go ahead.” 
Customs agents claim that the light has probability 0.30 of showing 
red on any push of the button. Assume for now that this claim is true. 
Suppose we watch 20 passengers press the button. Let R = the 
number who get a red light. Here is a histogram of the probability 
distribution of R: 


a. What probability distribution does R have? Justify your answer. 


b. Describe the shape of the probability distribution. 


0.20 + 
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Probability 
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R = Number of passengers who get a red light 
Easy-start mower? A company has developed an “easy-start” mower 
that cranks the engine with the push of a button. The company claims 
that the probability the mower will start on any push of the button is 
0.9. Assume for now that this claim is true. On the next 30 uses of the 
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93. 


94. 


95. 


mower, let T = the number of times it starts on the first push of the 
button. Here is a histogram of the probability distribution of T: 


a. What probability distribution does T have? Justify your answer. 


b. Describe the shape of the probability distribution. 
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R = Number of passengers who get a red light 


Take a spin An online spinner has two colored regions—blue and 
yellow. According to the website, the probability that the spinner 
lands in the blue region on any spin is 0.80. Assume for now that this 
claim is correct. Suppose we spin the spinner 12 times and let X = the 
number of times it lands in the blue region. Make a graph of the 
probability distribution of X. Describe its shape. 


Red light! Pedro drives the same route to work on Monday through 
Friday. His route includes one traffic light. According to the local 
traffic department, there is a 55% chance that the light will be red on 
a randomly selected work day. Suppose we choose 10 of Pedro’s 
work days at random and let Y = the number of times that the light is 
red. Make a graph of the probability distribution of Y. Describe its 
shape. 


416 Bag check Refer to Exercise 91. 
pg a, 8 Exercise JL 


a. Calculate and interpret the mean of R. 


b. Calculate and interpret the standard deviation of R. 
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96. Easy-start mower Refer to Exercise 92. 


a. Calculate and interpret the mean of T. 


b. Calculate and interpret the standard deviation of T. 


97. Random digit dialing When a polling company calls a telephone 


98. 


99. 


number at random, there is only a 9% chance that the call reaches a 

live person and the survey is successfully completed.12 Suppose the 
random digit dialing machine makes 15 calls. Let X = the number of 
calls that result in a completed survey. 


a. Find the probability that more than 12 calls are not completed. 
b. Calculate and interpret uX/*. 


c. Calculate and interpret oX°7% . 

Lie detectors A federal report finds that lie detector tests given to 
truthful persons have probability 0.2 of suggesting that the person is 
deceptive! A company asks 12 job applicants about thefts from 
previous employers, using a lie detector to assess their truthfulness. 
Suppose that all 12 answer truthfully. Let Y = the number of people 
whom the lie detector indicates are being deceptive. 


a. Find the probability that the lie detector indicates that at least 10 of 
the people are being honest. 


b. Calculate and interpret pYFY . 
c. Calculate and interpret 0oY?Y . 


pg 419 |») Lefties A total of 11% of students at a large high school 


are left-handed. A statistics teacher selects a random sample of 100 
students and records L = the number of left-handed students in the 
sample. 


a. Explain why L can be modeled by a binomial distribution even 
though the sample was selected without replacement. 


b. Use a binomial distribution to estimate the probability that 15 or 
more students in the sample are left-handed. 
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100. 


101. 


102. 


103. 


104. 


In debt? According to financial records, 24% of U.S. adults have 
more debt on their credit cards than they have money in their 
savings accounts. Suppose that we take a random sample of 100 
U.S. adults. Let D = the number of adults in the sample with more 


debt than savings. 


a. Explain why D can be modeled by a binomial distribution even 
though the sample was selected without replacement. 


b. Use a binomial distribution to estimate the probability that 30 or 
more adults in the sample have more debt than savings. 


Airport security The Transportation Security Administration (TSA) 
is responsible for airport safety. On some flights, TSA officers 
randomly select passengers for an extra security check before 
boarding. One such flight had 76 passengers—12 in first class and 
64 in coach class. Some passengers were surprised when none of the 
10 passengers chosen for screening were seated in first class. Should 
we use a binomial distribution to approximate this probability? 
Justify your answer. 


Scrabble In the game of Scrabble, each player begins by drawing 7 
tiles from a bag containing 100 tiles. There are 42 vowels, 56 
consonants, and 2 blank tiles in the bag. Cait chooses her 7 tiles and 
is surprised to discover that all of them are vowels. Should we use a 
binomial distribution to approximate this probability? Justify your 


answer. 
*pg 421 (pb) Lefties Refer to Exercise 99. 
a. Justify why L can be approximated by a Normal distribution. 


b. Use a Normal distribution to estimate the probability that 15 or 
more students in the sample are left-handed. 


*In debt? Refer to Exercise 100. 
a. Justify why D can be approximated by a Normal distribution. 


b. Use a Normal distribution to estimate the probability that 30 or 
more adults in the sample have more debt than savings. 
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105. 


106. 


107. 


108. 


109. 


10% condition To use a binomial distribution to approximate the 
count of successes in an SRS, why do we require that the sample 
size n be less than 10% of the population size N? 


* Large Counts condition To use a Normal distribution to 
approximate binomial probabilities, why do we require that both np 
and n(1 — p) be at least 10? 


pg 423 |p) Cranky mower To start her old lawn mower, Rita has 


to pull a cord and hope for some luck. On any particular pull, the 
mower has a 20% chance of starting. 


a. Find the probability that it takes her exactly 3 pulls to start the 
mower. 


b. Find the probability that it takes her more than 6 pulls to start the 
mower. 


1-in-6 wins Alan decides to use a different strategy for the 1-in-6 
wins game of Exercise 90. He keeps buying one 20-ounce bottle of 
the soda at a time until he gets a winner. 


a. Find the probability that he buys exactly 5 bottles. 


b. Find the probability that he buys at most 6 bottles. Show your 
work. 


Geometric or not? Determine whether each of the following 
scenarios describes a geometric setting. If so, define an appropriate 
geometric random variable. 


a. A popular brand of cereal puts a card bearing the image of 1 of 5 
famous NASCAR drivers in each box. There is a 1/5 chance that 
any particular driver’s card ends up in any box of cereal. Buy 
boxes of the cereal until you have all 5 drivers’ cards. 


b. Ina game of 4-Spot Keno, Lola picks 4 numbers from 1 to 80. 
The casino randomly selects 20 winning numbers from 1 to 80. 
Lola wins money if she picks 2 or more of the winning numbers. 
The probability that this happens is 0.259. Lola decides to keep 
playing games of 4-Spot Keno until she wins some money. 
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110. 


111. 


Geometric or not? Determine whether each of the following 
scenarios describes a geometric setting. If so, define an appropriate 
geometric random variable. 


a. 


Shuffle a standard deck of playing cards well. Then turn over one 


card at a time from the top of the deck until you get an ace. 


b. Billy likes to play cornhole in his free time. On any toss, he has 


about a 20% chance of getting a bag into the hole. As a challenge 
one day, Billy decides to keep tossing bags until he gets one in 
the hole. 


Using Benford’s law According to Benford’s law (Exercise 15, page 


377), the probability that the first digit of the amount of a randomly 
chosen invoice is an 8 or a9 is 0.097. Suppose you examine 
randomly selected invoices from a vendor until you find one whose 


amount begins with an 8 ora 9. 


a. How many invoices do you expect to examine before finding one 
that begins with an 8 or 9? 


b. In fact, the first invoice you find with an amount that starts with 
an 8 or 9 is the 40th invoice. Does this result provide convincing 
evidence that the invoice amounts are not genuine? Calculate an 
appropriate probability to support your answer. 


112. Roulette Marti decides to keep placing a $1 bet on number 15 in 


consecutive spins of a roulette wheel until she wins. On any spin, 
there’s a 1-in-38 chance that the ball will land in the 15 slot. 


a. How many spins do you expect it to take for Marti to win? 


b. Would you be surprised if Marti won in 3 or fewer spins? 
Compute an appropriate probability to support your answer. 


Multiple Choice: Select the best answer for Exercises 113-117. 


113. Joe reads that 1 out of 4 eggs contains salmonella bacteria. So he 


never uses more than 3 eggs in cooking. If eggs do or don’t contain 
salmonella independently of each other, the number of contaminated 
eggs when Joe uses 3 eggs chosen at random has the following 
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distribution: 

a. binomial; n= 4 and p = 1/4 
b. binomial; n= 3 and p = 1/4 
c. binomial; n= 3 and p = 1/3 
d. geometric; p = 1/4 


e. geometric; p = 1/3 


Exercises 114 and 115 refer to the following setting. A fast-food restaurant 
runs a promotion in which certain food items come with game pieces. 
According to the restaurant, 1 in 4 game pieces is a winner. 


114. 


115. 


If Jeff gets 4 game pieces, what is the probability that he wins 
exactly 1 prize? 


a. 0.25 
b. 1.00 


4 
0.2 
c. ee | ') 


4 
0.2 
d. eee | ') 


e. (0.75)2(0.75)! 


If Jeff keeps playing until he wins a prize, what is the probability that 
he has to play the game exactly 5 times? 


a. (0.25)? 


b. (0.75)4 


ie) 


. (0.75)? 


d. (0.75)4(0.25) 


5 
0.7 
eee | 1 


io) 


116. Each entry in a table of random digits like Table D has probability 


0.1 of being a 0, and the digits are independent of one another. Each 
line of Table D contains 40 random digits. The mean and standard 
deviation of the number of Os in a randomly selected line will be 
approximately 


a. mean = 0.1, standard deviation = 0.05. 
b. mean = 0.1, standard deviation = 0.1. 
c. mean = 4, standard deviation = 0.05. 
d. mean = 4, standard deviation = 1.90. 


e. mean = 4, standard deviation = 3.60. 


117. * In which of the following situations would it be appropriate to use a 


Normal distribution to approximate probabilities for a binomial 
distribution with the given values of n and p? 


a. n=10,p=0.5 

b. n= 40, p = 0.88 

c. n=100, p=0.2 

d. n= 100, p = 0.99 

e. n= 1000, p = 0.003 


Recycle and Review 


118. 


Spoofing (4.2) To collect information such as passwords, online 
criminals use “spoofing” to direct Internet users to fraudulent 
websites. In one study of Internet fraud, students were warned about 
spoofing and then asked to log into their university account starting 
from the university’s home page. In some cases, the log-in link led 
to the genuine dialog box. In others, the box looked genuine but, in 
fact, was linked to a different site that recorded the ID and password 
the student entered. The box that appeared for each student was 
determined at random. An alert student could detect the fraud by 
looking at the true Internet address displayed in the browser status 
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bar, but most just entered their ID and password. 


a. Is this an observational study or an experiment? Justify your 
answer. 


b. What are the explanatory and response variables? Identify each 
variable as categorical or quantitative. 


119. Standard deviations (6.1) Continuous random variables A, B, and 
C all take values between 0 and 10. Their density curves, drawn on 
the same horizontal scales, are shown here. Rank the standard 
deviations of the three random variables from smallest to largest. 
Justify your answer. 


a a 


- + 
Hu H 
B 


Orr 
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_Mé _. FRAPPY! FREE RESPONSE 


AP® PROBLEM, 
YAY! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, concise 
response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you 
use, because you will be scored on the correctness of your methods 
as well as on the accuracy and completeness of your results and 
explanations. 


Buckley Farms produces homemade potato chips that it sells in 
bags labeled “16 ounces.” The total weight of each bag follows an 
approximately Normal distribution with a mean of 16.15 ounces and 
a standard deviation of 0.12 ounce. 


a. Ifyou randomly selected 1 bag of these chips, what is the 
probability that the total weight is less than 16 ounces? 


b. Ifyou randomly selected 10 bags of these chips, what is the 
probability that exactly 2 of the bags will have a total weight 
less than 16 ounces? 


c. Buckley Farms ships its chips in boxes that contain 6 bags. 
The empty boxes have a mean weight of 10 ounces and a 
standard deviation of 0.05 ounce. Calculate the mean and 
standard deviation of the total weight of a box containing 6 
bags of chips. 
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d. Buckley Farms decides to increase the mean weight of each 
bag of chips so that only 5% of the bags have weights that are 
less than 16 ounces. Assuming that the standard deviation 
remains 0.12 ounce, what mean weight should Buckley Farms 
use? 


After you finish, you can view two example solutions on the book’s 
website (highschool.bfwpub.com/tps6e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 
“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide you with a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own 


score. 
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Chapter 6 Review 


Section 6.1: Discrete and Continuous Random Variables 


A random variable assigns numerical values to the outcomes of a chance 
process. The probability distribution of a random variable describes its 
possible values and their probabilities. There are two types of random 
variables: discrete and continuous. Discrete random variables take on a 
fixed set of values with gaps in between. Continuous random variables 
take on all values in an interval of numbers. 


As in Chapter 1, we are often interested in the shape, center, and 
variability of a probability distribution. The shape of a discrete probability 
distribution can be identified by graphing a probability histogram, with the 
height of each bar representing the probability of a single value. The center 
is usually identified by the mean (expected value) of the random variable. 
The mean (expected value) is the average value of the random variable if 
the chance process is repeated many times. The variability of a probability 
distribution is usually identified by the standard deviation, which describes 
how much the values of a random variable typically vary from the mean 
value, in many repetitions of the chance process. 


Continuous probability distributions, such as the Normal distribution, 
describe the distribution of continuous random variables. A density curve 
is used to display a continuous probability distribution. Probabilities for 
continuous random variables are determined by finding the area under the 
density curve and above the values of interest. 


Section 6.2: Transforming and Combining Random Variables 


In this section, you learned how linear transformations of a random 
variable affect the shape, center, and variability of its probability 
distribution. Similar to what you learned in Chapter 2, adding a positive 
constant to (or subtracting it from) each value of a random variable 
changes the measures of center and location, but not the shape or 
variability of the probability distribution. Multiplying or dividing each 
value of a random variable by a positive constant changes the measures of 
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center and location and measures of variability, but not the shape of the 
probability distribution. 
You also learned how to calculate the mean and standard deviation for 
a combination of two or more random variables. The mean of a sum or 
difference of any two random variables X and Y is given by 
uX+Y=uX+pYandux-Y=ypX-pY 
bxsy =x +py and px_y = px — by 


If X and Y are any two independent random variables, variances add: 


OX+Y2=0X2+0Y2 and oX-Y2=0X2+0Y2 


2 2 2 2 2 2 
Oxsy =OX toy and oy y =ox toy 


To find the standard deviation, just take the square root of the variance. 
Recall that X and Y are independent if knowing the value of one variable 
doesn’t help you predict the value of the other variable. Also, if 
independent random variables X and Y are both Normally distributed, then 
their sum X + Y and difference X — Y are both Normally distributed as 
well. 


Section 6.3: Binomial and Geometric Random Variables 


In this section, you learned about two common types of discrete random 
variables, binomial random variables and geometric random variables. 
Binomial random variables count the number of successes in a fixed 
number of trials (n), whereas geometric random variables count the 
number of trials needed to get one success. Otherwise, the binomial and 
geometric settings have the same conditions: there must be two possible 
outcomes for each trial (success or failure), the trials must be independent, 
and the probability of success p must stay the same throughout all trials. 


To calculate probabilities for a binomial distribution with n trials and 
probability of success p on each trial, use technology or the binomial 
probability formula 


P(X=k)= (‘; ) a —p)" 
P(X=k)=(nk)pk(1-p)n-k k 


The mean and standard deviation of a binomial random variable X are 
uX=npandoX=np(1-p)4#* ~ "P and ox = ,/np(1—p) 
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The shape of a binomial distribution depends on both the number of 
trials n and the probability of success p. When the number of trials is large 
enough that both np and n(1 — p) are at least 10, the distribution of the 
binomial random variable X has an approximately Normal distribution. Be 
sure to check the Large Counts condition before using a Normal 
approximation to a binomial distribution. 


A common application of the binomial distribution is when we count 
the number of times a particular outcome occurs in a random sample from 
some population. Because sampling is almost always done without 
replacement, the Independent condition is violated. However, if the sample 
size is a small fraction of the population size (less than 10%), the lack of 
independence isn’t a concern. Be sure to check the 10% condition when 
sampling is done without replacement before using a binomial distribution. 


Finally, to calculate probabilities for a geometric distribution with 
probability of success p on each trial, use technology or the geometric 
probability formula 


P(Y=k)=(1-p)k-1p? (¥ =k) = (1p) "p 


The mean (expected value) of a geometric random variable Y is pY=1/p 
py =1/p. 


What Did You Learn? 


Related 

Example on Relevant Chapter 
Learning Target Section Page(s) Review Exercise(s) 
Use the probability 6.1 363 R6.1 
distribution of a 
discrete random 
variable to calculate 
the probability of an 
event. 
Make a histogram to 6.1 365 R6.3 


display the probability 
distribution of a 
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discrete random 
variable and describe 
its shape. 


Calculate and interpret 6.1 367 
the mean (expected 

value) of a discrete 

random variable. 


Calculate and interpret 6.1 369 
the standard deviation 

of a discrete random 

variable. 


Use the probability 6.1 O/2s070 
distribution of a 

continuous random 

variable (uniform or 

Normal) to calculate 

the probability of an 

event. 


Describe the effect of 6.2 384, 385, 387 
adding or subtracting a 

constant or 

multiplying or 

dividing by a constant 

on the probability 

distribution of a 

random variable. 


Calculate the mean 6.2 389, 392 
and standard deviation 

of the sum or 

difference of random 

variables. 


Find probabilities 6.2 395 
involving the sum or 

difference of 

independent Normal 

random variables. 


Determine whether the 6.3 405, 414 
conditions for a 

binomial setting are 

met. 
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Calculate and interpret 
probabilities involving 
binomial distributions. 


Calculate the mean 
and standard deviation 
of a binomial random 
variable. Interpret 
these values. 

iWhen appropriate, use 
the Normal 
approximation to the 
binomial distribution 
to calculate 
probabilities. 


Find probabilities 
involving geometric 
random variables. 


iThis topic is not required for the AP® Statistics exam. 


6.3 


6.3 


6.3 


6.3 


409, 411, 419 
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Chapter 6 Review Exercises 


These exercises are designed to help you review the important ideas and 
methods of the chapter. 


R6.1 _») Knees Patients receiving artificial knees often experience pain 
after surgery. The pain is measured on a subjective scale with 
possible values of 1 (low) to 5 (high). Let Y be the pain score for a 
randomly selected patient. The following table gives the probability 
distribution for Y. 


Value 1 2 3 4 5 
Probability 0.1 0.2 0.3 0.3 ?? 


a. Find P(Y = 5). Interpret this value. 


b. Find the probability that a randomly selected patient has a pain 
score of at most 2. 


c. Calculate the expected pain score and the standard deviation of 
the pain score. 


R6.2 |) A glass act In a process for manufacturing glassware, glass 


stems are sealed by heating them in a flame. Let X be the 
temperature (in degrees Celsius) for a randomly chosen glass. The 
mean and standard deviation of X are Ly = 550°C and oy = 5.7°C. 


a. Is temperature a discrete or continuous random variable? Explain 
your answer. 


b. The target temperature is 550°C. What are the mean and standard 
deviation of the number of degrees off target, D = X — 550? 


c. A manager asks for results in degrees Fahrenheit. The conversion 
of X into degrees Fahrenheit is given by Y=95X+32 
_ 9 
es oe ue What are the mean pty and the standard deviation 


Oy of the temperature of the flame in the Fahrenheit scale? 
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R6.3 |) Keno In a game of 4-Spot Keno, the player picks 4 numbers 


from 1 to 80. The casino randomly selects 20 winning numbers from 
1 to 80. The table shows the possible outcomes of the game and their 
probabilities, along with the amount of money (Payout) that the 
player wins for a $1 bet. If X = the payout for a single $1 bet, you 
can check that [ty = $0.70 and oy = $6.58. 


Matches 0 1 2 3 4 
Payout x; $0 $0 $1 $3 $120 
Probability p; 0.308 0.433 0.213 0.043 0.003 


a. Make a graph of the probability distribution. Describe what you 
see. 


b. Interpret the values of py and oy. 


c. Jerry places a single $5 bet on 4-Spot Keno. Find the expected 
value and the standard deviation of his winnings. 


d. Marla plays five games of 4-Spot Keno, betting $1 each time. 
Find the expected value and the standard deviation of her total 
winnings. 


R6.4 _») Applying torque A machine fastens plastic screw-on caps onto 


containers of motor oil. If the machine applies more torque than the 
cap can withstand, the cap will break. Both the torque applied and 
the strength of the caps vary. The capping-machine torque T follows 
a Normal distribution with mean 7 inch-pounds and standard 
deviation 0.9 inch-pound. The cap strength C (the torque that would 
break the cap) follows a Normal distribution with mean 10 inch- 
pounds and standard deviation 1.2 inch-pounds. 


a. Find the probability that a randomly selected cap has a strength 
greater than 11 inch-pounds. 


b. Explain why it is reasonable to assume that the cap strength and 
the torque applied by the machine are independent. 


c. Let the random variable D = C — T. Find its mean and standard 
deviation. 
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d. What is the probability that a randomly selected cap will break 
while being fastened by the machine? 


Exercises R6.5 and R6.6 refer to the following setting. 


According to Mars, Incorporated, 20.5% of the M&M’S® Milk Chocolate 
Candies made at its Cleveland factory are orange. Assume that the 
company’s claim is true. Suppose you take a random sample of 8 candies 
from a large bag of M&M’S. Let X = the number of orange candies you 
get. 


R6.5 |») Orange M&M’S 
a. Explain why it is reasonable to use the binomial distribution for 
probability calculations involving X. 
b. What’s the probability that you get 3 orange M&M’S? 
c. Calculate P(X = 4). Interpret this result. 


d. Suppose that you get 4 orange M&M’S in your sample. Does this 
result provide convincing evidence that Mars’s claim about its 
M&WM’S is false? Justify your answer. 


R6.6 _») Orange M&M’S 


a. Find and interpret the expected value of X. 
b. Find and interpret the standard deviation of X. 


R6.7 |) Sushi Roulette In the Japanese game show Sushi Roulette, the 


contestant spins a large wheel that’s divided into 12 equal sections. 
Nine of the sections show a sushi roll, and three have a “wasabi 
bomb.” When the wheel stops, the contestant must eat whatever 
food applies to that section. Then the game show host replaces the 
item of food on the wheel. To win the game, the contestant must eat 
one wasabi bomb. Find the probability that it takes 3 or fewer spins 
for the contestant to get a wasabi bomb. 


R6.8* |») Public transportation In a large city, 34% of residents use 


public transportation at least once per week. Suppose the city’s 
mayor selects a random sample of 200 residents. Let T = the number 


934 


who use public transportation at least once per week. 


a. 


What type of probability distribution does T have? Justify your 
answer. 


Explain why T can be approximated by a Normal distribution. 


Calculate the probability that at most 60 residents in the sample 
use public transportation at least once per week. 
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Chapter 6 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


Questions T6.1—T6.3 refer to the following setting. A psychologist 
studied the number of puzzles that subjects were able to solve in a 
5-minute period while listening to soothing music. Let X be the 
number of puzzles completed successfully by a randomly chosen 
subject. The psychologist found that X had the following 
probability distribution. 


Value 1 2 3 4 
Probability 0.2 0.4 0.3 0.1 


T6.1 What is the probability that a randomly chosen subject completes 
more than the expected number of puzzles in the 5-minute period 
while listening to soothing music? 


a. 0.1 
b. 0.4 
c. 0.8 
d. 1 


e. Cannot be determined 


T6.2 The standard deviation of X is 0.9. Which of the following is the 
best interpretation of this value? 


a. About 90% of subjects solved 3 or fewer puzzles. 


b. About 68% of subjects solved between 0.9 puzzles less and 0.9 
puzzles more than the mean. 


c. The typical subject solved an average of 0.9 puzzles. 


d. The number of puzzles solved by subjects typically differed 
from the mean by about 0.9 puzzles. 
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e. The number of puzzles solved by subjects typically differed 
from one another by about 0.9 puzzles. 


T6.3 Let D be the difference in the number of puzzles solved by two 
randomly selected subjects in a 5-minute period. What is the 
standard deviation of D? 


a. O 

b. 0.81 
c. 0.9 
d. 1.27 
e. 1.8 


T6.4 Suppose a student is randomly selected from your school. Which 
of the following pairs of random variables are most likely 
independent? 


a. X = student’s height; Y = student’s weight 
b. X = student’s IQ; Y= student’s GPA 


c. X =student’s PSAT Math score; Y = student’s PSAT Verbal 
score 


d. X = average amount of homework the student does per night; Y 
= student’s GPA 


e. X = average amount of homework the student does per night; Y 
= student’s height 


T6.5 A certain vending machine offers 20-ounce bottles of soda for 
$1.50. The number of bottles X bought from the machine on any 
day is a random variable with mean 50 and standard deviation 15. 
Let the random variable Y equal the total revenue from this 
machine on a randomly selected day. Assume that the machine 
works properly and that no sodas are stolen from the machine. 
What are the mean and standard deviation of Y? 


a. by = $1.50, oy = $22.50 
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uy = $1.50, dy = $33.75 
uy = $75, oy = $18.37 
Uy = $75, oy = $22.50 
Uy = $75, oy = $33.75 


T6.6 The weight of tomatoes chosen at random from a bin at the 
farmer’s market follows a Normal distribution with mean p = 10 
ounces and standard deviation o = 1 ounce. Suppose we pick four 
tomatoes at random from the bin and find their total weight T. The 
random variable T is 


a. 
b. 


Cc. 


a 


Normal, with mean 10 ounces and standard deviation 1 ounce. 
Normal, with mean 40 ounces and standard deviation 2 ounces. 
Normal, with mean 40 ounces and standard deviation 4 ounces. 


binomial, with mean 40 ounces and standard deviation 2 
ounces. 


binomial, with mean 40 ounces and standard deviation 4 
ounces. 


T6.7 Which of the following random variables is geometric? 


The number of times I have to roll a single die to get two 6s 


The number of cards I deal from a well-shuffled deck of 52 
cards to get a heart 


The number of digits I read in a randomly selected row of the 
random digits table to get a 7 


The number of 7s in a row of 40 random digits 


The number of 6s I get if I roll a die 10 times 


T6.8 Seventeen people have been exposed to a particular disease. Each 
one independently has a 40% chance of contracting the disease. A 
hospital has the capacity to handle 10 cases of the disease. What is 
the probability that the hospital’s capacity will be exceeded? 
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0.011 
0.035 
c. 0.092 
0.965 
e. 0.989 


- 


Pp 


T6.9 The figure shows the probability distribution of a discrete random 
variable X. Which of the following best describes this random 
variable? 


a. Binomial with n = 8, p = 0.1 
b. Binomial with n = 8, p = 0.3 
c. Binomial with n = 8, p = 0.8 


a 


Geometric with p = 0.1 


e. Geometric with p = 0.2 


T6.10 A test for extrasensory perception (ESP) involves asking a person 
to tell which of 5 shapes—a circle, star, triangle, diamond, or 
heart—appears on a hidden computer screen. On each trial, the 
computer is equally likely to select any of the 5 shapes. Suppose 
researchers are testing a person who does not have ESP and so is 
just guessing on each trial. What is the probability that the person 
guesses the first 4 shapes incorrectly but gets the fifth one 
correct? 


a. 1/5 


b. ashe) 
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C. Aone) (2) 
3) 
1 


d. ee | )-(8) (3) 


e. 4/5 


Section II: Free Response Show all your work. Indicate clearly the 
methods you use, because you will be graded on the correctness of your 
methods as well as on the accuracy and completeness of your results and 
explanations. 


T6.11 Let Y denote the number of broken eggs in a randomly selected 
carton of one dozen “store brand” eggs at a local supermarket. 
Suppose that the probability distribution of Y is as follows. 


Value y; 0 i 2 3 4 
Probability p; 0.78 0.11 0.07 0.03 0.01 


a. What is the probability that at least 10 eggs in a randomly 
selected carton are unbroken? 


b. Calculate and interpret py. 
c. Calculate and interpret oy. 


d. A quality control inspector at the store keeps looking at 
randomly selected cartons of eggs until he finds one with at 
least 2 broken eggs. Find the probability that this happens in 
one of the first three cartons he inspects. 


T6.12 Ladies Home Journal magazine reported that 66% of all dog 
owners greet their dog before greeting their spouse or children 
when they return home at the end of the workday. Assume that 
this claim is true. Suppose 12 dog owners are selected at random. 
Let X = the number of owners who greet their dogs first. 


a. Explain why it is reasonable to use the binomial distribution 
for probability calculations involving X. 
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b. 


Find the probability that exactly 6 owners in the sample greet 
their dogs first when returning home from work. 


In fact, only 4 of the owners in the sample greeted their dogs 
first. Does this give convincing evidence against the Ladies 
Home Journal claim? Calculate P(X < 4) and use the result to 
support your answer. 


T6.13 Ed and Adelaide attend the same high school but are in different 
math classes. The time E that it takes Ed to do his math 
homework follows a Normal distribution with mean 25 minutes 
and standard deviation 5 minutes. Adelaide’s math homework 
time A follows a Normal distribution with mean 50 minutes and 
standard deviation 10 minutes. Assume that E and A are 
independent random variables. 


a. 


Randomly select one math assignment of Ed’s and one math 
assignment of Adelaide’s. Let the random variable D be the 
difference in the amount of time each student spent on their 
assignments: D = A — E. Find the mean and the standard 
deviation of D. 


Find the probability that Ed spent longer on his assignment 
than Adelaide did on hers. 


T6.14 According to the Census Bureau, 13% of American adults (aged 
18 and over) are Hispanic. An opinion poll plans to contact an 
SRS of 1200 adults. 


a. 


What is the mean number of Hispanics in such samples? What 
is the standard deviation? 


Should we be suspicious if the sample selected for the opinion 
poll contains 10% or less Hispanic people? Calculate an 
appropriate probability to support your answer. 
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INTRODUCTION 


In this chapter, we will return to a key idea about statistical inference from 
Chapter 4—making conclusions about a population based on data from a 
sample. Here are a few examples of statistical inference in practice: 


e Each month, the Current Population Survey (CPS) interviews a random 
sample of individuals in about 60,000 U.S. households. The CPS uses 
the proportion of unemployed people in the sample p/? to estimate the 
national unemployment rate p.P: 


e To estimate how much gasoline prices vary in a large city, a reporter 
records the price per gallon of regular unleaded gasoline at a random 
sample of 10 gas stations in the city. The range (Maximum — Minimum) 
of the prices in the sample is 25 cents. What can the reporter say about 
the range of gas prices at all the city’s stations? 


e A battery manufacturer wants to make sure that the AA batteries it 
produces each hour meet certain standards. Quality control inspectors 
collect data from a random sample of 50 AA batteries produced during 
one hour and use the sample mean lifetime x = to estimate the 
unknown population mean lifetime p/ for all batteries produced that 
hour. 


Let’s look at the battery example a little more closely. To make an 
inference about the batteries produced in the given hour, we need to know 
how close the sample mean x @ is likely to be to the population mean p./ 
After all, different random samples of 50 batteries from the same hour of 
production would yield different values of x .%- How can we describe this 
sampling distribution of possible x © values? We can think of x # asa 
random variable because it takes numerical values that describe the 
outcomes of the random sampling process. As a result, we can examine its 
probability distribution using what we learned in Chapter 6. 


The following activity will help you get a feel for the distribution of 
two very common statistics, the sample mean x # and the sample 
proportion p/.?- 
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ACTIVITY A penny for your thoughts? 


In this activity, your class will investigate how the mean year x © and 
the proportion of pennies from the 2000s p/P? vary from sample to 
sample, using a large population of pennies of various ages.4 


peterspiro/Getty Images 


1. Each member of the class should randomly select 1 penny from the 
population and record the year of the penny with an “X” on the 
dotplot provided by your teacher. Return the penny to the population. 
Repeat this process until at least 100 pennies have been selected and 
recorded. This graph gives you an idea of what the population 
distribution of penny years looks like. 


2. Each member of the class should then select an SRS of 5 pennies 
from the population and note the year on each penny. 


e Record the average year of these 5 pennies (rounded to the nearest 
”“Z” on anew class dotplot. Make sure this dotplot 
is on the same scale as the dotplot in Step 1. 


year) with an “x 


e Record the proportion of pennies from the 2000s with a “par'p” ona 
different dotplot provided by your teacher. 
Return the pennies to the population. Repeat this process until there are 
at least 100 x ’s®’s and 100 p’’s.P’S: 


3. Repeat Step 2 with SRSs of size n=20.n = 20. Make sure these 
dotplots are on the same scale as the corresponding dotplots from 
Siew 2: 


4. Compare the distribution of X (year of penny) with the two 
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distributions of x Z (mean year). How are the distributions similar? 
How are they different? What effect does sample size seem to have 
on the shape, center, and variability of the distribution of x ?£? 


5. Compare the two distributions of p.?- How are the distributions 
similar? How are they different? What effect does sample size seem 
to have on the shape, center, and variability of the distribution of p/? 


7) 


Pp 


Sampling distributions are the foundation of inference when data are 
produced by random sampling. Because the results of random samples 
include an element of chance, we can’t guarantee that our inferences are 
correct. What we can guarantee is that our methods usually give correct 
answers. The reasoning of statistical inference rests on asking, “How often 
would this method give a correct answer if I used it many times?” If our 
data come from random sampling, the laws of probability help us answer 
this question. These laws also allow us to determine how far our estimates 
typically vary from the truth and what values of a statistic should be 
considered unusual. 


Section 7.1 presents the basic ideas of sampling distributions. The most 
common applications of statistical inference involve proportions and 
means. Section 7.2 focuses on sampling distributions of sample 
proportions. Section 7.3 investigates sampling distributions of sample 
means. 
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SECTION 7.1 What Is a Sampling Distribution? 


LEARNING TARGETS By the end of the section, you should be able to: 


Distinguish between a parameter and a statistic. 


Create a sampling distribution using all possible samples from a 
small population. 


Use the sampling distribution of a statistic to evaluate a claim about 
a parameter. 


Distinguish among the distribution of a population, the distribution 
of a sample, and the sampling distribution of a statistic. 


Determine if a statistic is an unbiased estimator of a population 
parameter. 


Describe the relationship between sample size and the variability of 
a Statistic. 


What is the average income of U.S. residents with a college degree? Each 
March, the government’s Current Population Survey (CPS) asks detailed 
questions about income. The random sample of about 70,000 U.S. college 
grads contacted in March 2016 had a mean “total money income” of 
$73,750 in 2015.2 That $73,750 describes the sample, but we use it to 
estimate the mean income of all college grads in the United States. 


Because of some very large incomes, the mean total income ($73,750) was larger than 


the median total income ($55,071). 


Parameters and Statistics 


As we begin to use sample data to draw conclusions about a larger 
population, we must be clear about whether a number describes a sample 
or a population. For the sample of college graduates contacted by the CPS, 
the mean income was x =$73,750.2 = $73, 750. The number $73,750 is a 
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statistic because it describes this one CPS sample. The population that the 
poll wants to draw conclusions about is the nearly 100 million U.S. 
residents with a college degree. In this case, the parameter of interest is 
the mean income uF of all these college graduates. We don’t know the 
value of this parameter, but we can estimate it using data from the sample. 


DEFINITION Statistic, Parameter 
A statistic is a number that describes some characteristic of a sample. 


A parameter is a number that describes some characteristic of the 
population. 


Remember s and p: statistics come from samples, and parameters 
come from populations. As long as we were doing data analysis, the 
distinction between population and sample rarely came up. Now that we 
are focusing on statistical inference, however, it is essential. The notation 
we use should reflect this distinction. The table shows three commonly 
used statistics and their corresponding parameters. 


It is common practice to use Greek letters for parameters and Roman letters for 


statistics. In that case, the population proportion would be 7 (pi, the Greek letter for 
“p”) and the sample proportion would be p. We’II stick with the notation that’s used 


on the AP® exam, however. 


Population 

Sample statistic parameter 
x X (the sample estimates yu (the population 
mean) mean) 
p\P (the sample estimates p (the population 
proportion) proportion) 
S, (the sample SD) estimates o? (the population 

SD) 


EXAMPLE |From ghosts to cold cabins > 


Parameters and statistics 
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PROBLEM: Identify the population, the parameter, the sample, and the 
Statistic in each of the following settings. 


a. The Gallup Poll asked 515 randomly selected U.S. adults if they 
believe in ghosts. Of the respondents, 160 said “Yes,” 


b. During the winter months, the temperatures outside the Starneses’ 
cabin in Colorado can stay well below freezing for weeks at a time. 
To prevent the pipes from freezing, Mrs. Starnes sets the thermostat 
at 50°F. She wants to know how low the temperature actually gets in 
the cabin. A digital thermometer records the indoor temperature at 20 
randomly chosen times during a given day. The minimum reading is 
38°F. 


SOLUTION: 


a. Population: all U.S. adults. Parameter: p = the proportion of all U.S. 
adults who believe in ghosts. Sample: the 515 people who were 
interviewed in this Gallup Poll. Statistic: pAP = the proportion in the 
sample who say they believe in ghosts = 160/515 = 0.31. 


b. Population: all times during the day in question. Parameter: the true 
minimum temperature in the cabin at all times that day. Sample: the 
20 temperature readings at randomly selected times. Statistic: the 
sample minimum temperature, 38°F. 


Not all parameters and statistics have their own symbols. To distinguish 


parameters and Statistics in these cases, use descriptors like “true” and 


“sample.” 
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FOR PRACTICE, TRY EXERCISE 1 


AP® EXAM TIP 


Many students lose credit on the AP® Statistics exam when defining 
parameters because their description refers to the sample instead of the 
population or because the description isn’t clear about which group of 
individuals the parameter is describing. When defining a parameter, we 
suggest including the word all or the word true in your description to 
make it clear that you aren’t referring to a sample statistic. 


The Idea of a Sampling Distribution 


The students in Mrs. Gallas’s class did the “Penny for your thoughts” 
activity at the beginning of the chapter. Figure 7.1 shows their “dotplot” of 
the sample mean year for 50 samples of size n=5.7 = 5. 


XX X 


1990 1995 42000 42005 2010 4201s 


x= sample mean year (n= 5) 


FIGURE 7.1 Distribution of the sample mean year of penny for 50 samples of size 


n=5 


from Mrs. Gallas’s population of pennies. 


It shouldn’t be surprising that the statistic x Z is a variable. After all, 
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different samples of n=5” = 5 pennies will produce different means. As 
you learned in Section 4.3, this basic fact is called sampling variability. 


DEFINITION Sampling variability 


Sampling variability refers to the fact that different random samples 
of the same size from the same population produce different values 
for a Statistic. 


Knowing how statistics vary from sample to sample is essential when 
making an inference about a population. Understanding sampling 
variability reminds us that the value of a statistic is unlikely to be exactly 
equal to the value of the parameter it is trying to estimate. It also lets us 
say how much we expect an estimate to vary from its corresponding 
parameter. 


Mrs. Gallas’s class took only 50 random samples of 5 pennies. 
However, there are many, many possible random samples of size 5 from 
Mrs. Gallas’s large population of pennies. If the students took every one of 
those possible samples, calculated the value of x ® for each, and graphed 
all those x Z values, then we’d have a sampling distribution. 


DEFINITION Sampling distribution 


The sampling distribution of a statistic is the distribution of values 
taken by the statistic in all possible samples of the same size from the 
same population. 


Remember that a distribution describes the possible values of a variable and how 


often these values occur. Thus, a sampling distribution shows the possible values of a 


statistic and how often these values occur. 


For large populations, it is too difficult to take all possible samples of 
size n to obtain the exact sampling distribution of a statistic. Instead, we 
can approximate a sampling distribution by taking many samples, 
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calculating the value of the statistic for each of these samples, and 
graphing the results. Because the students in Mrs. Gallas’s class didn’t 
take all possible samples of 5 pennies, their dotplot of x ’s%’s in Figure 
7.1 is called an approximate sampling distribution. 

The following example demonstrates how to construct a complete 
sampling distribution using a small population. 


EXAMPLE | Sampling heights (») 


Creating a sampling distribution 


Brian Miller 


PROBLEM: John and Carol have four grown sons. Their heights (in 
inches) are 71, 75, 72, and 68. 


a. List all 6 possible samples of size 2. 


b. Calculate the mean of each sample and display the sampling 
distribution of the sample mean using a dotplot. 


c. Calculate the range of each sample and display the sampling 
distribution of the sample range using a dotplot. 


SOLUTION: 
a. 
Sample 
Theo 
TAgT2 
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71, 68 
ee 12 
75, 68 
72, 68 


Sample Sample mean 


TIS 73.0 
TAL, VA 715 
71, 68 She Ae. 
Vise es [ene 
75, 68 Tas 
72, 68 70.0 


69 70 7 72 73 74 
x = Sample mean height (in.) 


Sample Sample range 


TNS TES: 4 
TNE if 
71, 68 3 
Tale 3 
75, 68 7 
72, 68 4 
s 4 4 S 
* &# © & © F € # & 
Sample range of height (in.) 


FOR PRACTICE, TRY EXERCISE 7 
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Being able to construct (or approximate) the sampling distribution of a 
Statistic allows us to determine the values of the statistic that are likely to 
occur by chance alone—and the values that should be considered unusual. 
The following example shows how we can use a sampling distribution to 
evaluate a claim. 


EXAMPLE | Reaching for chips (>) 


Using a sampling distribution to evaluate a claim 


woodygraphs/Getty 
Images 


PROBLEM: To determine how much homework time students will get 
in class, Mrs. Lin has a student select an SRS of 20 chips from a large 
bag. The number of red chips in the SRS determines the number of 
minutes in class students get to work on homework. Mrs. Lin claims that 
there are 200 chips in the bag and that 100 of them are red. When Jenna 
selected a random sample of 20 chips from the bag (without looking), 
she got 7 red chips. Does this provide convincing evidence that less than 
half of the chips in the bag are red? 


a. What is the evidence that less than half of the chips in the bag are 
red? 


b. Provide two explanations for the evidence described in part (a). 
We used technology to simulate choosing 500 SRSs of size n=20n = 20 
from a population of 200 chips, 100 red and 100 blue. The dotplot 


shows p/P = the sample proportion of red chips for each of the 500 
samples. 
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c. There is one dot on the graph at 0.80. Explain what this value 


represents. 


. Would it be surprising to get a sample proportion of pA=7/20=0.35 
p = 7/20 = 0.35 oF smaller in an SRS of size 20 when p=0.5? 


p = 0.5? Justify your answer. 


. Based on your previous answers, is there convincing evidence that 
less than half of the chips in the large bag are red? Explain your 


reasoning. 
| OR SAL ES 


Caen See oe oe ee T 
0.1 02 03 04 OS 06 0.7 08 0.9 


p = sample proportion of red chips 


SOLUTION: 


. Jenna’s sample proportion was p’? = 7/20 = 0.35, which is less than 
0.50. 


. It is possible that Mrs. Lin is telling the truth and Jenna got a pp 
less than 0.50 because of sampling variability. It is also possible that 
Mrs. Lin is lying and less than half of the chips in the bag are red. 


. In one simulated SRS of 20 chips, there were 16 red chips. So p\P = 
16/20 = 0.80 for this sample. 


. No; there were many simulated samples that had p\P values less 
than or equal to 0.35. 


. Because it isn’t surprising to get a p’? less than or equal to 0.35 by 
chance alone when p = 0.50, there isn’t convincing evidence that less 
than half of the chips in the bag are red. 


FOR PRACTICE, TRY EXERCISE 13 
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Suppose that Jenna’s sample included only 3 red chips, giving 
p\=3/20=0.15.P = 3/20 = 0.15. Would this provide convincing evidence 
that less than half of the chips in the bag are red? Yes. According to the 
simulated sampling distribution in the example, it would be very unusual 
to get a p\P value this small when p=0.50.? = 9-50. Therefore, sampling 
variability would not be a plausible explanation for the outcome of Jenna’s 
sample. The only plausible explanation for a p/P value of 0.15 is that less 
than half of the chips in the bag are red. 


Figure 7.2 illustrates the process of choosing many random samples of 
20 chips from a population of 100 red chips and 100 blue chips and finding 
the sample proportion of red chips p/? for each sample. Follow the flow 
of the figure from the population distribution on the left, to choosing an 
SRS, graphing the distribution of sample data, and finding the p/P for that 
particular sample, to collecting together the p\’sP’S from many samples. 
The first sample has p\=0.40.P = 9.40. The second sample is a different 
group of chips, with pA=0.55P = 0.55, and so on. 


Distributions of sample data 
14- 
12 + 
Population distribution eB 104 Approximate 
Parameter: p = 0.50 g 85 sampling distribution 
ses ¥ 6-4 p=2 040 
n=20 & 4-4 20 
24 
a 0 
Blue Red 
Color 
sy 
=] 
3 ul 
SRS a p=—=055 “~~ [ | 
n=20 & 20 ; t 
retro th eit el ito 


0.1 02 03 04 05 06 0.7 08 09 


0 
Blue Red 
Color 


Pp = sample proportion of red chips 


FIGURE 7.2 The idea of a sampling distribution is to take many samples from the 
same population, collect the value of the statistic from all the samples, and display 
the distribution of the statistic. The dotplot shows the approximate sampling 
distribution of p/ 
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= the sample proportion of red chips. 


The dotplot at the right of the figure shows the distribution of the values of 
p/P from 500 separate SRSs of size 20. This is the approximate sampling 
distribution of the statistic p.P: 


AP® EXAM TIP 


Terminology matters. Never just say “the distribution.” Always say “the 
distribution of [blank],” being careful to distinguish the distribution of 
the population, the distribution of sample data, and the sampling 
distribution of a statistic. Likewise, don’t use ambiguous terms like 
“sample distribution,” which could refer to the distribution of sample 
data or to the sampling distribution of a statistic. You will lose credit on 
free response questions for misusing statistical terms. 


As Figure 7.2 shows, there are three distinct distributions involved 
when we sample repeatedly and calculate the value of a statistic. 


e The population distribution gives the values of the variable for all 
individuals in the population. In this case, the individuals are the 200 
chips and the variable we’re recording is color. Our parameter of 


interest is the proportion of red chips in the population, p=0.50. 
p = 0.50. 


e The distribution of sample data shows the values of the variable for the 
individuals in a sample. In this case, the distribution of sample data 
shows the values of the variable color for the 20 chips in the sample. For 
each sample, we record a value for the statistic p\?, the sample 
proportion of red chips. 


e The sampling distribution of the sample proportion displays the values 
of p/P from all possible samples of the same size. 

Remember that a sampling distribution describes how a statistic (e.g., 
p\P ) varies in many samples from the population. However, the 
population distribution and the distribution of sample data describe how 
individuals (e.g., chips) vary. 
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CHECK YOUR UNDERSTANDING 


(\ 


y Mars,® Inc. says that the mix of colors in its M&M’S® Milk 
Chocolate Candies from its Hackettstown, NJ, factory is 25% blue, 25% 
orange, 12.5% green, 12.5% yellow, 12.5% red, and 12.5% brown. 
Assume that the company’s claim is true and that you will randomly 
select 50 candies to estimate the proportion that are orange. 


1. Identify the population, the parameter, the sample, and the statistic in 
this setting. 
2. Graph the population distribution. 


3. Imagine taking a random sample of 50 M&M’S Milk Chocolate 
Candies. Make a graph showing a possible distribution of the sample 
data. Give the value of the statistic for this sample. 


4. Which of these three graphs could be the approximate sampling 
distribution of the statistic? Explain your choice. 


12-4 

104 

oll 

64 

24 | if 
| 

al ott ai. 3 wtf Wie 

I GH T T 
5 


® 2 eo o > 2 
SFP Sr LS Se 0.05 0.15 035 0.45 0. 
ry Oe + 


Frequency 


T 
010 Os 0.20 0.25 


= sample proportion of orange candies p = sample proportion of orange candies 
Color 


Describing Sampling Distributions 


The fact that statistics from random samples have definite sampling 
distributions allows us to answer the question “How trustworthy is a 
Statistic as an estimate of a parameter?” To get a complete answer, we will 
consider the shape, center, and variability of the sampling distribution. For 
reasons that will be clear later, we’ll save shape for Sections 7.2 and 7.3. 


Here is an activity that gets you thinking about the center and 
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variability of a sampling distribution. 


ACTIVITY The craft stick problem 


In this activity, you will create a statistic for estimating the total number 
of craft sticks in a bag (N). The sticks are numbered 1, 2, 3, ..., N. Near 
the end of the activity, your teacher will select a random sample of n=7 
n = 7 sticks and read the number on each stick to the class. The team 
that has the best estimate for the total number of sticks will win a prize. 


Stephen Orsillo/Alamy 


1. Form teams of three or four students. As a team, spend about 10 
minutes brainstorming different ways to estimate the total number of 
sticks. Try to come up with at least three different statistics. 


2. Before your teacher provides the sample of sticks, use simulation to 
investigate the sampling distribution of each statistic. For the 
simulation, assume that there are N=100N = 100 sticks in the bag 
and that you will be selecting samples of size n=7.n = 7. 


e Using your TI-83/84 calculator, select an SRS of size 7 using the 
command RandIntNoRep(lower:1,upper:100,n:7). [With older OS, use 
the command RandInt(lower:1,upper:100,n:7) and verify that there are 
no repeated numbers. If there are repeats, press ENTER to get a new 
sample. | 


e For each sample, calculate the value of each of your three statistics. 


e Graph these values on a set of dotplots like those shown here. 
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e Perform as many trials of the simulation as possible. 


— 

& 

= 

5 
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x 

J 
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. 
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a aT Ts a | 
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Estimated total 


3. Based on the simulated sampling distributions, which of your 
Statistics is likely to produce the best estimate? Discuss as a team. 


4. Your teacher will now select a random sample of n=7n = 7 sticks 
from the bag. On a piece of paper, write the names of your group 
members, your group’s estimate for the number of sticks in the bag (a 
number), and the statistic you used to calculate your estimate (a 
formula). 


CENTER: BIASED AND UNBIASED ESTIMATORS Figure 7.3 
shows the simulated sampling distribution of p\? = proportion of red chips 
when selecting samples of size n=20n = 20 from a population where p = 


0.5. 
1 | 


Ic lc sahil a oh a a 
0.4 05 0.6 0.7 0.8 0.9 


I a 
0.1 0.2 0.3 


p = sample proportion of red chips 
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FIGURE 7.3 The distribution of p/ = the 
sample proportion of red chips in 500 SRSs of size n=20 


yee | 


from a population where p = 0.5. 


Notice that the center of this distribution is very close to 0.5, the 
parameter value. In fact, if we took all possible samples of 20 chips from 
the population, calculated p’? for each sample, and then found the mean 
of all those p/P -values, we’d get exactly 0.5. For this reason, we say that 
p/P is an unbiased estimator of p. 


DEFINITION Unbiased estimator 


A statistic used to estimate a parameter is an unbiased estimator if 
the mean of its sampling distribution is equal to the value of the 
parameter being estimated. 
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In a particular sample, the value of an unbiased estimator might be 
greater than the value of the parameter or it might be less than the value of 
the parameter. However, because the sampling distribution of the statistic 
is centered at the true value, the statistic will not consistently overestimate 
or consistently underestimate the parameter. This fits with our definition of 
bias from Chapter 4. The design of a statistical study shows bias if it is 
very likely to underestimate or very likely to overestimate the value we 
want to know. 


We will confirm mathematically in Section 7.2 that the sample 
proportion p/? is an unbiased estimator of the population proportion p. 
This is a very helpful result if we’re dealing with a categorical variable 
(like color). With quantitative variables, we might be interested in 
estimating the population mean, median, minimum, maximum, Q), Q3, 


variance, standard deviation, IQR, or range. Which (if any) of these are 
unbiased estimators? 


Let’s revisit the “Sampling heights” example with John and Carol’s 
four sons to investigate one of these statistics. Recall that the heights of the 
four sons are 71, 75, 72, and 68 inches. Here again is the sampling 
distribution of the sample mean x # for samples of size 2: 


. 
e ° e e e 
T T T T T T 


69 70 71 72 73 74 
xX = sample mean height (in.) 


To determine if the sample mean is an unbiased estimator of the 
population mean, we need to compare the mean of the sampling 
distribution to the value we are trying to estimate—the mean of the 
population p./ 

The mean of the sampling distribution of x 7 is 


ux =69.5+70+71.5+71.5+73+73.56=71.5 


69.5+70+71.5+71.5+73+73.5 
bg = 18 


The mean of the population is 
_ 71475472468 


p= = 71.5 
W=71+75+72+684=71.5 a 


Because these values are equal, this example suggests that the sample 
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mean x @ is an unbiased estimator of the population mean p./ We will 
confirm this fact in Section 7.3. 


EXAMPLE | Estimating the range (») 


Biased and unbiased estimators 


as 


eS 
eae 


ee . 


Brian Miller 


PROBLEM: In the “Sampling heights” example, we created the 
sampling distribution of the sample range for samples of size n=2n = 2 
from the population of John and Carol’s four sons with heights of 71, 
795, 72, and 68 inches tall. Is the sample range an unbiased estimator of 
the population range? Explain your answer. 


Sample range of height (in.) 


SOLUTION: 


The mean of the sampling distribution is 


1+3+3+4+4+7 


= 3.67 
14+3+3+4+4+76 = 3.67 6 


The range of the population is 
Population range = 75 — 68 = 7 
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Because the mean of the sampling distribution of the sample range 
(3.67) is not equal to the value it is trying to estimate (7), the sample 
range is not an unbiased estimator of the population range. 


FOR PRACTICE, TRY EXERCISE 19 


Because the sample range is consistently smaller than the population 
range, the sample range is a biased estimator of the population range. 


Think About It 


WHY DO WE DIVIDE BY n - 1 WHEN CALCULATING THE 
SAMPLE STANDARD DEVIATION? Now that we know about 
sampling distributions and unbiased estimators, we can finally answer 
this question. In Chapter 1, you learned that the formula for the sample 


fo e-2? 

standard deviation is sx=) (xi-x)2n-1." ~ V1" You also 
learned that the value obtained before taking the square root in the 
standard deviation calculation is known as the variance. That is, the 
sample variance is sx2=)' (xi-x )2n-1.°* n=l 

In an inference setting involving a quantitative variable, we might 
be interested in estimating the variance 020° of the population 
distribution. The most logical choice for our estimator is the sample 
variance sx2.8z- We used technology to select 500 SRSs of size n=4 
n = 4 from a population where the population variance is o2=9.0° = 9. 
For each sample, we recorded the value of two statistics: 


S-> (2-2)? 


a var (n — 1) =———__=> 8; 
var(n—-1)=¥ (xi-x )2n-1=sx2 oat! 


~ 


_ _ . var(n) = 
var(n)=> (xi-x )2n n 


Figure 7.4 shows the approximate sampling distributions of these 
two Statistics. The blue vertical lines mark the means of these two 
distributions. 
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FIGURE 7.4 Results from a simulation of 500 SRSs of size n=4 
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variance sx2 
(labeled “var(n—1)” in the figure) is an unbiased estimator. The “var(n)” statistic 
is a biased estimator. 


We can see that “var(n)” is a biased estimator of the population 
variance. The mean of its approximate sampling distribution (marked 
with a blue line segment) is clearly less than the value of the population 
parameter, o2=9.07 = 9. However, the statistic “var(n—1)” (otherwise 
known as the sample variance sx252) is an unbiased estimator. Its 
values are centered at 62=9.0° = 9. That’s why we divide by n - 1 
when calculating the sample variance—and when calculating the 
sample standard deviation. 


VARIABILITY: SMALLER IS BETTER! To get a trustworthy estimate 
of an unknown population parameter, start by using a statistic that’s an 
unbiased estimator. This ensures that you won’t consistently overestimate 
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or consistently underestimate the parameter. Unfortunately, using an 
unbiased estimator doesn’t guarantee that the value of your statistic will be 
close to the actual parameter value. 


To investigate the variability of a statistic, let’s consider the proportion 
of people in a random sample who have ever watched the show Survivor. 
According to Nielsen ratings, Survivor was one of the most-watched 
television shows in the United States every week that it aired. Suppose that 
the true proportion of U.S. adults who have ever watched Survivor is 
p=0.37.P = 0.37. 

The top dotplot in Figure 7.5 shows the results of drawing 400 SRSs of 
size n=100n = 100 from a large population with p=0.37P = 9-37 and 
recording the value of p’? = the sample proportion who have ever watched 
Survivor. We see that a sample of 100 people often gave a p? quite far 
from the population parameter. 

Let’s repeat our simulation, this time taking 400 SRSs of size n=1000 
n = 1000 from a large population with proportion p = 0.37 who have 
watched Survivor. The bottom dotplot in Figure 7.5 displays the 
distribution of the 400 values of p/P from these larger samples. Both 
graphs are drawn on the same horizontal scale to make comparison easy. 
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FIGURE 7.5 The approximate sampling distribution of the sample proportion 
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from SRSs of size n=100 


= 100 
= 1000 


chosen from a large population with proportion p = 0.37. Both dotplots show the 
results of 400 SRSs. 


and n=1000 


We can see that the variability shown in the top dotplot in Figure 7.5 is 
much greater than the variability shown in the bottom dotplot. With 
samples of size 100, the standard deviation of these p/P values is about 
0.047. Using SRSs of size 1000, the standard deviation of these p\P 
values is about 0.016. This confirms what we learned in Section 4.3: larger 
random samples tend to produce estimates that are closer to the true 
population value. 


One important and surprising fact is that the variability of a statistic 
does not depend very much on the size of the population, as long as the 
sample size is less than 10% of the population size. Suppose that in a small 
town of 25,000 people, 37% of the population have watched Survivor. 
That is, p = 0.37. Let’s simulate taking 400 SRSs of size 1000 from this 
small town and compute pip , the sample proportion who have watched 
Survivor. The results are shown in Figure 7.6. 
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p = sample proportion who have watched 
Survivor (n = 1000 from small town) 


FIGURE 7.6 The approximate sampling distribution of the sample proportion 


= 


p/ from 400 SRSs of size n=1000 


n = 1000 


chosen from a population of 25,000 
individuals with proportion p = 0.37. 


Notice that the distribution of p’? looks nearly the same when 
sampling from a small town of 25,000 residents as when sampling from 
the entire United States. In fact, the standard deviation for each sampling 
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distribution is approximately 0.016. 


Sergey Skleznev/Shutterstock 


Why does the size of the population have little influence on the 
behavior of statistics from random samples? Imagine sampling harvested 
corn by thrusting a scoop into a large sack of corn kernels. The scoop 
doesn’t know if it is surrounded by a bag of corn or by an entire truckload. 
As long as the corn is well mixed (so that the scoop selects a random 
sample), the variability of the result depends only on the size of the scoop. 


CHECK YOUR UNDERSTANDING 


The histogram on the left shows the interval (in minutes) between 
eruptions of the Old Faithful geyser for all 222 recorded eruptions 
during a particular month. For this population, the median is 75 minutes. 
We used technology to take 500 SRSs of size 10 from the population. 
The 500 values of the sample median are displayed in the histogram on 
the right. The mean of these 500 values is 73.5. 
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Interval between eruptions (min) Simulated sample median interval (min) 


1. Is the sample median an unbiased estimator of the population 
median? Justify your answer. 


2. Suppose we had taken samples of size 20 instead of size 10. Would 
the variability of the sampling distribution of the sample median be 
larger, smaller, or about the same? Justify your answer. 


3. Describe the shape of the sampling distribution of the sample median. 


CHOOSING AN ESTIMATOR In many cases, it is obvious which 
statistic should be used as an estimator of a population parameter. If we 
want to estimate the population mean pi, use the sample mean x .@. If we 
want to estimate a population proportion p, use the sample proportion p‘. 
p. However, in other cases, there isn’t an obvious best choice. When 
trying to estimate the population maximum in the “Craft stick” activity, 
there were many different estimators that could be used. 

To decide which estimator to use when there are several choices, 
consider both bias and variability. Imagine the true value of the population 
parameter as the bull’s-eye on a target and the sample statistic as an arrow 
fired at the target. Both bias and variability describe what happens when 
we take many shots at the target. 

Bias means that our aim is off and we consistently miss the bull’s-eye 
in the same direction. Our sample statistics do not center on the population 
parameter. In other words, our estimates are not accurate. High variability 
means that repeated shots are widely scattered on the target. Repeated 
samples do not give very similar results. In other words, our estimates are 
not very precise. Figure 7.7 shows this target illustration. 
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Estimator with high bias Estimator with low bias 
and low variability and high variability 


(a) (b) 


Estimator with high bias Estimator with no bias 
and high variability and low variability 


(c) (d) 
FIGURE 7.7 Bias and variability of four different estimators. (a) High bias, low 
variability. (b) Low bias, high variability. (c) High bias, high variability. (d) The 
ideal estimator: no bias, low variability. 


Notice that an estimator with low variability can also have high bias, as 
in Figure 7.7(a). And an estimator with low or no bias can be quite 
variable, as in Figure 7.7(b). Ideally, we’d like to use an estimator that is 
unbiased with low variability, as in Figure 7.7(d). 


AP® EXAM TIP 


Make sure to understand the difference between accuracy and precision 
when writing responses on the AP® Statistics Exam. Many students use 
“accurate” when they really mean “precise.” For example, a response 
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that says “increasing the sample size will make an estimate more 
accurate” is incorrect. It should say that increasing the sample size will 
make an estimate more precise. If you can’t remember which term to 
use, don’t use either of them. Instead, explain what you mean without 
using statistical vocabulary. 


Section 7.1 Summary 


e A parameter is a number that describes a population. To estimate an 
unknown parameter, use a statistic calculated from a sample. 


e The population distribution of a variable describes the values of the 
variable for all individuals in a population. The sampling distribution 
of a statistic describes the values of the statistic in all possible samples 
of the same size from the same population. Don’t confuse the sampling 
distribution with a distribution of sample data, which gives the values 
of the variable for all individuals in a particular sample. 


e A statistic can be an unbiased estimator or a biased estimator of a 
parameter. A statistic is an unbiased estimator if the center (mean) of its 
sampling distribution is equal to the true value of the parameter. 


e The variability of a statistic is described by the spread of its sampling 
distribution. Larger samples give less variability. 


e When trying to estimate a parameter, choose a Statistic with low or no 
bias and minimum variability. 


Section 7.1 Exercises 


For Exercises 1—6, identify the population, the parameter, the sample, and 
the statistic in each setting. 


1. pg 442 |») Healthy living From a large group of people who 


signed a card saying they intended to quit smoking, 1000 people were 
selected at random. It turned out that 210 (21%) of these individuals 
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had not smoked over the past 6 months. 


Unemployment Each month, the Current Population Survey 
interviews about 60,000 randomly selected U.S. adults. One of their 
goals is to estimate the national unemployment rate. In October 2016, 
4.9% of those interviewed were unemployed. 


Fillings How much do prices vary for filling a cavity? To find out, an 
insurance company randomly selects 10 dental practices in California 
and asks for the cash (non-insurance) price for this procedure at each 
practice. The interquartile range is $74. 


Warm turkey Tom is cooking a large turkey breast for a holiday 
meal. He wants to be sure that the turkey is safe to eat, which requires 
a minimum internal temperature of 165°F. Tom uses a thermometer 
to measure the temperature of the turkey meat at four randomly 
chosen points. The minimum reading is 170°F. 


Iced tea On Tuesday, the bottles of Arizona Iced Tea filled in a plant 
were supposed to contain an average of 20 ounces of iced tea. Quality 
control inspectors selected 50 bottles at random from the day’s 
production. These bottles contained an average of 19.6 ounces of iced 
tea. 


Bearings A production run of ball bearings is supposed to have a 
mean diameter of 2.5000 centimeters (cm). An inspector chooses 100 
bearings at random from the run. These bearings have mean diameter 
2.5009 cm. 


Exercises 7—10 refer to the small population of 5 students in the table. 


Name Gender Quiz score 
Abigail Female 10 
Bobby Male 5 
Carlos Male 10 
DeAnna Female 7 
Emily Female S| 


7. pg 444 |») Sample means List all 10 possible SRSs of size n=2, 


n= 2, calculate the mean quiz score for each sample, and display the 


o/7 


10. 


th. 


12. 


13. 


sampling distribution of the sample mean on a dotplot. 


Sample minimums List all 10 possible SRSs of size n=3," = 3: 
calculate the minimum quiz score for each sample, and display the 
sampling distribution of the sample minimum on a dotplot. 


Sample proportions List all 10 possible SRSs of size n=2," = 2; 
calculate the proportion of females for each sample, and display the 
sampling distribution of the sample proportion on a dotplot. 


Sample medians List all 10 possible SRSs of size n=3," = 3: 
calculate the median quiz score for each sample, and display the 
sampling distribution of the sample median on a dotplot. 


Doing homework A school newspaper article claims that 60% of the 
students at a large high school completed their assigned homework 
last week. Assume that this claim is true for the 2000 students at the 
school. 


a. Make a bar graph of the population distribution. 


b. Imagine one possible SRS of size 100 from this population. Sketch 
a bar graph of the distribution of sample data. 


Tall girls According to the National Center for Health Statistics, the 
distribution of height for 16-year-old females is modeled well by a 
Normal density curve with mean p= 64 inches and standard 
deviation o=2.5 inches.o = 2.5 inches. Assume this claim is true for 
the three hundred 16-year-old females at a large high school. 


a. Make a graph of the population distribution. 


b. Imagine one possible SRS of size 20 from this population. Sketch a 
dotplot of the distribution of sample data. 


pg 445 |p) More homework Some skeptical AP® Statistics 


students want to investigate the newspaper’s claim in Exercise 11, so 
they choose an SRS of 100 students from the school to interview. In 
their sample, 45 students completed their homework last week. Does 
this provide convincing evidence that less than 60% of all students at 
the school completed their assigned homework last week? 
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14. 


a. What is the evidence that less than 60% of all students completed 
their assigned homework last week? 


b. Provide two explanations for the evidence described in part (a). 


We used technology to simulate choosing 250 SRSs of size n=100 
n = 100 from a population of 2000 students where 60% completed 
their assigned homework last week. The dotplot shows p/? = the 
sample proportion of students who completed their assigned 
homework last week for each of the 250 simulated samples. 


c. There is one dot on the graph at 0.73. Explain what this value 
represents. 


d. Would it be surprising to get a sample proportion of p4=0.45 
P = 0.45 or smaller in an SRS of size 100 when p=0.60P = 9.60? 
Justify your answer. 


e. Based on your previous answers, is there convincing evidence that 
less than 60% of all students at the school completed their assigned 
homework last week? Explain your reasoning. 


0.45 0.50 0.55 0.60 0.65 0.70 0.75 


p = simulated sample proportion who completed 
their assigned homework last week 


Tall girls? To see if the claim made in Exercise 12 is true at their 
high school, an AP® Statistics class chooses an SRS of twenty 16- 
year-old females at the school and measures their heights. In their 
sample, the mean height is 64.7 inches. Does this provide convincing 
evidence that 16-year-old females at this school are taller than 64 
inches, on average? 


a. What is the evidence that the average height of all 16-year-old 
females at this school is greater than 64 inches, on average? 
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15. 


16. 


b. Provide two explanations for the evidence described in part (a). 


We used technology to simulate choosing 250 SRSs of size n=20 

n = 20 from a population of three hundred 16-year-old females 
whose heights follow a Normal distribution with mean p=64! = 64 
inches and standard deviation p=2.54 — 2.5 inches. The dotplot 
shows x £= the sample mean height for each of the 250 simulated 
samples. 


c. There is one dot on the graph at 62.5. Explain what this value 
represents. 


d. Would it be surprising to get a sample mean of x =64.7£ = 64.7 or 
larger in an SRS of size 20 when p=64/ = 4 inches and o=2.5 
o = 2.5 inches? Justify your answer. 


e. Based on your previous answers, is there convincing evidence that 
the average height of all 16-year-old females at this school is 
greater than 64 inches? Explain your reasoning. 


eeeeeeceeeeeoeeeeoe 


T T T T T 
62.0 625 63.0 635 640 645 65.0 65.5 66.0 


¥ = simulated sample mean height (in.) 
Even more homework Refer to Exercises 11 and 13. Suppose that 
the sample proportion of students who did all their assigned 
homework last week is pA=57/ 100=0.57.P = 57/100 = 0.57. would 
this sample proportion provide convincing evidence that less than 
60% of all students at the school completed all their assigned 
homework last week? Explain your reasoning. 


Even more tall girls Refer to Exercises 12 and 14. Suppose that the 
sample mean height of the twenty 16-year-old females is x #= 65.8 
inches. Would this sample mean provide convincing evidence that the 


980 


average height of all 16-year-old females at this school is greater than 
64 inches? Explain your reasoning. 


Exercises 17 and 18 refer to the following setting. During the winter 
months, outside temperatures at the Starneses’ cabin in Colorado can stay 
well below freezing (32°F, or 0°C) for weeks at a time. To prevent the 
pipes from freezing, Mrs. Starnes sets the thermostat at 50°F. The 
manufacturer claims that the thermostat allows variation in home 
temperature that follows a Normal distribution with o = 3°F. To test this 
claim, Mrs. Starnes programs her digital thermometer to take an SRS of 
n=10n = 10 readings during a 24-hour period. Suppose the thermostat is 
working properly and that the actual temperatures in the cabin vary 
according to a Normal distribution with mean p = 50°F and standard 
deviation o = 3°F. 


17. Cold cabin? The dotplot shows the results of taking 300 SRSs of 10 
temperature readings from a Normal population with p“= 50 and og= 
3 and recording the sample standard deviation sx** each time. 
Suppose that the standard deviation from an actual sample is sx=5°F. 
8, = 5 F. What would you conclude about the thermostat 
manufacturer’s claim? Explain your reasoning. 


T I T T T T T T T 
1.0 L5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 


S, = simulated sample standard deviation 
of temperature (°F) 


18. Really cold cabin The dotplot shows the results of taking 300 SRSs 
of 10 temperature readings from a Normal population with p“= 50 
and o@= 3 and recording the sample minimum each time. Suppose 
that the minimum of an actual sample is 40°F. What would you 
conclude about the thermostat manufacturer’s claim? Explain your 
reasoning. 
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38 40 42 44 46 48 50 
Simulated sample minimum temperature (°F) 


Exercises 19-22 refer to the small population of 4 cars listed in the table. 


19. 


20. 


21. 


22% 


Color Age (years) 
Red 1 
White 5 
Silver 8 
Red 20 


pg 449 |») Sample proportions List all 6 possible SRSs of size 


n=2," = 2; calculate the proportion of red cars in the sample, and 
display the sampling distribution of the sample proportion on a 
dotplot. Is the sample proportion an unbiased estimator of the 
population proportion? Explain your answer. 


Sample minimums List all 6 possible SRSs of size n=2,” = 2: 


calculate the minimum age for each sample, and display the sampling 
distribution of the sample minimum on a dotplot. Is the sample 
minimum an unbiased estimator of the population minimum? Explain 
your answer. 


More sample proportions List all 4 possible SRSs of size n=3, 

n = 3, calculate the proportion of red cars in the sample, and display 
the sampling distribution of the sample proportion on a dotplot with 
the same scale as the dotplot in Exercise 19. How does the variability 
of this sampling distribution compare with the variability of the 
sampling distribution from Exercise 19? What does this indicate 
about increasing the sample size? 


More sample minimums List all 4 possible SRSs of size n=3," = 3; 
calculate the minimum age for each sample, and display the sampling 
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23. 


24. 


25. 


distribution of the sample minimum on a dotplot with the same scale 
as the dotplot in Exercise 20. How does the variability of this 
sampling distribution compare with the variability of the sampling 
distribution from Exercise 20? What does this indicate about 
increasing the sample size? 


A sample of teens A study of the health of teenagers plans to 
measure the blood cholesterol levels of an SRS of 13- to 16-year-olds. 
The researchers will report the mean x £ from their sample as an 
estimate of the mean cholesterol level / in this population. Explain to 
someone who knows little about statistics what it means to say that x — 
& is an unbiased estimator of p. 


Predict the election A polling organization plans to ask a random 
sample of likely voters who they plan to vote for in an upcoming 
election. The researchers will report the sample proportion p/P that 
favors the incumbent as an estimate of the population proportion p 
that favors the incumbent. Explain to someone who knows little about 
Statistics what it means to say that pP is an unbiased estimator of p. 


Bias and variability The figure shows approximate sampling 
distributions of 4 different statistics intended to estimate the same 
parameter. 
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Population parameter 
a. Which statistics are unbiased estimators? Justify your answer. 


b. Which statistic does the best job of estimating the parameter? 
Explain your answer. 


Multiple Choice Select the best answer for Exercises 26—30. 


26. Ata particular college, 78% of all students are receiving some kind 
of financial aid. The school newspaper selects a random sample of 
100 students and 72% of the respondents say they are receiving 
some sort of financial aid. Which of the following is true? 


a. 78% is a population and 72% is a sample. 
b. 72% is a population and 78% is a sample. 
c. 78% is a parameter and 72% is a Statistic. 
d. 72% is a parameter and 78% is a statistic. 


e. 72% is a parameter and 100 is a statistic. 


27. A Statistic is an unbiased estimator of a parameter when 


984 


28. 


ou. 


30. 


a. the statistic is calculated from a random sample. 


b. ina single sample, the value of the statistic is equal to the value 
of the parameter. 


c. in many samples, the values of the statistic are very close to the 
value of the parameter. 


d. in many samples, the values of the statistic are centered at the 
value of the parameter. 


e. in many samples, the distribution of the statistic has a shape that 
is approximately Normal. 


In a residential neighborhood, the median value of a house is 
$200,000. For which of the following sample sizes is the sample 
median most likely to be above $250,000? 


a. n=10n = 10 

b. n=50n = 50 

c. n=100n = 100 
d. n=1000n = 1000 


e. Impossible to determine without more information. 


Increasing the sample size of an opinion poll will reduce the 
a. bias of the estimates made from the data collected in the poll. 


b. variability of the estimates made from the data collected in the 
poll. 


c. effect of nonresponse on the poll. 
d. variability of opinions in the sample. 


e. variability of opinions in the population. 


The math department at a small school has 5 teachers. The ages of 
these teachers are 23, 34, 37, 42, and 58. Suppose you select a 
random sample of 4 teachers and calculate the sample minimum 
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age. Which of the following shows the sampling distribution of the 
sample minimum age? 


20 30 40) 50 60 


(d) e ee @ o 


T | T T T 
20 30 40 50 60 


(e) None of these. 


Recycle and Review 


31. 


32. 


Dem bones (2.2) Osteoporosis is a condition in which the bones 
become brittle due to loss of minerals. To diagnose osteoporosis, an 
elaborate apparatus measures bone mineral density (BMD). BMD is 
usually reported in standardized form. The standardization is based on 
a population of healthy young adults. The World Health Organization 
(WHO) criterion for osteoporosis is a BMD score that is 2.5 standard 
deviations below the mean for young adults. BMD measurements in a 
population of people similar in age and gender roughly follow a 
Normal distribution. 


a. What percent of healthy young adults have osteoporosis by the 
WHO criterion? 


b. Women aged 70 to 79 are, of course, not young adults. The mean 


BMD in this age group is about —2 on the standard scale for young 
adults. Suppose that the standard deviation is the same as for 
young adults. What percent of this older population has 
osteoporosis? 


Squirrels and their food supply (3.2) Animal species produce more 
offspring when their supply of food goes up. Some animals appear 
able to anticipate unusual food abundance. Red squirrels eat seeds 
from pinecones, a food source that sometimes has very large crops. 
Researchers collected data on an index of the abundance of pinecones 
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and the average number of offspring per female over 16 years.4 


Computer output from a least-squares regression on these data and a 
residual plot are shown here. 


Predictor Coef SE Coef T P 
Constant 1.4146 0.2517 5.62 0.000 
Cone index 0.4399 0.1016 4.33 0.001 
S = 0.600309 R-Sq = 57.2% R-Sq(adj) = 54.2% 
1.5 
1.0 
mq 05 
co ° 
-0.5 
-1.0 - 
0 i 2 3 4 5 
Cone index 


a. Is a linear model appropriate for these data? Explain. 


b. Give the equation for the least-squares regression line. Define any 
variables you use. 


c. Interpret the values of r2 and s in context. 
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SECTION 7.2 Sample Proportions 


LEARNING TARGETS By the end of the section, you should be able to: 


e Calculate the mean and standard deviation of the sampling 
distribution of a sample proportion p/? and interpret the standard 
deviation. 


e Determine if the sampling distribution of pp is approximately 
Normal. 


e If appropriate, use a Normal distribution to calculate probabilities 
involving p/.P- 


What proportion of U.S. teens know that 1492 was the year in which 
Columbus “discovered” America? A Gallup Poll found that 210 out of a 
random sample of 501 American teens aged 13 to 17 knew this historically 
important date.2 The sample proportion 

210 | 


p=, 


0.42 
p\=210501=0.42 501 


is the statistic that we use to gain information about the unknown 
population proportion p. Because another random sample of 501 teens 
would likely result in a different estimate, we can only say that “about” 
42% of all U.S. teenagers know that Columbus discovered America in 
1492. In this section, we’ll use sampling distributions to clarify what 
“about” means. 
AS FAR AS I KNowW THE WORLD 
_ THERE ISN'T A. THERE ISN'T A So WHY IS HAS MANY 
CLEVELAND DAY’... “CINCINNATI DAY”... THERE A MYSTERIES. 


THERE ISN’T AN 
“AKRON DAY"... 


FOXTROT©2005 Bill Amend. Andrews McMeel Syndicate. 
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The Sampling Distribution of p4? 


When Mrs. Gallas’s class did the “Penny for your thoughts” activity at the 
beginning of the chapter, her students produced the “dotplot” in Figure 7.8, 


which approximates the sampling distribution of the sample proportion 
of pennies from the 2000s for samples of size n=20.n = 20. 


p 
P P 
A A 
P P 
A A A 
PP P 
P PP P 
on a So a” a | 
P PPP PP 
~ a + A 2 & 
P P PPP P 
oe oo a” oe oo 
P PPP PP P 
P P PPP PP P 
P P PP PPP PP P P 


0.3 0.4 0.5 0.6 07 0.8 0.9 1.0 
p = Sample proportion of pennies from 20005 (n = 20) 


FIGURE 7.8 Approximate sampling distribution of the sample proportion of 
pennies from the 2000s in 50 samples of size n=20 


from a population of pennies. 


DEFINITION Sampling distribution of the sample 
proportion 


The sampling distribution of the sample proportion p/? describes 
the distribution of values taken by the sample proportion p/P in all 
possible samples of the same size from the same population. 


This distribution is roughly symmetric with a mean of about 0.65 and a 


989 


standard deviation of about 0.10. By the end of this section, you should be 
able to anticipate the shape, center, and variability of distributions like this 
one without getting your hands dirty in a jar of pennies. 


ACTIVITY The candy machine 


Imagine a very large candy machine filled with orange, brown, and 
yellow candies. When you insert money, the machine dispenses a 
sample of candies. In this activity, you will use an applet to investigate 
the shape, center, and variability of the sampling distribution of the 
sample proportion p/.P: 


Reese’s Pieces® 


Probability of orange |0.5 
Number ofcandies 25 | 
Number ofsamples 1 

¥] Animate 


Draw Samples 
Total = 1 


O Number of orange 

@ Proportion of orange 
Most recent p= 0.360 

As extreme as | 2 | | Count 


¥| Summary Stats 


1. Launch the Reese’s Pieces® applet at 
www.rossmanchance.com/applets. Make sure the Probability of 
orange = 0.5, Number of candies (sample size) = 25, and Number of 
samples = 1. Choose “Proportion of orange” as the statistic to be 
calculated and check the box for Summary stats to be calculated, as 
shown in the screen shot for Step 2. 


2. Click on the “Draw Samples” button. An animated simple random 
sample of n=25n = 25 candies should be dispensed. The following 
screen shot shows the results of one such sample where the sample 
proportion of orange candies was p=0.360.P = 0.360. How far was 
your sample proportion of orange candies from the actual population 
proportion, p = 0.50? 


3. Click “Draw Samples” 9 more times so that you have a total of 10 
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sample proportions. Look at the dotplot of your p/P values. What is 
the mean of your 10 sample proportions? What is their standard 
deviation? 


4. To take many more samples quickly, enter 990 in the “Number of 
samples” box. Click on the “Animate” box to turn the animation off. 
Then click “Draw Samples.” You have now taken a total of 1000 
samples of 25 candies from the machine. Describe the shape, center, 
and variability of the approximate sampling distribution of p\P 
shown in the dotplot. 


5. How does the sampling distribution of p/\? change if the proportion 
of orange candies in the machine is different from p = 0.5? Use the 
applet to investigate this question. Then write a brief summary of 
what you learned. 


6. How does the sampling distribution of pip change if the machine 
dispenses a different number of candies? Use the applet to investigate 
this question. Then write a brief summary of what you learned. 


7. For what combinations of n and p is the sampling distribution of p/P? 
approximately Normal? Use the applet to investigate this question. 
Then write a brief summary of what you learned. 


The graphs in Figure 7.9 show approximate sampling distributions of 
p’? for different combinations of p and n. 
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Mean = 0.100 
SD = 0.030 
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p = sample proportion of orange candies p = sample proportion of orange candies p = sample proportion of orange candies 
(p = 0.10, n = 100) (p = 0.50, n = 100) (p = 0.90, n = 100) 


FIGURE 7.9 Approximate sampling distributions of p/ 
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= sample proportion of orange candies for different combinations of p (population 
proportion) and n (sample size). 


What do these graphs teach us about the sampling distribution of p/? 
p? 
Shape: When n is small and p is close to 0, the sampling distribution of p/ 
P is skewed to the right. When n is small and p is close to 1, the sampling 
distribution of p/P is skewed to the left. Finally, the sampling distribution 
of p\P becomes more Normal when p is closer to 0.5 or nis larger (or 
both). 
Center: The mean of the sampling distribution of p/? is equal to the 
population proportion p. This makes sense because the sample proportion 
pP is an unbiased estimator of p. 
Variability: The value of op)?» depends on both n and p. For a specific 
sample size, the standard deviation op’? is larger for values of p close to 
0.5 and smaller for values of p close to 0 or 1. For a specific value of p, the 
standard deviation op’» gets smaller as n gets larger. Specifically, 
multiplying the sample size by 4 cuts the standard deviation in half. 

Here’s a summary of the important facts about the sampling 
distribution of p’.?- 


SAMPLING DISTRIBUTION OF A SAMPLE 
PROPORTION p/P 


Choose an SRS of size n from a population of size N with proportion p 
of successes. Let p/? be the sample proportion of successes. Then: 
e The mean of the sampling distribution of p/P? is ppA=p."» ~ P. 
¢ The standard deviation of the sampling distribution of p/? is 
| p(l—p 
0. = } 
op\=p(i-pyn? Von 


as long as the 10% condition is satisfied: n<0.10N.n < 0.10N. The 
value op’? measures the typical distance between a sample 
proportion p/P? and the population proportion p. 


e The sampling distribution of pAP is approximately Normal as long 
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as the Large Counts condition is satisfied: np>10"P = 10 and 
n(1-p)>10." (1 — p) 2 10. 


The two conditions mentioned in the preceding box are very important. 


e Large Counts condition: If we assume that the sampling distribution of 
p/P is approximately Normal when it isn’t, any calculations we make 
using a Normal distribution will be flawed. 


e 10% condition: When we’re sampling without replacement from a 
(finite) population, the observations are not independent, because 
knowing the outcome of one trial helps us predict the outcome of future 
trials. But the standard deviation formula assumes that the observations 
are independent. If we sample too large a fraction of the population, our 
calculated value of op? will be too large. 


We call it the “Large Counts” condition because np is the expected count of successes 


in the sample and n (1 — p) is the expected count of failures in the sample. 


Because larger random samples give better information, it sometimes 
makes sense to sample more than 10% of a population. In such a case, 
there’s a more accurate formula for calculating the standard deviation op’. 
»* Tt uses something called a finite population correction (FPC). We’|l 
avoid situations that require the FPC in this text. 


EXAMPLE | Backing the pack (») 
The sampling distribution of p/P? 
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bo1982/Getty Images 


PROBLEM: Suppose that 84% of students at a large high school 
regularly use a backpack to carry their books from class to class. 
Imagine taking an SRS of 100 students and calculating p/P? = the 
proportion of students in the sample who regularly use a backpack. 


a. Identify the mean of the sampling distribution of p/.P- 


b. Calculate and interpret the standard deviation of the sampling 
distribution of p.?- Verify that the 10% condition is met. 


c. Describe the shape of the sampling distribution of p’.P- Justify your 
answer. 


SOLUTION: 


a. ppr= 0.844» ~ aii 


b. Assuming that n = 100 students is less than 10% of students in a 
large high school, 


0.84(1—0.84 
= 4] ———_———— = 0.0367 


O- 
op’ = 0.84(1 — 0.84)100=0.0367 ” 100 


996 
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In SRSs of size 100, the sample proportion of students who regularly 
use a backpack will typically vary by about 0.0367 from the true 
proportion of p = 0.84. 


When <0.10N, < 9-101, 


—— / p(l—p) 
op\=p(i-pn PV ® 


Because 100(0.84)=84210100 (0.84) = 84 > 10 and 100(1-0.84)=16>10, 
100(1 — 0.84) = 16 > 10, the sampling distribution of p/P is 


approximately Normal. 


When np>10"? = 10 and n (1 — p) == 10, the sampling distribution 
of p\P is approximately Normal. 


FOR PRACTICE, TRY EXERCISE 33 


Think About It 


HOW IS THE SAMPLING DISTRIBUTION OF p\P RELATED 
TO THE BINOMIAL COUNT X? From Chapter 6, we know that the 
mean and standard deviation of a binomial random variable X are 


uX=np and oX=np(1-p)/* = Tp and ox = \/np (1 p) 
The sample proportion of successes is closely related to X: 
p/=count of successes in samplesample size=Xn 


. count of successesinsample X 


sample size n 


Because p\=X/n=(1/n)X,P = */" = (1/n) X, we're just multiplying 
the random variable X by a constant (1/n) to get the random variable p‘. 
P- Recall from Chapter 6 that multiplying a random variable by a 
constant multiplies both the mean and the standard deviation by that 
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constant. We have 


1 1 
ae a Sasi (np) = p 


upA=1n pX=1n (np)=p'” “Tn v 
op/=1n oX=1nnp(1-p)=np(1-p)n2=p(1—-p)n 
1 toe... . | p(1—p) 
05 == ox =— VP-P) = f= 


What about shape? Multiplying a random variable by a positive 
constant doesn’t change the shape of the probability distribution. So the 
sampling distribution of p/P will have the same shape as the 
distribution of the binomial random variable X. If you studied the 
optional material in Chapter 6 about the Normal approximation to a 
binomial distribution, then you already know that a Normal distribution 
can be used to approximate the sampling distribution of p\? whenever 
both np and n(1 — p) are at least 10. 


CHECK YOUR UNDERSTANDING 


( 


y Suppose that 75% of young adult Internet users (ages 18 to 29) 

watch online videos. A polling organization contacts an SRS of 1000 

young adult Internet users and calculates the proportion p/? in this 

sample who watch online videos. 

1. Identify the mean of the sampling distribution of p.?- 

2. Calculate and interpret the standard deviation of the sampling 
distribution of p4.?- Check that the 10% condition is met. 

3. Is the sampling distribution of pip approximately Normal? Check 
that the Large Counts condition is met. 


4. If the sample size were 9000 rather than 1000, how would this change 
the sampling distribution of p\2?” 


Using the Normal Approximation for p/? 


998 


Inference about a population proportion p is based on the sampling 
distribution of p’.P: When the sample size is large enough for np and n(1 
— p) to both be at least 10 (the Large Counts condition), the sampling 
distribution of p/P is approximately Normal. In that case, we can use a 
Normal distribution to estimate the probability of obtaining an SRS in 
which p/P lies in a specified interval of values. Here is an example. 


EXAMPLE | Going to college (») 


Normal calculations involving p/P 


lvan Vukovic/Shutterstock 


PROBLEM: A polling organization asks an SRS of 1500 first-year 
college students how far away their home is. Suppose that 35% of all 
first-year students attend college within 50 miles of home. Find the 
probability that the random sample of 1500 students will give a result 
within 2 percentage points of the true value. 


SOLUTION: 
up’ = 0.35/47 = 0.35 
Assuming that 1500 < 10% of all first-year college students, op/ = 


0.35) (0.65 
oO; = pe) = 0.0123 
(0.35)(0.65)1500 = 0.0123 ” 1500 
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Calculate the mean and standard deviation of the sampling distribution of 
p.P- 


upA=p'p ~ P 


pe a f PP) 
op\=p(i-pn? Von 


Because np = 1500(0.35) = 525 = 10 and n(1 — p) = 1500(0.65) = 975 = 
10, the distribution of p/? is approximately Normal. 


Justify that the distribution of p\P is approximately Normal using the 


Large Counts condition. 


t 1 
0.3131 0.3254) 0.337% 035 0.3623 / 0.3746 0.3869 
A 
p= 0.33 p= 0.37 
A 


P= sample proportion within 50 miles 
0.33—0,35 


(i) z=0.33 — 0.350.0123=-1.637 — 00123 ~ —1-°8 ang 7 =0,37- 


, — 0.37-0.35 _ 
0350,0123=1.63" — _oo1s — 1-68 


Using Table A: P(0.33<p/<0.37)=P(-1.63<z<1.63)=0.9484—0.0516=0.8968 
P(0.33 < p < 0.37) = P(—1.63 < z < 1.63) = 0.9484 — 0.0516 = 0.8968 
Using technology: normalcdf (lower:—1.63, upper:1.63, mean:0, SD:1) 
= 0.8969 

(ii) normalcdf (lower:0.33, upper:0.37, mean:0.35, SD:0.0123) = 0.8961 


1. Draw a Normal distribution. 


2. Perform calculations. 

i. Standardize and use Table A or technology or 
ii. Use technology without standardizing. 

Be sure to answer the question that was asked. 
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FOR PRACTICE, TRY EXERCISE 43 


In the preceding example, about 90% of all SRSs of size 1500 from 
this population will give a result within 2 percentage points of the truth 
about the population. This result also suggests that in about 90% of all 
SRSs of size 1500 from this population, the true proportion will be within 
2 percentage points of the sample proportion. This fact will become very 
important in Chapter 8 when we use sample data to create an interval of 
plausible values for a population parameter. 


Section 7.2 Summary 


e When we want information about the population proportion p of 
successes, we often take an SRS and use the sample proportion p/? to 
estimate the unknown parameter p. The sampling distribution of the 
sample proportion pp describes how the statistic pp varies in all 
possible samples of the same size from the population. 

e The mean of the sampling distribution of p/? is pp/=p."» ~ ” So pAP 
is an unbiased estimator of p. 


e The standard deviation of the sampling distribution of p\P is 


op\=p(1-p)/n7? a id for an SRS of size n. This formula can 
be used if the sample size is less than 10% of the population size (the 
10% condition). 


e The sampling distribution of p/? is approximately Normal when both 
np = 10 and n(1 — p) = 10 (the Large Counts condition). 


Section 7.2 Exercises 


33. pg 462 |p) Registered voters In a congressional district, 55% of 


registered voters are Democrats. A polling organization selects a 
random sample of 500 registered voters from this district. Let pA? = 
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34. 


35. 


36. 


the proportion of Democrats in the sample. 


a. Identify the mean of the sampling distribution of pr.P: 


b. Calculate and interpret the standard deviation of the sampling 


distribution of p/.P- Verify that the 10% condition is met. 


c. Describe the shape of the sampling distribution of p/.?- Justify 
your answer. 


Married with children According to a recent U.S. Bureau of Labor 
Statistics report, the proportion of married couples with children in 
which both parents work outside the home is 59%.© You select an 
SRS of 50 married couples with children and let p\? = the sample 
proportion of couples in which both parents work outside the home. 


a. Identify the mean of the sampling distribution of pr.P: 


b. Calculate and interpret the standard deviation of the sampling 
distribution of p4.?: Verify that the 10% condition is met. 


c. Describe the shape of the sampling distribution of p/.?: Justify 
your answer. 


Orange Skittles® The makers of Skittles claim that 20% of Skittles 
candies are orange. Suppose this claim is true. You select a random 
sample of 30 Skittles from a large bag. Let p/P = the proportion of 
orange Skittles in the sample. 


a. Identify the mean of the sampling distribution of p.?- 


b. Calculate and interpret the standard deviation of the sampling 
distribution of p’.?- Verify that the 10% condition is met. 


c. Describe the shape of the sampling distribution of p/.?- Justify 
your answer. 


Male workers A factory employs 3000 unionized workers, 90% of 
whom are male. A random sample of 15 workers is selected for a 
survey about worker satisfaction. Let p/? = the proportion of males in 
the sample. 
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a. Identify the mean of the sampling distribution of p.?- 


b. Calculate and interpret the standard deviation of the sampling 
distribution of p4.?: Verify that the 10% condition is met. 


c. Describe the shape of the sampling distribution of p/.P- Justify 
your answer. 


37. More Skittles® What sample size would be required to reduce the 
standard deviation of the sampling distribution to one-half the value 
you found in Exercise 35(b)? Justify your answer. 


38. More workers What sample size would be required to reduce the 
standard deviation of the sampling distribution to one-third the value 
you found in Exercise 36(b)? Justify your answer. 


39. Airport security The Transportation Security Administration (TSA) 
is responsible for airport safety. On some flights, TSA officers 
randomly select passengers for an extra security check before 
boarding. One such flight had 76 passengers—12 in first class and 64 
in coach class. TSA officers selected an SRS of 10 passengers for 
screening. Let p/P be the proportion of first-class passengers in the 
sample. 


a. Is the 10% condition met in this case? Justify your answer. 


b. Is the Large Counts condition met in this case? Justify your 
answer. 


40. Scrabble® In the game of Scrabble, each player begins by drawing 7 
tiles from a bag containing 100 tiles. There are 42 vowels, 56 
consonants, and 2 blank tiles in the bag. Cait chooses an SRS of 7 
tiles. Let pip be the proportion of vowels in her sample. 


a. Is the 10% condition met in this case? Justify your answer. 


b. Is the Large Counts condition met in this case? Justify your 
answer. 


41. Do you drink the cereal milk? A USA Today poll asked a random 
sample of 1012 U.S. adults what they do with the milk in the bowl 
after they have eaten the cereal. Let p/? be the proportion of people 
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42. 


in the sample who drink the cereal milk. A spokesman for the dairy 
industry claims that 70% of all U.S. adults drink the cereal milk. 
Suppose this claim is true. 


a. What is the mean of the sampling distribution of p\?P? 


b. Find the standard deviation of the sampling distribution of pr.P: 
Verify that the 10% condition is met. 


c. Verify that the sampling distribution of p’? is approximately 
Normal. 


d. Of the poll respondents, 67% said that they drink the cereal milk. 
Find the probability of obtaining a sample of 1012 adults in which 
67% or fewer say they drink the cereal milk, assuming the milk 
industry spokesman’s claim is true. 


e. Does this poll give convincing evidence against the spokesman’s 
claim? Explain your reasoning. 


Do you go to church? The Gallup Poll asked a random sample of 
1785 adults if they attended church during the past week. Let pp be 
the proportion of people in the sample who attended church. A 
newspaper report claims that 40% of all U.S. adults went to church 
last week. Suppose this claim is true. 


a. What is the mean of the sampling distribution of pA??? 


b. Find the standard deviation of the sampling distribution of pr.P: 
Verify that the 10% condition is met. 


c. Verify that the sampling distribution of p’? is approximately 
Normal. 


d. Of the poll respondents, 44% said they did attend church last week. 
Find the probability of obtaining a sample of 1785 adults in which 
44% or more say they attended church last week, assuming the 
newspaper report’s claim is true. 


e. Does this poll give convincing evidence against the newspaper’s 
claim? Explain your reasoning. 
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43. 


44. 


45. 


46. 


pg 464 |») Students on diets Suppose that 70% of college women 


have been on a diet within the past 12 months. A sample survey 
interviews an SRS of 267 college women. What is the probability that 
75% or more of the women in the sample have been on a diet? 


Who owns a Harley? Harley-Davidson motorcycles make up 14% of 
all the motorcycles registered in the United States. You plan to 
interview an SRS of 500 motorcycle owners. How likely is your 
sample to contain 20% or more who own Harleys? 


On-time shipping A mail-order company advertises that it ships 90% 
of its orders within three working days. You select an SRS of 100 of 
the 5000 orders received in the past week for an audit. The audit 
reveals that 86 of these orders were shipped on time. 


a. If the company really ships 90% of its orders on time, what is the 
probability that the proportion in an SRS of 100 orders is 0.86 or 
less? 


b. Based on your answer to part (a), is there convincing evidence that 


less than 90% of all orders from this company are shipped within 
three working days? Explain your reasoning. 


Wait times A hospital claims that 75% of people who come to its 
emergency room are seen by a doctor within 30 minutes of checking 
in. To verify this claim, an auditor inspects the medical records of 55 
randomly selected patients who checked into the emergency room 
during the last year. Only 32 (58.2%) of these patients were seen by a 
doctor within 30 minutes of checking in. 


a. If the wait time is less than 30 minutes for 75% of all patients in 
the emergency room, what is the probability that the proportion of 
patients who wait less than 30 minutes is 0.582 or less in a random 
sample of 55 patients? 


b. Based on your answer to part (a), is there convincing evidence that 


less than 75% of all patients in the emergency room wait less than 
30 minutes? Explain your reasoning. 


Multiple Choice Select the best answer for Exercises 47—50. 
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Exercises 47—49 refer to the following setting. The magazine Sports 
Illustrated asked a random sample of 750 Division I college athletes, 
“Do you believe performance-enhancing drugs are a problem in 
college sports?” Suppose that 30% of all Division I athletes think that 
these drugs are a problem. Let pip be the sample proportion who say 
that these drugs are a problem. 


. Which of the following are the mean and standard deviation of the 
sampling distribution of the sample proportion prop? 


a. Mean = 0.30, SD = 0.017 
b. Mean = 0.30, SD = 0.55 

c. Mean = 0.30, SD = 0.0003 
d. Mean = 225, SD = 12.5 


e. Mean = 225, SD = 157.5 


. Decreasing the sample size from 750 to 375 would multiply the 
standard deviation by 


a. 2. (d)/2.1/v2. 

b. 2v2 (e) none of these. 

c. 1/2. 

. The sampling distribution of p’? is approximately Normal because 
a. there are at least 7500 Division I college athletes. 

b. np = 225 and n(1 — p) = 525 are both at least 10. 

c. arandom sample was chosen. 

d. the athletes’ responses are quantitative. 


e. the sampling distribution of p/P always has this shape. 


. Ina congressional district, 55% of the registered voters are 
Democrats. Which of the following is equivalent to the probability of 
getting less than 50% Democrats in a random sample of size 100°? 
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Plze< 0.50—0.55 
a. P(z<0.50-0.55100) ( 100 ) 


_ - .0.50—0.55 


S 


_ 0.55—0.50 


/ 0.55(0.45) 
P(z<0.50-0.550.55(0.45)100) \ 100 
" 


P(z<0.55-0.500.55(0.45)100) 


ie) 


7 (: < — jae) 
d. P(z<0.50-0.55100(0.55)(0.45)) -/100(0.55)(0-45) 


e. P(z<0.55-0.50100(0.55)(0.45)) \/100(0.55)(0.45) 
Recycle and Review 


51. Sharing music online (5.2, 5.3) A sample survey reports that 29% of 
Internet users download music files online, 21% share music files 


from their computers, and 12% both download and share music.Z 


a. Make a two-way table that displays this information. 


b. What percent of Internet users neither download nor share music 
files? 


c. Given that an Internet user downloads music files online, what is 
the probability that this person also shares music files? 


52. Whole grains (4.2) A series of observational studies revealed that 
people who typically consume 3 servings of whole grain per day have 
about a 20% lower risk of dying from heart disease and about a 15% 
lower risk of dying from stroke or cancer than those who consume no 


whole grains.® 


a. Explain how confounding makes it difficult to establish a cause- 
and-effect relationship between whole grain consumption and risk 
of dying from heart disease, stroke, or cancer, based on these 
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studies. 


b. Explain how researchers could establish a cause-and-effect 
relationship in this context. 
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SECTION 7.3 Sample Means 


LEARNING TARGETS By the end of the section, you should be able to: 


e Calculate the mean and standard deviation of the sampling 
distribution of asample mean X X and interpret the standard 
deviation. 


Explain how the shape of the sampling distribution of xX X is 
affected by the shape of the population distribution and the sample 
size. 


If appropriate, use a Normal distribution to calculate probabilities 
involving X .x. 


Sample proportions arise most often when we are interested in categorical 
variables. We then ask questions like “What proportion of U.S. adults has 
watched Survivor?” or “What percent of the adult population attended 
church last week?” But when we record quantitative variables—household 
income, lifetime of car brake pads, blood pressure—we are interested in 
other statistics, such as the median or mean or standard deviation of the 
variable. The sample mean x = is the most common statistic computed 
from quantitative data. 


The Sampling Distribution of X X 


When Mrs. Gallas’s class did the “Penny for your thoughts” activity at the 
beginning of the chapter, her students produced the “dotplots” in Figure 


7.10, which approximate the sampling distribution of the sample mean 
year of penny for samples of size n = 5 and for samples of size n = 20. 


DEFINITION Sampling distribution of the sample mean 


The sampling distribution of the sample mean x X describes the 
distribution of values taken by the sample mean X * in all possible 
samples of the same size from the same population. 
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x 
= a 
ey 22 2a 
~ « %_ XXXKXKX X 
~o% ~X_ XX _XXKKKKXXXX 
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1990 1995 2000 2005 200 205 


x = Sample mean yedn= 5) 


omen: (Renee aes | ee eens ee, ee 
1990 1995 2000 2005 200 2056 
¥= sample mean yea4n= 20 


FIGURE 7.10 Approximate sampling distribution of the sample mean year of 


pennies in 50 samples of size n = 5 and 50 samples of size n = 20 from a 


population of pennies. 


How do these approximate sampling distributions compare? 


Shape: The distribution of x = is slightly skewed to the left when using 
samples of size n = 5 but roughly symmetric when using samples of size 
n= 20. 

Center: The distribution of x Z is centered at around 2002 for both 


~ 


sample sizes (ux * 2002)/z ~ 2002, 


Variability: The distribution of x Z is about half as variable when using 
samples of size n = 20 (ox ¥2.672 ~ 2-6) than with samples of size n = 
5 (ox 85.2% © 5-2), 

Like the sampling distribution of p4,?> there are some simple rules that 
describe the mean and standard deviation of the sampling distribution of 
x .£. Describing the shape of the sampling distribution of x % is more 


complicated, so we’ ll save that for later. 


SAMPLING DISTRIBUTION OF x =: MEAN AND 


STANDARD DEVIATION 
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Suppose that x £ is the mean of an SRS of size n drawn from a large 
population with mean p and standard deviation o. Then: 


e The mean of the sampling distribution of x % is ux =p.z ~ F: 


e The standard deviation of the sampling distribution of x = is 


o 
oc; =— 
n 


ox =on vn 


as long as the 10% condition is satisfied: n<0.10N.n < 0.10N. The 
value ox %= measures the typical distance between a sample mean 
x © and the population mean pu. 


The behavior of x £ in repeated samples is much like that of the 


sample proportion p‘:P : 


The sample mean x = is an unbiased estimator of the population mean 
Ll. 

The variability of x % depends on both the variability in the population 
oo and the sample size n. Values of x will be more variable for 
populations that have more variability. Values of x £ will be less 
variable for larger samples. Specifically, multiplying the sample size by 
4 cuts the standard deviation in half. 


You should use the formula o/n7/V” for the standard deviation of x = 
only when the sample size is less than 10% of the population size (the 
10% condition). There is a more complicated formula for the standard 
deviation when the sample size is larger than 10% of the population 
size, but it is beyond the scope of this book. 


Notice that these facts about the mean and standard deviation of x = are 
true no matter what shape the population distribution has. 


AP® EXAM TIP 


Notation matters. The symbols p/,x ,n,p,p,0,p/,op’,ux , 


Psy 1s Ps Bs 9s By» Pp» => and ox % all have specific and different 


meanings. Either use notation correctly—or don’t use it at all. You can 
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| expect to lose credit if you use incorrect notation. | 


EXAMPLE | Been to the movies recently? » 


Mean and standard deviation of X 


PatriciaPix/Getty Images 


PROBLEM: The number of movies viewed in the last year by students 
at a large high school has a mean of 19.3 movies with a standard 
deviation of 15.8 movies. Suppose we take an SRS of 100 students from 
this school and calculate the mean number of movies viewed by the 
members of the sample. 

a. Identify the mean of the sampling distribution of x .%. 


b. Calculate and interpret the standard deviation of the sampling 
distribution of x .%. Verify that the 10% condition is met. 


SOLUTION: 


a. ux =19.3Hz = 19-3movies 


b. Assuming that n = 100 is less than 10% of students at the large high 


15.8 


= Oz = . . 
school, ox = 15.8100 = 1.58 v100 movies. 
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———— 


In SRSs of size 100, the sample mean number of movies viewed will 
typically vary by about 1.58 movies from the true mean of 19.3 movies. 


FOR PRACTICE, TRY EXERCISE 53 


Think About It 


WHERE DO THE FORMULAS FOR THE MEAN AND 
STANDARD DEVIATION OF x © COME FROM? Choose an SRS 
of size n from a population, and measure a quantitative variable X on 
each individual in the sample. Call the individual measurements Xj, X5, 
..., Xp. If the population is large relative to the sample, we can think of 
these X;’s as independent random variables, each with mean p and 


standard deviation o. Because 


1 
_ B=— (X,+X2.+...+ Xn) 

x =1n(X1+X2+...+Xn) on 

we can use the rules for random variables from Chapter 6 to find the 
mean and standard deviation of x .®- If we let T = X, + Xp +... + Xp, 


- , es 
then x =1nT.? = af: 
Using the addition rules for means and variances, we get 


w= X1+pX2++uXn=ptut+p=npolT2=0X12+0X22+~ 
+o Xn2=02+02++02=no2 >oT=no2=on 
br = px, + ux, +--+ + hx, =Mte+---+ p= ny 


” oe 2 eto = 2 eet oe = ne 
OT = FF, + OF | oY =o to~ -+ oo” =no 


Because x = is just a constant multiple of the random variable T, 


1 ¥ ) 
a LS TO (Tp) = 
[Lx =1npT=1n(np)=p" ae ee aad 


ox =1noT=1n(on)=onn2=01n=oln=on 


1 1 _ rn o 
Cs =—or =—(e,/n) =<, | — = a4] — = oO E= 
r= 7 (evn) = 9,/—> =oy/- 2 


on \ ’ 
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Sampling from a Normal Population 


We have described the mean and standard deviation of the sampling 
distribution of a sample mean x = but not its shape. That’s because the 
shape of the sampling distribution of x % depends on the shape of the 
population distribution. In one important case, there is a simple 
relationship between the two distributions. The following activity shows 
what we mean. 


ACTIVITY Exploring the sampling distribution of x = for 
a Normal population 


Professor David Lane of Rice University has developed a wonderful 
applet for investigating the sampling distribution of x .- It’s dynamic, 
and it’s fun to play with. In this activity, you’ ll use Professor Lane’s 
applet to explore the shape of the sampling distribution when the 
population is Normally distributed. 


1. Go to http://onlinestatbook.com/stat_sim/sampling dist/ or search for 


“online statbook sampling distributions applet” and go to the website. 
When the BEGIN button appears on the left side of the screen, click 
on it. You will then see a yellow page entitled “Sampling 
Distributions” like the one in the screen shot. 
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mean= 16.00 Parent population 


median= 16.00 Claaciowens 
sd= 5.00 
2 a Normal ¥ 
lh 0.00 [Normal _* | 
kurtosis= 0.00 

2 = Sample: 
Reps= 5 Sample Data 
range= 13.00 


CorFNWAUAD 


0 : | | T 32 


Reps= 6 Distribution of Means, N=5 
mean 16.05 Pear 
median= 16.00 " 
sd= 1.89 C) Fit normal 
skew= 0.02 
kurtosis= —0.43 

0 3 


2. There are choices for the population distribution: Normal, uniform, 
skewed, and custom. Keep the default option: Normal. Click the 
“Animated” button. What happens? Click the button several more 
times. What do the black boxes represent? What is the blue square 
that drops down onto the plot below? 


3. Click on “Clear lower 3” to start clean. Then click on the “100,000” 
button under “Sample:” to simulate taking 100,000 SRSs of sizen=5 
from the population. Answer these questions: 


KFnNwWt ua 


o 
nN 


e Does the simulated sampling distribution of x % (blue bars) have a 
recognizable shape? Click the box next to “Fit normal.” 


e To the left of each distribution is a set of summary Statistics. 
Compare the mean of the simulated sampling distribution with the 
mean of the population. 


e How is the standard deviation of the simulated sampling 
distribution related to the standard deviation of the population? 


4. Click “Clear lower 3.” Use the drop-down menus to set up the bottom 
graph to display the mean for samples of size n = 20. Then sample 
100,000 times. How do the two distributions of x = compare: shape, 
center, and variability? 


5. What have you learned about the shape of the sampling distribution 
of x © when the population has a Normal shape? 
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As the preceding activity demonstrates, if the population distribution is 
Normal, then so is the sampling distribution of x .%. This is true no matter 
what the sample size is. 


SAMPLING DISTRIBUTION OF THE SAMPLE MEAN xX 
X WHEN SAMPLING FROM A NORMAL POPULATION 


Suppose that a population is Normally distributed with mean p and 
standard deviation o. Then the sampling distribution of x 7 has the 
Normal distribution with mean px =pz ~ # and standard deviation 
ox =o/n?z = 7/¥n (provided the 10% condition is met). 


We already knew the mean and standard deviation of the sampling 
distribution. All we have added is the Normal shape. Now we have enough 
information to calculate probabilities involving x £ when the population 
distribution is Normal. 


EXAMPLE | Young women’s heights (>) 


Sampling from a Normal population 


Alamy Images 


PROBLEM: The heights of young women follow a Normal distribution 
with mean p = 64.5 inches and standard deviation o = 2.5 inches. 


a. Find the probability that a randomly selected young woman is taller 
than 66.5 inches. 
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b. Find the probability that the mean height of an SRS of 10 young 
women exceeds 66.5 inches. 


SOLUTION: 


7 | 


4 
SF 53.5 62 64.5 67 69.5 ¥Z 
66.5 


Height (in.) 


1. Draw a Normal distribution. 


66.5—64.5 
: z= ——— = 0.80 
i. Z= 66.5 — 64.52.5 = 0.80 2.5 
Using ng aE 7 P(X > 66.5) - ta 0.7881 
Table PX?88-5)=P(z>0.80)=1-0.7881=0.2119 = ou 
Be 
Using technology: normalcdf (lower:0.80, upper:1000, mean:0, 
SD:1) = 0.2119 


ii. normalcdf (lower:66.5, upper:1000, mean:64.5, SD:2.5) = 0.2119 


2. Perform calculations. 
i. Standardize and use Table A or technology or 


ii. Use technology without standardizing. 
Be sure to answer the question that was asked. 


b. pX =64.54x = 64.5 
Because 10 < 10% of all young women, 


Justify that the distribution of XX is Normal. 
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tT te Oe } 
62.13 62.92 63.71 64.5 65.29 66.08 | 66.87 
X= 66.5 


x= sample mean height (in.) 


2.5 
ee | Cy 


r =—_ 


xe = 2510 =0.79 v10 


Calculate the mean and standard deviation of the sampling 
distribution of X .X . 


ux Mz = a 


o 


Jn 


Because the population of heights is Normal, the distribution of x £ 
is also Normal. 
_ 66.5—64.5 953 
i. Z= 66.5 — 64.50.79 = 2.53 0.79 
Using Table A: P(x >66.5)=P(z>2.53)=1—0.9943=0.0057 
P(% > 66.5) = P(z > 2.53) = 1 — 0.9943 = 0.0057 
Using technology: normalcdf (lower:2.53, upper:1000, mean:0, 
SD:1) = 0.0057 


ii. normalcdf (lower:66.5, upper:1000, mean:64.5, SD:0.79) = 
0.0057 


FOR PRACTICE, TRY EXERCISE 57 


Figure 7.11 compares the population distribution and the sampling 
distribution of x % for the example about young women’s heights. It also 
shows the areas corresponding to the probabilities that we computed. You 
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can see that it is much less likely for the average height of 10 randomly 
selected young women to exceed 66.5 inches than it is for the height of 
one randomly selected young woman to exceed 66.5 inches. 


Sampling 


distribution 
of X 


Population 
distribution 


M=645 in. 66,5 in, 


FIGURE 7.11 The sampling distribution of the mean height x | 


for SRSs of 10 
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young women compared with the population distribution of young women’s 
heights. 


AP® EXAM TIP 


Many students lose credit on probability calculations involving x Z 
because they forget to divide the population standard deviation by n. 
v™- Remember that averages are less variable than individual 


observations! 


The fact that averages of several observations are less variable than 
individual observations is important in many settings. For example, it is 
common practice in science and medicine to repeat a measurement several 
times and report the average of the results. 


CHECK YOUR UNDERSTANDING 


(\ 


’ 


y The length of human pregnancies from conception to birth varies 
according to a distribution that is approximately Normal with mean 266 
days and standard deviation 16 days. 


1. Find the probability that a randomly chosen pregnant woman has a 
pregnancy that lasts for more than 270 days. 


Suppose we choose an SRS of 6 pregnant women. Let x %= the mean 
pregnancy length for the sample. 


2. What is the mean of the sampling distribution of x ?£? 


3. Calculate and interpret the standard deviation of the sampling 
distribution of x .%- Verify that the 10% condition is met. 


4. Find the probability that the mean pregnancy length for the women in 
the sample exceeds 270 days. 
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The Central Limit Theorem 


Most population distributions are not Normal. What is the shape of the 
sampling distribution of x when sampling from a non-Normal 
population? The following activity sheds some light on this question. 


ACTIVITY Exploring the sampling distribution of X X 
for non-Normal populations 


Let’s use the sampling distributions applet from the preceding activity to 
investigate what happens when we start with a non-Normal population 
distribution. 


1. Go to onlinestatbook.com/stat_sim/sampling_dist/ and launch the 
applet. Select “Skewed” population. Set the bottom two graphs to 


display the mean—one for samples of size 2 and the other for samples 
of size 5. Click the “Animated” button a few times to be sure you see 
what’s happening. Then “Clear lower 3” and take 100,000 SRSs. 
Describe what you see. 


mean= 16.00 Parent population 
median= 16.00 Clear lower 3 
sd= 5.00 
Normal * 
» al x66 [Normal | 
kurtosis= 0.00 
0 (SARA DE 32 Sample: 

Reps= 5 Sample Data | Animated | 
range= 13.00 6 

5 

4 

3 10,000 

2 100,000 

1 

0 0 32 
Reps= 6 Distribution of Means, N=5 
mcan= 16.05 6 “ 
median= 16.00 : 
sd= 1.89 ; C Fit normal 
skew= 0.02 > 
kurtosis= —0.43 1 

0 


0 32 


2. Change the sample sizes to n = 10 and n =16 and take 100,000 
samples. What do you notice? 
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. Now change the sample sizes to n = 20 and n = 25 and take 100,000 
more samples. Did this confirm what you saw in Step 2? 


4. Clear the page, and select “Custom” distribution. Click on a point on 
the population graph to insert a bar of that height. Or click on a point 
on the horizontal axis, and drag up to define a bar. Make a 
distribution that looks as strange as you can. (Note: You can shorten a 
bar or get rid of it completely by clicking on the top of the bar and 
dragging down to the axis.) Then repeat Steps 1 to 3 for your custom 
distribution. Cool, huh? 


O1 


. Summarize what you learned about the shape of the sampling 
distribution of x .. 


The screen shots in Figure 7.12 show the approximate sampling 
distributions of x Z for samples of size n = 2 and samples of size n = 25 
from three different populations. 


Parent Population Parent Population Parent Population 


3 Ete. 32 I ETT 39 


Sample Data Sample Data Sample Data 


Distribution of Means, N = 2 Distribution of Means, N = 2 Distribution of Means, N =2 


| UE ULE 3 YD 32 


Distribution of Means, N = 25 Distribution of Means, N = 25 Distribution of Means, N = 25 


| Nay) 32 0 e 32 


FIGURE 7.12 Approximate sampling distributions of x — 


: 
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for different population shapes and sample sizes. 


It is a remarkable fact that as the sample size increases, the sampling 
distribution of x ® changes shape: it looks less like that of the population 
and more like a Normal distribution. When the sample size is large 
enough, the sampling distribution of x £ is very close to Normal. This is 
true no matter what shape the population distribution has, as long as the 
population has a finite standard deviation o. This important fact of 
probability theory is called the central limit theorem (sometimes 
abbreviated as CLT). 


DEFINITION Central limit theorem (CLT) 
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Draw an SRS of size n from any population with mean p and finite 
standard deviation o. The central limit theorem (CLT) says that 
when n is large, the sampling distribution of the sample mean x = is 
approximately Normal. 


How large a sample size n is needed for the sampling distribution of x — 
Z to be close to Normal depends on the population distribution. More 
observations are required if the shape of the population distribution is far 
from Normal. In that case, the sampling distribution of x will also be 
very non-Normal if the sample size is small. 

As Figure 7.12 illustrates, even when the population distribution is 
very non- Normal, the sampling distribution of x % often looks 
approximately Normal with sample sizes as small as n = 25. To be safe, 
we’ll require that n be at least 30 to invoke the CLT. 


SHAPE OF THE SAMPLING DISTRIBUTION OF THE 
SAMPLE MEAN X X 


e If the population distribution is Normal, the sampling distribution of 
x £ will also be Normal, no matter what the sample size n is. 


If the population distribution is not Normal, the sampling distribution 
of x # will be approximately Normal when the sample size is large 
(n > 30 in most cases). If the sample size is small and the population 
distribution is not Normal, the sampling distribution of x # will 
retain some characteristics of the population distribution (e.g., 
skewness). 


EXAMPLE | Free oil changes (>) 
Calculations using the CLT 
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Joe Belanger/Alamy 


PROBLEM: Keith is the manager of an auto-care center. Based on 
service records from the past year, the time (in hours) that a technician 
requires to complete a standard oil change and inspection follows a 
right-skewed distribution with p = 30 minutes and o = 20 minutes. For a 
promotion, Keith randomly selects 40 current customers and offers them 
a free oil change and inspection if they redeem the offer during the next 
month. Keith budgets an average of 35 minutes per customer for a 
technician to complete the work. Will this be enough? 


a. Calculate the probability that the average time it takes to complete 
the work exceeds 35 minutes. 


b. How much average time per customer should Keith budget if he 
wants to be 99% certain that he doesn’t go “over budget”? 


SOLUTION: 


a. px = 30Hz = 30 
Assuming 40 < 10% of all current customers, 


2 ozs = % = 3.16 
ox = 2040 = 3.16 v4 


Calculate the mean and standard deviation of the sampling distribution of 


X 2. 
ce 


Hx =! | me. 


Because the sample size is large (40 = 30), the distribution of x 7 is 
approximately Normal. 


| Justify that the distribution of x © is approximately Normal | 
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20.52 23.68 26.84 30 33.16 | 3632 39.48 
x 


X= sample mean time (min) 


i. 2=35-303.16=1.58° 316 


Using Table A:P(x >35)=P(z>1.58)=1-0.9429=0.0571 
P (= > 35) = P(z > 1.58) = 1 — 0.9429 = 0.0571 


Using 
technology: normalcdf (lower:1.58, upper:1000, mean:0, SD:1)=0.0571 
normalcdf (lower:1.58, upper:1000, mean:0, SD:1) = 0.0571 


ii. normalcdf (lower:35, upper:1000, mean:30, SD:3.16)=0.0568 
normalcdf (lower:35, upper:1000, mean:30, SD:3.16) = 0.0568 


There is only a 5.68% probability that Keith hasn’t budgeted enough 
time to complete the work. 


b. 


Area = 0.99 
/ 


A 
La 


2052 23.68 26.84 30 33.16 36.32 x 39.48 


X= sample mean time (min) 
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1. Draw a Normal distribution. 


i. Using Table A:0.99 area to left = z=2.330.99 area to left + z = 2.33 


Using technology: invnorm(area:0.99, mean:0, SD:1)=2.33 
invnorm(area:0.99, mean:0, SD:1) = 2.33 


2.33 =x -303.16—x =37.4 minutes 


z—30 = . 
235 = =r — 2 = 37.4 minutes 
~~». ) 


ii. invnorm(area:0.99, mean:30, SD:3.16) = 37.4 minutes 


To be 99% sure he has budgeted enough time, Keith should plan for an 
average of 37.4 minutes per customer. 


. Perform calculations. 


. Use Table A or technology to find the value of z with the indicated 
area under the standard Normal curve, then “unstandardize” to 
transform back to the original distribution; or 


ii. Use technology to find the desired value without standardizing. 
Be sure to answer the question that was asked. 


FOR PRACTICE, TRY EXERCISE 65 


What if Keith decided to give away only 10 free oil changes? Because 
the population distribution is skewed to the right and the sample size is 
small (10 < 30), we can’t use a Normal distribution to do probability 
calculations. The sampling distribution of x = is likely to be skewed to the 
right—although not as strongly as the population distribution itself. 


Section 7.3 Summary 


e When we want information about the population mean p for some 
quantitative variable, we often take an SRS and use the sample mean x | 
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© to estimate the unknown parameter p!. The sampling distribution of 
the sample mean x = describes how the statistic x % varies in all 
possible samples of the same size from the population. 


e The mean of the sampling distribution of x % is px =p,4z ~ > sox & 
is an unbiased estimator of . 


e The standard deviation of the sampling distribution of x % is ox =o/n 
7; =2/\/" for an SRS of size n if the population has standard deviation 
o. This formula can be used if the sample size is less than 10% of the 
population size (10% condition). 


e Choose an SRS of size n from a population with mean p and standard 
deviation o. If the population distribution is Normal, then so is the 
sampling distribution of the sample mean x .@. If the population 
distribution is not Normal, the central limit theorem (CLT) states that 
when n is large, the sampling distribution of x Z is approximately 
Normal. 


e In some cases, we can use a Normal distribution to calculate 
probabilities for events involving x .@. 


e If the population distribution is Normal, so is the sampling 
distribution of x .®. 


e Ifn>30, the sampling distribution of x % will be approximately 
Normal in most cases. 


Section 7.3 Exercises 


53. pg 470 |») Songs on an iPod David’s iPod has about 10,000 songs. 


The distribution of the play times for these songs is heavily skewed to 
the right with a mean of 225 seconds and a standard deviation of 60 
seconds. Suppose we choose an SRS of 10 songs from this population 
and calculate the mean play time x of these songs. 


a. Identify the mean of the sampling distribution of x .%. 


b. Calculate and interpret the standard deviation of the sampling 
distribution of x .%- Verify that the 10% condition is met. 
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4. 


55. 


56. 


a7. 


58. 


Making auto parts A grinding machine in an auto parts plant 
prepares axles with a target diameter p = 40.125 millimeters (mm). 
The machine has some variability, so the standard deviation of the 
diameters is o = 0.002 mm. The machine operator inspects a random 
sample of 4 axles each hour for quality control purposes and records 
the sample mean diameter x .%- Assume the machine is working 


properly. 
a. Identify the mean of the sampling distribution of x .%. 


b. Calculate and interpret the standard deviation of the sampling 
distribution of x .%- Verify that the 10% condition is met. 


Songs on an iPod Refer to Exercise 53. How many songs would you 
need to sample if you wanted the standard deviation of the sampling 
distribution of x Z to be 10 seconds? Justify your answer. 


Making auto parts Refer to Exercise 54. How many axles would you 
need to sample if you wanted the standard deviation of the sampling 
distribution of x Z to be 0.0005 mm? Justify your answer. 


pg 472 |») Bottling cola A bottling company uses a filling machine 


to fill plastic bottles with cola. The bottles are supposed to contain 
300 milliliters (ml). In fact, the contents vary according to a Normal 
distribution with mean p = 298 ml and standard deviation o = 3 ml. 


a. What is the probability that a randomly selected bottle contains 
less than 295 ml? 


b. What is the probability that the mean contents of six randomly 
selected bottles is less than 295 ml? 


Cereal A company’s cereal boxes advertise that each box contains 
9.65 ounces of cereal. In fact, the amount of cereal in a randomly 
selected box follows a Normal distribution with mean p = 9.70 
ounces and standard deviation o = 0.03 ounce. 


a. What is the probability that a randomly selected box of the cereal 
contains less than 9.65 ounces of cereal? 


b. Now take an SRS of 5 boxes. What is the probability that the mean 
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59. 


60. 


61. 


amount of cereal in these boxes is less than 9.65 ounces? 


Cholesterol Suppose that the blood cholesterol level of all men aged 
20 to 34 follows the Normal distribution with mean p = 188 
milligrams per deciliter (mg/dl) and standard deviation o = 41 mg/dl. 


a. Choose an SRS of 100 men from this population. Describe the 
sampling distribution of x .®. 


b. Find the probability that x % estimates p within +3 mg/dl. (This is 


the probability that x ® takes a value between 185 and 191 mg/dl.) 


c. Choose an SRS of 1000 men from this population. Now what is the 
probability that x % falls within +3 mg/dl of p? In what sense is 
the larger sample “better’’? 


Finch beaks One dimension of bird beaks is “depth”—the height of 
the beak where it arises from the bird’s head. During a research study 
on one island in the Galapagos archipelago, the beak depth of all 
Medium Ground Finches on the island was found to be Normally 
distributed with mean p= 9.5 millimeters (mm) and standard 


deviation o¢= 1.0 mm.2 


a. Choose an SRS of 5 Medium Ground Finches from this 
population. Describe the sampling distribution of x .%. 


b. Find the probability that x % estimates p within +0.5 mm. (This is 


the probability that x % takes a value between 9 and 10 mm.) 


c. Choose an SRS of 50 Medium Ground Finches from this 
population. Now what is the probability that x % falls within +0.5 
mm of p/? In what sense is the larger sample “better”? 


Dead battery? A car company claims that the lifetime of its batteries 
varies from car to car according to a Normal distribution with mean J 
= 48 months and standard deviation o = 8.2 months. A consumer 
organization installs this type of battery in an SRS of 8 cars and 
calculates x %= 42.2 months. 


a. Find the probability that the sample mean lifetime is 42.2 months 
or less if the company’s claim is true. 
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b. Based on your answer to part (a), is there convincing evidence that 
the company is overstating the average lifetime of its batteries? 


62. Foiled again? The manufacturer of a certain brand of aluminum foil 
claims that the amount of foil on each roll follows a Normal 
distribution with a mean of 250 square feet (ft?) and a standard 
deviation of 2 ft2. To test this claim, a restaurant randomly selects 10 
rolls of this aluminum foil and carefully measures the mean area to be 
x ©= 249.6 ft. 


a. Find the probability that the sample mean area is 249.6 ft? or less if 
the manufacturer’s claim is true. 


b. Based on your answer to part (a), is there convincing evidence that 
the company is overstating the average area of its aluminum foil 
rolls? 


63. Songs on an iPod David’s iPod has about 10,000 songs. The 
distribution of the play times for these songs is heavily skewed to the 
right with a mean of 225 seconds and a standard deviation of 60 
seconds. 


a. Describe the shape of the sampling distribution of x % for SRSs of 
size n = 5 from the population of songs on David’s iPod. Justify 
your answer. 


b. Describe the shape of the sampling distribution of x £ for SRSs of 
size n = 100 from the population of songs on David’s iPod. Justify 
your answer. 


64. High school GPAs The distribution of grade point average for 
students at a large high school is skewed to the left with a mean of 
3.53 and a standard deviation of 1.02. 


a. Describe the shape of the sampling distribution of x Z for SRSs of 
size n = 4 from the population of students at this high school. 
Justify your answer. 


b. Describe the shape of the sampling distribution of x £ for SRSs of 
size n = 50 from the population of students at this high school. 
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65. 


66. 


67. 


Justify your answer. 


pg 477 |) More on insurance An insurance company claims that 


in the entire population of homeowners, the mean annual loss from 
fire is p = $250 and the standard deviation of the loss is o = $5000. 
The distribution of losses is strongly right-skewed: many policies 
have $0 loss, but a few have large losses. The company hopes to sell 
1000 of these policies for $300 each. 


a. Assuming that the company’s claim is true, what is the probability 
that the mean loss from fire is greater than $300 for an SRS of 
1000 homeowners? 


b. If the company wants to be 90% certain that the mean loss from 


fire in an SRS of 1000 homeowners is less than the amount it 
charges for the policy, how much should the company charge? 


Cash grab At a traveling carnival, a popular game is called the “Cash 
Grab.” In this game, participants step into a sealed booth, a powerful 
fan turns on, and dollar bills are dropped from the ceiling. A customer 
has 30 seconds to grab as much cash as possible while the dollar bills 
swirl around. Over time, the operators of the game have determined 
that the mean amount grabbed is $13 with a standard deviation of $9. 
They charge $15 to play the game and expect to have 40 customers at 
their next carnival. 


a. What is the probability that an SRS of 40 customers grab an 
average of $15 or more? 


b. How much should the operators charge if they want to be 95% 


certain that the mean amount grabbed by an SRS of 40 customers 
is less than what they charge to play the game? 


Bad carpet The number of flaws per square yard in a type of carpet 
material varies with mean 1.6 flaws per square yard and standard 
deviation 1.2 flaws per square yard. 


a. Without doing any calculations, explain which event is more 
likely: 


1032 


e randomly selecting a 1 square yard of material and finding 2 or 
more flaws 


e randomly selecting 50 square yards of material and finding an 
average of 2 or more flaws 


b. Explain why you cannot use a Normal distribution to calculate the 
probability of the first event in part (a). 


c. Calculate the probability of the second event in part (a). 


68. How many people in a car? A study of rush-hour traffic in San 
Francisco counts the number of people in each car entering a freeway 
at a suburban interchange. Suppose that this count has mean 1.6 and 
standard deviation 0.75 in the population of all cars that enter at this 
interchange during rush hour. 


a. Without doing any calculations, explain which event is more 
likely: 


e randomly selecting 1 car entering this interchange during rush 
hour and finding 2 or more people in the car 


e randomly selecting 35 cars entering this interchange during rush 
hour and finding an average of 2 or more people in the cars 


b. Explain why you cannot use a Normal distribution to calculate the 
probability of the first event in part (a). 


c. Calculate the probability of the second event in part (a). 


69. What does the CLT say? Asked what the central limit theorem says, 
a student replies, “As you take larger and larger samples from a 
population, the histogram of the sample values looks more and more 
Normal.” Is the student right? Explain your answer. 


70. What does the CLT say? Asked what the central limit theorem says, 
a student replies, “As you take larger and larger samples from a 
population, the variability of the sampling distribution of the sample 
mean decreases.” Is the student right? Explain your answer. 


71. Airline passengers get heavier In response to the increasing weight 
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12: 


of airline passengers, the Federal Aviation Administration (FAA) told 
airlines to assume that passengers average 190 pounds in the summer, 
including clothes and carry-on baggage. But passengers vary, and the 
FAA did not specify a standard deviation. A reasonable standard 
deviation is 35 pounds. A commuter plane carries 30 passengers. Find 
the probability that the total weight of 30 randomly selected 
passengers exceeds 6000 pounds. (Hint: To calculate this probability, 
restate the problem in terms of the mean weight.) 


Lightning strikes The number of lightning strikes on a square 
kilometer of open ground in a year has mean 6 and standard deviation 
2.4. The National Lightning Detection Network (NLDN) uses 
automatic sensors to watch for lightning in 1-square-kilometer plots 
of land. Find the probability that the total number of lightning strikes 
in arandom sample of 50 square-kilometer plots of land is less than 
250. (Hint: To calculate this probability, restate the problem in terms 
of the mean number of strikes.) 


Multiple Choice Select the best answer for Exercises 73—76. 


73. 


74. 


The distribution of scores on the mathematics part of the SAT exam 
in a recent year was approximately Normal with mean 515 and 
standard deviation 114. Imagine choosing many SRSs of 100 students 
who took the exam and averaging their SAT Math scores. Which of 
the following are the mean and standard deviation of the sampling 
distribution of x ?£? 


a. Mean = 515, SD = 114 

b. Mean = 515, SD=114/1008D = 114/100 

c. Mean = 515/100, SD = 114/100 

d. Mean = 515/100, SD=114/1008D = 114/100 


e. Cannot be determined without knowing the 100 scores. 


Why is it important to check the 10% condition before calculating 
probabilities involving x ?x? 
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73% 


76. 


To reduce the variability of the sampling distribution of x % 


. To ensure that the distribution of x Z is approximately Normal 


To ensure that we can generalize the results to a larger population 


. To ensure that x £ will be an unbiased estimator of py 


To ensure that the observations in the sample are close to 
independent 


A machine is designed to fill 16-ounce bottles of shampoo. When the 
machine is working properly, the amount poured into the bottles 
follows a Normal distribution with mean 16.05 ounces and standard 
deviation 0.1 ounce. Assume that the machine is working properly. If 
4 bottles are randomly selected and the number of ounces in each 
bottle is measured, then there is about a 95% probability that the 
sample mean will fall in which of the following intervals? 


a. 
b. 
C. 
d. 


e. 


16.05 to 16.15 ounces 
16.00 to 16.10 ounces 
15.95 to 16.15 ounces 
15.90 to 16.20 ounces 


15.85 to 16.25 ounces 


The number of hours a lightbulb burns before failing varies from bulb 
to bulb. The population distribution of burnout times is strongly 
skewed to the right. The central limit theorem says that 


a. 


as we look at more and more bulbs, their average burnout time gets 
ever closer to the mean p for all bulbs of this type. 


the average burnout time of a large number of bulbs has a sampling 
distribution with the same shape (strongly skewed) as the 
population distribution. 


the average burnout time of a large number of bulbs has a sampling 
distribution with a similar shape but not as extreme (skewed, but 
not as strongly) as the population distribution. 
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d. the average burnout time of a large number of bulbs has a sampling 
distribution that is close to Normal. 


e. the average burnout time of a large number of bulbs has a sampling 
distribution that is exactly Normal. 


Recycle and Review 


Exercises 77 and 78 refer to the following setting. In the language of 
government statistics, you are “in the labor force” if you are available for 
work and either working or actively seeking work. The unemployment rate 
is the proportion of the labor force (not of the entire population) that is 
unemployed. Here are estimates from the Current Population Survey for 
the civilian population aged 25 years and over in a recent year. The table 
entries are counts in thousands of people. 


Highest education Total population Inlabor force Employed 
Didn’t finish high school 27,669 12,470 11,408 
High school but no college 59,860 37,834 35,857 
Less than bachelor’s degree 47,556 34,439 32,977 
College graduate 51,582 40,390 39,293 


77. Unemployment (1.1) Find the unemployment rate for people with 
each level of education. Is there an association between 
unemployment rate and education? Explain your answer. 


78. Unemployment (5.2, 5.3) Suppose that you randomly select one 
person 25 years of age or older. 


a. What is the probability that a randomly chosen person 25 years of 
age or older is in the labor force? 


b. If you know that a randomly chosen person 25 years of age or 
older is a college graduate, what is the probability that he or she is 
in the labor force? 


c. Are the events “in the labor force” and “college graduate” 
independent? Justify your answer. 
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_Mé _. FRAPPY! FREE RESPONSE 


AP® PROBLEM, 
YAY! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, concise 
response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you 
use, because you will be scored on the correctness of your methods 
as well as on the accuracy and completeness of your results and 
explanations. 


The principal of a large high school is concerned about the number of 
absences for students at his school. To investigate, he prints a list 
showing the number of absences during the last month for each of the 
2500 students at the school. For this population of students, the 
distribution of absences last month is skewed to the right with a mean 
of p=1.1/ = 1-1 and a standard deviation of o=1.4.0 = 1.4. 

Suppose that a random sample of 50 students is selected from the 
list printed by the principal and the sample mean number of absences 
is calculated. 


a. What is the shape of the sampling distribution of the sample 
mean? Explain. 


b. What are the mean and standard deviation of the sampling 
distribution of the sample mean? 


c. What is the probability that the mean number of absences in a 
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random sample of 50 students is less than 1? 


d. Because the population distribution is skewed, the principal is 
considering using the median number of absences last month 
instead of the mean number of absences to summarize the 
distribution. Describe how the principal could use a simulation 
to estimate the standard deviation of the sampling distribution 
of the sample median for samples of size 50. 


After you finish, you can view two example solutions on the book’s 
website (highschool.bfwpub.com/tps6e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 
“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide you with a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own 


SCOre. 
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Chapter 7 Review 


Section 7.1: What Is a Sampling Distribution? 


In this section, you learned the “big ideas” of sampling distributions. The 
first big idea is the difference between a statistic and a parameter. A 
parameter is a number that describes some characteristic of a population. A 
Statistic estimates the value of a parameter using a sample from the 
population. Making the distinction between a statistic and a parameter will 
be crucial throughout the rest of the course. 


The second big idea is that statistics vary. For example, the mean 
weight in a sample of high school students is a variable that will change 
from sample to sample. This means that statistics have distributions. The 
distribution of a statistic in all possible samples of the same size is called 
the sampling distribution of the statistic. Knowing the sampling 
distribution of a statistic tells us how far we can expect a Statistic to vary 
from the parameter value and what values of the statistic should be 
considered unusual. 


The third big idea is the distinction between the distribution of the 
population, the distribution of the sample, and the sampling distribution of 
a sample statistic. Reviewing the illustration on page 446 will help you 
understand the difference between these three distributions. When you are 
writing your answers, be sure to indicate which distribution you are 
referring to. Don’t make ambiguous statements like “the distribution will 
become less variable.” 


The final big idea is how to describe a sampling distribution. To 
adequately describe a sampling distribution, you need to address shape, 
center, and variability. If the center (mean) of the sampling distribution is 
the same as the value of the parameter being estimated, then the statistic is 
called an unbiased estimator. An estimator is unbiased if it doesn’t 
consistently underestimate or consistently overestimate the parameter in 
many samples. Ideally, the variability of a sampling distribution will be 
very small, meaning that the statistic provides precise estimates of the 
parameter. Larger sample sizes result in sampling distributions with less 
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variability. 


Section 7.2: Sample Proportions 


In this section, you learned about the shape, center, and variability of the 
sampling distribution of a sample proportion. When the Large Counts 
condition (np = 10 and n(1 — p) = 10) is met, the sampling distribution of 
p’? will be approximately Normal. The mean of the sampling distribution 
of p\P is pp\=p» ~ ”, the population proportion. As a result, the sample 
proportion pp is an unbiased estimator of the population proportion p. 
When the sample size is less than 10% of the population size (the 10% 
condition), the standard deviation of the sampling distribution of the 


sample proportion is op\=p(1-p/n?» ~ VP (1—p)/ The standard 
deviation measures how far the sample proportion p/? typically varies 
from the population proportion p. 


Section 7.3: Sample Means 


In this section, you learned about the shape, center, and variability of the 
sampling distribution of a sample mean. When the population is Normally 
distributed, the sampling distribution of x Z will also be Normal for any 
sample size. When the population distribution is not Normal and the 
sample size is small, the sampling distribution of x % will resemble the 
population shape. However, the central limit theorem says that the 
sampling distribution of x Z will become approximately Normal for larger 
sample sizes (typically when n = 30), no matter what the population shape. 
You can use a Normal distribution to calculate probabilities involving the 
sampling distribution of x % if the population is Normal or the sample size 
is at least 30. 

The mean of the sampling distribution of x @ is px =pz ~ F, the 
population mean. As a result, the sample mean x @ is an unbiased 
estimator of the population mean p. When the sample size is less than 10% 
of the population size (the 10% condition), the standard deviation of the 
sampling distribution of the sample mean is ox =o/n?* ~ 7 /V™. The 
standard deviation measures how far the sample mean x = typically varies 
from the population mean pF. 
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Sampling distribution of 


pip 
Center up=pM» ~ P ux =pHe = # 
Variability pen gf ee 
op~=p(I-pn ? Vn O% a 
when the 10% condition is 
met 
Shape Approximately Normal when 


the Large Counts condition is 
met: np = 10 and n(1 — p) = 


10 


Sampling distribution of x 7 


aa, 
Vn 
when the 10% condition is met 


Normal when the population 
distribution is Normal; 
Approximately Normal if the 
population distribution is not 
Normal but the sample size is 
large (n = 30) 
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Chapter 7 Review Exercises 


These exercises are designed to help you review the important ideas and 
methods of the chapter. 


R7.1 |) Bad eggs Selling eggs that are contaminated with salmonella 


can cause food poisoning in consumers. A large egg producer 
randomly selects 200 eggs from all the eggs shipped in one day. The 
laboratory reports that 9 of these eggs had salmonella contamination. 
Identify the population, the parameter, the sample, and the statistic. 


R7.2 >» Five books An author has written 5 children’s books. The 
numbers of pages in these books are 64, 66, 71, 73, and 76. 


a. List all 10 possible SRSs of size n=3,” = 3; calculate the median 
number of pages for each sample, and display the sampling 
distribution of the sample median on a dotplot. 


b. Describe how the variability of the sampling distribution of the 
sample median would change if the sample size was increased to 
n=4.n = 4. 


c. Construct the sampling distribution of the sample median for 
samples of size n=4.n = 4. Does this sampling distribution 
support your answer to part (b)? Explain your reasoning. 


R7.3 |») Birth weights Researchers in Norway analyzed data on the 


birth weights of 400,000 newborns over a 6-year period. The 
distribution of birth weights is approximately Normal with a mean 


of 3668 grams and a standard deviation of 511 grams.12 


a. Sketch a graph that displays the distribution of birth weights for 
this population. 


b. Sketch a possible graph of the distribution of birth weights for an 
SRS of size 5. Calculate the range for this sample. 

In this population, the range (Maximum — Minimum) of birth weight 

3417 grams. We used technology to take 500 SRSs of size n=5n = 5 
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and calculate the range (Maximum — Minimum) for each sample. 
The dotplot shows the results. 


: iil ™ a 
500 


T 
1000 1500 2000 2500 3000 


Simulated sample range (g) 


0 


c. In the graph provided, there is a dot at approximately 2800. 
Explain what this value represents. 


d. Is the sample range an unbiased estimator of the population 
range? Give evidence from the graph to support your answer. 


R7.4 |) Do you jog? The Gallup Poll asked a random sample of 1540 


adults, “Do you happen to jog?” Suppose that the true proportion of 
all adults who jog is p=0.15.P = 9.15. 


a. What is the mean of the sampling distribution of pp ? 


b. Calculate and interpret the standard deviation of the sampling 
distribution of p4?. Check that the 10% condition is met. 


c. Is the sampling distribution of p/? approximately Normal? 
Justify your answer. 


d. Find the probability that between 13% and 17% of people jog in 
a random sample of 1540 adults. 


R7.5 |) Bag check Thousands of travelers pass through the airport in 


Guadalajara, Mexico, each day. Before leaving the airport, each 
passenger must pass through the customs inspection area. Customs 
agents want to be sure that passengers do not bring illegal items into 
the country. But they do not have time to search every traveler’s 
luggage. Instead, they require each person to press a button. Either a 
red or a green bulb lights up. If the red light flashes, the passenger 
will be searched by customs agents. A green light means “Go 
ahead.” Customs agents claim that 30% of all travelers will be 
stopped (red light), because the light has probability 0.30 of showing 


1045 


red on any push of the button. To test this claim, a concerned citizen 
watches a random sample of 100 travelers push the button. Only 20 
get a red light. 


a. Assume that the customs agents’ claim is true. Find the 
probability that the proportion of travelers who get a red light in 
a random sample of 100 travelers is less than or equal to the 
result in this sample. 


b. Based on your results in part (a), is there convincing evidence 
that less than 30% of all travelers will be stopped? Explain your 
reasoning. 


R7.6 |p) IQ tests The Wechsler Adult Intelligence Scale (WAIS) is a 


common IQ test for adults. The distribution of WAIS scores for 
persons over 16 years of age is approximately Normal with mean 
100 and standard deviation 15. 


a. What is the probability that a randomly chosen individual has a 
WAIS score of 105 or greater? 


b. Find the mean and standard deviation of the sampling 
distribution of the average WAIS score x £ for an SRS of 60 
people. Interpret the standard deviation. 


c. What is the probability that the average WAIS score of an SRS 
of 60 people is 105 or greater? 


d. Would your answers to any of parts (a), (b), or (c) be affected if 
the distribution of WAIS scores in the adult population was 
distinctly non-Normal? Explain your reasoning. 


R7.7 |») Detecting gypsy moths The gypsy moth is a serious threat to 


oak and aspen trees. A state agriculture department places traps 
throughout the state to detect the moths. Each month, an SRS of 50 
traps is inspected, the number of moths in each trap is recorded, and 
the mean number of moths is calculated. Based on years of data, the 
distribution of moth counts is discrete and strongly skewed with a 
mean of 0.5 and a standard deviation of 0.7. 


a. Explain why it is reasonable to use a Normal distribution to 
approximate the sampling distribution of x = for SRSs of size 
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50. 


. Estimate the probability that the mean number of moths in a 
sample of size 50 is greater than or equal to 0.6. 


. In arecent month, the mean number of moths in an SRS of size 
50 was x =0.6.% = 0.6. Based on this result, is there convincing 
evidence that the moth population is getting larger in this state? 
Explain your reasoning. 
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Chapter 7 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T7.1 A study of voting chose 663 registered voters at random shortly 
after an election. Of these, 72% said they had voted in the election. 
Election records show that only 56% of registered voters voted in 
the election. Which of the following statements is true? 


a. 72% is a sample; 56% is a population. 

b. 72% and 56% are both statistics. 

c. 72% is a Statistic and 56% is a parameter. 
d. 72% is a parameter and 56% is a statistic. 
e. 72% and 56% are both parameters. 


T7.2 The Gallup Poll has decided to increase the size of its random 
sample of voters from about 1500 people to about 4000 people 
right before an election. The poll is designed to estimate the 
proportion of voters who favor a new law banning smoking in 
public buildings. The effect of this increase is to 


a. reduce the bias of the estimate. 

b. increase the bias of the estimate. 

c. reduce the variability of the estimate. 

d. increase the variability of the estimate. 

e. reduce the bias and variability of the estimate. 


T7.3 Suppose we select an SRS of size n=100n = 100 from a large 
population having proportion p of successes. Let p/? be the 
proportion of successes in the sample. For which value of p would 
it be safe to use the Normal approximation to the sampling 
distribution of p/P ? 


a. 0.01 
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b. 0.09 
c. 0.85 
d. 0.975 
e. 0.999 


T7.4 The central limit theorem is important in statistics because it 
allows us to use a Normal distribution to find probabilities 
involving the sample mean if the 


a. sample size is reasonably large (for any population). 
b. population is Normally distributed (for any sample size). 


c. population is Normally distributed and the sample size is 
reasonably large. 


d. population is Normally distributed and the population standard 
deviation is known (for any sample size). 


e. population size is reasonably large (whether the population 
distribution is known or not). 


T7.5 The number of undergraduates at Johns Hopkins University is 
approximately 2000, while the number at Ohio State University is 
approximately 60,000. At both schools, a simple random sample of 
about 3% of the undergraduates is taken. Each sample is used to 
estimate the proportion p of all students at that university who own 
an iPod. Suppose that, in fact, p = 0.80 at both schools. Which of 
the following is the best conclusion? 


a. We expect that the estimate from Johns Hopkins will be closer 
to the truth than the estimate from Ohio State because it comes 
from a smaller population. 


b. We expect that the estimate from Johns Hopkins will be closer 
to the truth than the estimate from Ohio State because it is 
based on a smaller sample size. 


c. We expect that the estimate from Ohio State will be closer to 
the truth than the estimate from Johns Hopkins because it 
comes from a larger population. 


d. We expect that the estimate from Ohio State will be closer to 
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the truth than the estimate from Johns Hopkins because it is 
based on a larger sample size. 


e. We expect that the estimate from Johns Hopkins will be about 
the same distance from the truth as the estimate from Ohio 
State because both samples are 3% of their populations. 


T7.6 A researcher initially plans to take an SRS of size 160 from a 
certain population and calculate the sample mean x =. Later, the 
researcher decides to increase the sample size so that the standard 
deviation of the sampling distribution of x # will be half as big as 
when using a sample size of 160. What sample size should the 
researcher use? 

a. 40 

b. 80 

c. 320 

d. 640 

e. There is not enough information to determine the sample size. 

T7.7 The student newspaper at a large university asks an SRS of 250 
undergraduates, “Do you favor eliminating the carnival from the 
term-end celebration?” All in all, 150 of the 250 are in favor. 
Suppose that (unknown to you) 55% of all undergraduates favor 
eliminating the carnival. If you took a very large number of SRSs 
of size n=250n = 250 from this population, the sampling 
distribution of the sample proportion p4? would be 
a. exactly Normal with mean 0.55 and standard deviation 0.03. 


b. approximately Normal with mean 0.55 and standard deviation 
0.03. 


c. exactly Normal with mean 0.60 and standard deviation 0.03. 


d. approximately Normal with mean 0.60 and standard deviation 
0.03. 


e. heavily skewed with mean 0.55 and standard deviation 0.03. 


T7.8 Which of the following statements about the sampling distribution 
of the sample mean is incorrect? 
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a. The standard deviation of the sampling distribution will 
decrease as the sample size increases. 


b. The standard deviation of the sampling distribution measures 
how far the sample mean typically varies from the population 
mean. 


c. The sample mean is an unbiased estimator of the population 
mean. 


d. The sampling distribution shows how the sample mean is 
distributed around the population mean. 


e. The sampling distribution shows how the sample is distributed 
around the sample mean. 


T7.9 A newborn baby has extremely low birth weight (ELBW) if it 
weighs less than 1000 grams. A study of the health of such 
children in later years examined a random sample of 219 children. 
Their mean weight at birth was x = 810 grams. This sample 
mean is an unbiased estimator of the mean weight p in the 
population of all ELBW babies, which means that 


a. in all possible samples of size 219 from this population, the 
mean of the values of x Z will equal 810. 


b. in all possible samples of size 219 from this population, the 
mean of the values of x Z will equal p. 


c. as we take larger and larger samples from this population, x @ 
will get closer and closer to p. 


d. in all possible samples of size 219 from this population, the 
values of x will have a distribution that is close to Normal. 


e. the person measuring the children’s weights does so without 
any error. 


T7.10 Suppose that you are a student aide in the library and agree to be 
paid according to the “random pay” system. Each week, the 
librarian flips a coin. If the coin comes up heads, your pay for the 
week is $80. If it comes up tails, your pay for the week is $40. You 
work for the library for 100 weeks. Suppose we choose an SRS of 
2 weeks and calculate your average earnings x £. The shape of the 
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sampling distribution of x % will be 
a. Normal. 

b. approximately Normal. 

c. right-skewed. 

d. left-skewed. 


e. symmetric but not Normal. 


Section II: Free Response Show all your work. Indicate clearly the 
methods you use, because you will be graded on the correctness of your 
methods as well as on the accuracy and completeness of your results and 
explanations. 


T7.11 Here are histograms of the values taken by three sample statistics in 
several hundred samples from the same population. The true value 
of the population parameter is marked with an arrow on each 
histogram. 


Which statistic would provide the best estimate of the parameter? 
Justify your answer. 


T7.12 The amount that households pay service providers for access to the 
Internet varies quite a bit, but the mean monthly fee is $50 and the 
standard deviation is $20. The distribution is not Normal: many 
households pay a low rate as part of a bundle with phone or 
television service, but some pay much more for Internet only or for 
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faster connections.! A sample survey asks an SRS of 50 
households with Internet access how much they pay. Let x = be 
the mean amount paid. 


a. Explain why you can’t determine the probability that the 
amount a randomly selected household pays for access to the 
Internet exceeds $55. 


b. What are the mean and standard deviation of the sampling 
distribution of x £? 


c. What is the shape of the sampling distribution of x £? Justify 
your answer. 


d. Find the probability that the average fee paid by the sample of 
households exceeds $55. 


T7.13 According to government data, 22% of American children under 
the age of 6 live in households with incomes less than the official 
poverty level. A study of learning in early childhood chooses an 
SRS of 300 children from one state and finds that p/P = 0.29. 


a. Find the probability that at least 29% of the sample are from 
poverty-level households, assuming that 22% of all children 
under the age of 6 in this state live in poverty-level households. 


b. Based on your answer to part (a), is there convincing evidence 
that the percentage of children under the age of 6 living in 
households with incomes less than the official poverty level in 
this state is greater than the national value of 22%? Explain 
your reasoning. 
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Cumulative AP® Practice Test 2 


Section I: Multiple Choice Choose the best answer for each question. 


AP2.1 The five-number summary for a data set is given by min = 5, Q, = 
18, median = 20, Q3 = 40, max = 75. If you wanted to construct a 


boxplot for the data set that would show outliers, if any existed, 
what would be the maximum possible length of the right-side 
“whisker”? 


a. 33 
b. 35 
c. 45 
d. 53 
e. 55 
AP2.2 The probability distribution for the number of heads in four tosses 


of a coin is given by 


Number of heads 0 1 2 3 4 
Probability 0.0625 0.2500 0.3750 0.2500 # 0.0625 


The probability of getting at least one tail in four tosses of a coin is 
a. 0.2500. 
b. 0.3125. 
c. 0.6875. 
d. 0.9375. 
e. 0.0625. 


AP2.3 In a certain large population of adults, the distribution of IQ scores 
is strongly left-skewed with a mean of 122 and a standard deviation 
of 5. Suppose 200 adults are randomly selected from this 
population for a market research study. For SRSs of size 200, the 
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distribution of sample mean IQ score is 

a. left-skewed with mean 122 and standard deviation 0.35. 

b. exactly Normal with mean 122 and standard deviation 5. 

c. exactly Normal with mean 122 and standard deviation 0.35. 

d. approximately Normal with mean 122 and standard deviation 5. 


e. approximately Normal with mean 122 and standard deviation 
0.35. 


AP2.4 A 10-question multiple-choice exam offers 5 choices for each 
question. Jason just guesses the answers, so he has probability 1/5 
of getting any one answer correct. You want to perform a 
simulation to determine the number of correct answers that Jason 
gets. What would be a proper way to use a table of random digits to 
do this? 


a. One digit from the random digit table simulates one answer, 
with 5 = correct and all other digits = incorrect. Ten digits from 
the table simulate 10 answers. 


b. One digit from the random digit table simulates one answer, 
with 0 or 1 = correct and all other digits = incorrect. Ten digits 
from the table simulate 10 answers. 


c. One digit from the random digit table simulates one answer, 
with odd = correct and even = incorrect. Ten digits from the 
table simulate 10 answers. 


d. One digit from the random digit table simulates one answer, 
with O or 1 = correct and all other digits = incorrect, ignoring 
repeats. Ten digits from the table simulate 10 answers. 


e. Two digits from the random digit table simulate one answer, 
with 00 to 20 = correct and 21 to 99 = incorrect. Ten pairs of 
digits from the table simulate 10 answers. 


AP2.5 Suppose we roll a fair die four times. What is the probability that a 
6 occurs on exactly one of the rolls? 


a. Met C3 (3) 
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b. jax ( 
c. saseeaths) (3) 
d. aged) ( 


e. g(16)1(36)3°C8) (3) 


AP2.6 On one episode of his show, a radio show host encouraged his 
listeners to visit his website and vote in a poll about proposed tax 
increases. Of the 4821 people who vote, 4277 are against the 
proposed increases. To which of the following populations should 
the results of this poll be generalized? 


a. All people who have ever listened to this show 

b. All people who listened to this episode of the show 
c. All people who visited the show host’s website 

d. All people who voted in the poll 

e. All people who voted against the proposed increases 


AP2.7 The number of unbroken charcoal briquets in a 20-pound bag filled 
at the factory follows a Normal distribution with a mean of 450 
briquets and a standard deviation of 20 briquets. The company 
expects that a certain number of the bags will be underfilled, so the 
company will replace for free the 5% of bags that have too few 
briquets. What is the minimum number of unbroken briquets the 
bag would have to contain for the company to avoid having to 
replace the bag for free? 


a. 404 
b. 411 
c. 418 
d. 425 
e. 448 


AP2.8 You work for an advertising agency that is preparing a new 
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television commercial to appeal to women. You have been asked to 
design an experiment to compare the effectiveness of three 
versions of the commercial. Each subject will be shown one of the 
three versions and then asked to reveal her attitude toward the 
product. You think there may be large differences in the responses 
of women who are employed and those who are not. Because of 
these differences, you should use 


a. a block design, but not a matched pairs design. 
b. acompletely randomized design. 

c. a matched pairs design. 

d. asimple random sample. 

e. a stratified random sample. 


AP2.9 Suppose that you have torn a tendon and are facing surgery to 
repair it. The orthopedic surgeon explains the risks to you. 
Infection occurs in 3% of such operations, the repair fails in 14%, 
and both infection and failure occur together 1% of the time. What 
is the probability that the operation is successful for someone who 
has an operation that is free from infection? 


a. 0.8342 
b. 0.8400 
c. 0.8600 
d. 0.8660 
e. 0.9900 


AP2.10 Social scientists are interested in the association between high 
school graduation rate (HSGR, measured as a percent) and the 
percent of U.S. families living in poverty (POV). Data were 
collected from all 50 states and the District of Columbia, and a 
regression analysis was conducted. 

The resulting least-squares regression line is given by 


POVA=59.2-0.620(HSGR)POV = 59.2 — 0.620 (HSGR) with 
r2=0.802r? = 0.802, Based on the information, which of the 
following is the best interpretation for the slope of the least- 
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Squares regression line? 


a. For each 1% increase in the graduation rate, the percent of 
families living in poverty is predicted to decrease by 
approximately 0.896. 


b. For each 1% increase in the graduation rate, the percent of 
families living in poverty is predicted to decrease by 
approximately 0.802. 


c. For each 1% increase in the graduation rate, the percent of 
families living in poverty is predicted to decrease by 
approximately 0.620. 


d. For each 1% increase in the percent of families living in 
poverty, the graduation rate is predicted to decrease by 
approximately 0.802. 


e. For each 1% increase in the percent of families living in 
poverty, the graduation rate is predicted to decrease by 
approximately 0.620. 


Questions AP2.11—AP2.13 refer to the following graph. Here is a dotplot 
of the adult literacy rates in 177 countries in a recent year, according to the 
United Nations. For example, the lowest literacy rate was 23.6%, in the 
African country of Burkina Faso. Mali had the next lowest literacy rate at 
24.0%. 


Literacy rate (%) 


AP2.11 The overall shape of this distribution is 
a. Clearly skewed to the right. 


1058 


C. 
d. 


e. 


clearly skewed to the left. 
roughly symmetric. 
uniform. 


There is no clear shape. 


AP2.12 The mean of this distribution (don’t try to find it) will be 


a. 
b. 
C. 
d. 


e. 


very close to the median. 

greater than the median. 

less than the median. 

You can’t say, because the distribution isn’t symmetric. 


You can’t say, because the distribution isn’t Normal. 


AP2.13 The country with a literacy rate of 49% is closest to which of the 
following percentiles? 


a. 
b. 
C. 
d. 


e. 


6th 

10th 
28th 
49th 


There is not enough information to calculate the percentile. 


AP2.14 The correlation between the age and height of children under the 
age of 12 is found to be r = 0.60. Suppose we use the age x of a 
child to predict the height y of the child. What can we conclude? 


a. The height is generally 60% of a child’s age. 


About 60% of the time, age will accurately predict height. 


Thirty-six percent of the variation in height is accounted for by 
the linear model relating height to age. 


. For every 1 year older a child is, the regression line predicts an 


increase of 0.6 foot in height. 


Thirty-six percent of the time, the least-squares regression line 
accurately predicts height from age. 


AP2.15 An agronomist wants to test three different types of fertilizer (A, 
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B, and C) on the yield of a new variety of wheat. The yield will be 
measured in bushels per acre. Six 1-acre plots of land were 
randomly assigned to each of the three fertilizers. The treatment, 
experimental unit, and response variable are, respectively, 


a. aspecific fertilizer, bushels per acre, a plot of land. 

b. variety of wheat, bushels per acre, a specific fertilizer. 
c. variety of wheat, a plot of land, wheat yield. 

d. aspecific fertilizer, a plot of land, wheat yield. 

e. a specific fertilizer, the agronomist, wheat yield. 

AP2.16 According to the U.S. Census, the proportion of adults in a certain 
county who owned their own home was 0.71. An SRS of 100 
adults in a certain section of the county found that 65 owned their 
home. Which one of the following represents the approximate 
probability of obtaining a sample of 100 adults in which 65 or 
fewer own their home, assuming that this section of the county has 


the same overall proportion of adults who own their home as does 
the entire county? 


100 

. eee 65 
64 

. (10065)(0.29)65(0.71)35 \ 65 


(0.71) (0.29) 


jab) 


) (0.29) (0.71) 


b 
Pi{z< =" 
c. P(z<0.65-0.71(0.71)(0.29)100) {0 
Pia 
d. P(z<0.65-0.71(0.65)(0.35)100) Eo 


P ( 0.65—0.71 
“ — (0.71)(0.29) 
e. P(z<0.65-0.71(0.71)(0.29)100) 106 


AP2.17 Which one of the following would be a correct interpretation if 
you have a z-score of +2.0 on an exam? 
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a. It means that you missed two questions on the exam. 


b. It means that you got twice as many questions correct as the 
average student. 


c. It means that your grade was 2 points higher than the mean 
grade on this exam. 


d. It means that your grade was in the upper 2% of all grades on 
this exam. 


e. It means that your grade is 2 standard deviations above the 
mean for this exam. 


AP2.18 Records from a dairy farm yielded the following information on 
the number of male and female calves born at various times of the 


day. 
Time of day 
Males 
Gender 
Females 
Total 


What is the probability that a randomly selected calf was born in 
the night or was a female? 

369 
a. 369513513 


485 


b. 485513513 
c. 116513513 


d. 116252 252 
116 

e, 116233 233 

AP2.19 When people order books from a popular online source, they are 
shipped in boxes. Suppose that the mean weight of the boxes is 1.5 
pounds with a standard deviation of 0.3 pound, the mean weight of 
the packing material is 0.5 pound with a standard deviation of 0.1 
pound, and the mean weight of the books shipped is 12 pounds 
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with a standard deviation of 3 pounds. Assuming that the weights 
are independent, what is the standard deviation of the total weight 
of the boxes that are shipped from this source? 


a. 1.84 
b. 2.60 
c. 3.02 
d. 3.40 
e. 9.10 


AP2.20 A grocery chain runs a prize game by giving each customer a 
ticket that may win a prize when the box is scratched off. Printed 
on the ticket is a dollar value ($500, $100, $25) or the statement 
“This ticket is not a winner.” Monetary prizes can be redeemed for 
groceries at the store. Here is the probability distribution of the 
amount won on a randomly selected ticket: 


Amount won $500 $100 $25 $0 
Probability 0.01 0.05 0.20 0.74 


Which of the following are the mean and standard deviation, 
respectively, of the winnings? 


a. $15.00, $2900.00 
b. $15.00, $53.85 
c. $15.00, $26.93 
d. $156.25, $53.85 
e. $156.25, $26.93 


AP2.21 A large company is interested in improving the efficiency of its 
customer service and decides to examine the length of the business 
phone calls made to clients by its sales staff. Here is a cumulative 
relative frequency graph from data collected over the past year. 
According to the graph, the shortest 80% of calls will take how 
long to complete? 


a. Less than 10 minutes 
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b. At least 10 minutes 


A 


Exactly 10 minutes 


jar 


. At least 5.5 minutes 


e. Less than 5.5 minutes 


100 5 


Percent 


oe i T T T T T = Ae 
0 2 - 6 8 10 12 14 16 


Length of phone call (min) 


Section II: Free Response Show all your work. Indicate clearly 
the methods you use, because you will be graded on the correctness 
of your methods as well as on the accuracy and completeness of 
your results and explanations. 


AP2.22 A health worker is interested in determining if omega-3 fish oil 
can help reduce cholesterol in adults. She obtains permission to 
examine the health records of 200 people in a large medical clinic 
and classifies them according to whether or not they take omega-3 
fish oil. She also obtains their latest cholesterol readings and finds 
that the mean cholesterol reading for those who are taking omega-3 
fish oil is 18 points less than the mean for the group not taking 
omega-3 fish oil. 


a. Is this an observational study or an experiment? Justify your 
answer. 


b. Explain the concept of confounding in the context of this study 
and give one example of a variable that could be confounded 
with whether or not people take omega-3 fish oil. 


c. Researchers find that the 18-point difference in the mean 
cholesterol readings of the two groups is statistically significant. 
Can they conclude that omega-3 fish oil is the cause? Why or 
why not? 
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AP2.23 The scatterplot shows the relationship between the number of 
yards allowed by teams in the National Football League and the 
number of wins for that team in a recent season, along with the 
least-squares regression line. Computer output is also provided. 


Term Coef SE Coef T-Value P-Value 
Constant 25.66 Shel, 4.78 0.000 
Yards_allowed-0.003131 0.000948 —3.30 0.002 
S = 2.65358 R-Sq = 26.65% R-Sq(adj) = 24.21% 


T T T T T 
4500 5000 $500 6000 6500 7000 


Yards allowed 


a. State the equation of the least-squares regression line. Define 
any variables you use. 


b. Calculate and interpret the residual for the Seattle Seahawks, 
who allowed 4668 yards and won 10 games. 


c. The Carolina Panthers allowed 5167 yards and won 15 games. 
What effect does the point representing the Panthers have on the 
equation of the least-squares regression line? Explain. 


AP2.24 Every 17 years, swarms of cicadas emerge from the ground in the 
eastern United States, live for about six weeks, and then die. (There 
are several different “broods,” so we experience cicada eruptions 
more often than every 17 years.) There are so many cicadas that 
their dead bodies can serve as fertilizer and increase plant growth. 
In a study, a researcher added 10 dead cicadas under 39 randomly 
selected plants in a natural plot of American bellflowers on the 
forest floor, leaving other plants undisturbed. One of the response 
variables measured was the size of seeds produced by the plants. 
Here are the boxplots and summary statistics of seed mass (in 
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milligrams) for 39 cicada plants and 33 undisturbed (control) 


plants: 

n Minimum Q, Median Q3 Maximum 
Cicada plants 390.17 0.22 0.25 0.28 0.35 
Control plants 330.14 0.19 0.25 0.26 0.29 


Cicada 
plants 


Control | 
plants 


T I ! T T 
0.15 0.20 0.25 0.30 0.35 
Seed mass (mg) 


a. Write a few sentences comparing the distributions of seed mass 
for the two groups of plants. 


b. Based on the graphical displays, which distribution likely has 
the larger mean? Justify your answer. 


c. Explain the purpose of the random assignment in this study. 


d. Name one benefit and one drawback of only using American 
bellflowers in the study. 


AP2.25 In a city library, the mean number of pages in a novel is 525 with 
a standard deviation of 200. Approximately 30% of the novels have 
fewer than 400 pages. Suppose that you randomly select 50 novels 
from the library. 


a. What is the probability that the average number of pages in the 
sample is less than 500? 


b. What is the probability that at least 20 of the novels have fewer 
than 400 pages? 


1065 


ing with 


C 


Ima 


Chapter G Est 


Confidence 


Bill Gozansky/Alamy 


Introduction 


1066 


Chapter 8 AP® Statistics Practice Test 


1067 


INTRODUCTION 


How long does a battery last on the newest iPhone, on average? What 
proportion of college undergraduates attended all of their classes last 
week? How much does the weight of a quarter-pound hamburger at a fast- 
food restaurant vary after cooking? These are the types of questions we 
would like to answer. 


anopdesignstock/Getty Images 


It wouldn’t be practical to determine the lifetime of every iPhone 
battery, to ask all undergraduates about their attendance, or to weigh every 
burger after cooking. Instead, we choose a random sample of individuals 
(batteries, undergraduates, burgers) to represent the population and collect 
data from those individuals. From what we learned in Chapter 4, if we 
randomly select the sample, we should be able to generalize our results to 
the population of interest. However, we cannot be certain that our 
conclusions are correct—a different sample would probably yield a 
different estimate. Probability helps us account for the chance variation 
due to random selection or random assignment. 

Chapter 8 begins the formal study of statistical inference—using 
information from a sample to draw conclusions about a population 
parameter such as p or pl. This is an important transition from Chapter 7, 
where you were given information about a population and asked questions 
about the distribution of a sample statistic, such as the sample proportion 
pip or the sample mean x .@. 


The following activity gives you an idea of what lies ahead. 


ACTIVITY The mystery mean 
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In this activity, each team of 3 to 4 students will try to estimate the 
mystery value of the population mean p that your teacher selected 


before class.4 


1. Before class, your teacher stored a value of p (represented by M) in 
the display calculator and then cleared the home screen so you can’t 
see the value of M. 


2. With the class watching, the teacher will execute the following 
command: 


mean(andNorm(M,20,16))™e22(andNorm(M, 20, 16)) 


This tells the calculator to choose an SRS of 16 observations from a 
Normal population with mean M and standard deviation 20 and then 
compute the mean x = of those 16 sample values. Do you believe 
that the sample mean shown is equal to the mystery mean M? 
Explain. 


3. Determine an interval of believable values for the population mean J. 
Use the result from Step 2 and what you learned about sampling 
distributions in the previous chapter. 


4. Share your team’s results with the class. 


In this chapter and the next, we will introduce the two most common 
types of formal statistical inference. Chapter 8 concerns confidence 
intervals for estimating the value of a parameter. Chapter 9 presents 
significance tests, which assess the evidence for a claim about a parameter. 
Both types of inference are based on the sampling distributions you 
studied in Chapter 7. 

In this chapter, we start by presenting the idea of a confidence interval 
in a general way that applies to estimating any unknown parameter. In 
Section 8.2, we show how to estimate a population proportion using a 
confidence interval. Section 8.3 focuses on confidence intervals for a 
population mean. 


1069 


SECTION 8.1 Confidence Intervals: The Basics 


LEARNING TARGETS By the end of the section, you should be able to: 


Identify an appropriate point estimator and calculate the value of a 
point estimate. 


Interpret a confidence interval in context. 


Determine the point estimate and margin of error from a confidence 
interval. 


Use a confidence interval to make a decision about the value of a 
parameter. 


Interpret a confidence level in context. 


Describe how the sample size and confidence level affect the 
margin of error. 


Explain how practical issues like nonresponse, undercoverage, and 
response bias can affect the interpretation of a confidence interval. 


Mr. Schiel’s class did the “Mystery mean” activity from the Introduction. 
The TI screen shot displays the information that the students received 
about the unknown population mean p. Here is a summary of what the 
class said about the calculator output: 


e The population distribution is Normal and its standard deviation is 
o=20.0 = 20. 


e A simple random sample of n=16n = 16 observations was taken from 
this population. 


e The sample mean is x =240.80.% = 240.80. 
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mean(CrandNorm(M, 20,16)) 


If we had to give a single number to estimate the value of M that Mr. 
Schiel chose, what would it be? Because the sample mean x @ is an 
unbiased estimator of the population mean p, we use the statistic x asa 
point estimator of the parameter p. The best guess for the value of p is 
x =240.80f = 240.80. This value is known as a point estimate. 


DEFINITION Point estimator, Point estimate 


A point estimator is a statistic that provides an estimate of a 
population parameter. 


The value of that statistic from a sample is called a point estimate. 


As we saw in Chapter 7, the ideal point estimator will have no bias and 
little variability. Here’s an example involving some of the more common 
point estimators. 


EXAMPLE | From batteries to smoking » 


Point estimators 
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TPopova/Getty Images 


PROBLEM: Identify the point estimator you would use to estimate the 
parameter in each of the following settings and calculate the value of the 
point estimate. 


a. 


Quality control inspectors want to estimate the mean lifetime p of the 
AA batteries produced each hour at a factory. They select a random 
sample of 50 batteries during each hour of production and then drain 
them under conditions that mimic normal use. Here are the lifetimes 
(in hours) of the batteries from one such sample: 


16:73 19.60 1631 1757 16:14 17.28 16,67 17.26 17,27 17-90 
15.59 17.54 16.46 15.63 16.82 17.16 16.62 16.71 16.69 17.98 
15.99 15.64 17.20 17.24 16.68 16.55 17.48 15.58 17.61 15.98 
15.46 16.50 16.19 16.36 17.80 16.61 16.99 16.93 16.01 16.46 
17.54 17.41 16.91 16.60 16.78 15.75 17.31 16.50 16.72 17.55 


. What proportion p of U.S. adults would classify themselves as vegan 


or vegetarian? A Pew Research Center report surveyed 1473 


randomly selected U.S. adults. Of these, 124 said they were vegan or 


vegetarian.2 


. The quality control inspectors in part (a) want to investigate the 


variability in battery lifetimes by estimating the population standard 
deviation o. 


SOLUTION: 


a. Use the sample mean x = as a point estimator for the population 
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mean pl. The point estimate is x _=16.73+15.60+-+17.5550=16.718 
7s 16.73+-15,.60+- --+17,55 — 16.718 


50 hours. 


. Use the sample proportion p’? as a point estimator for the 
population proportion p. The point estimate is p\=1241473=0.084. 


~ — 124 _ : 
p= 74 = 0.084. 


. Use the sample standard deviation s, as a point estimator for the 
population standard deviation o. The point estimate is s, = 0.664 


hour. 


FOR PRACTICE, TRY EXERCISE 1 


The Idea of a Confidence Interval 


When Mr. Schiel’s class did the “Mystery mean” activity, they obtained a 
sample mean of x =240.80.% = 240.80. Then they added and subtracted 
10 to get the interval of plausible values from 230.80 to 250.80. Where did 
the 10 come from? Their reasoning was based on the sampling distribution 
of the sample mean from Chapter 7: 


Because the population distribution is Normal, the sampling distribution 
of x Z is also Normal. 
In about 95% of samples, the value of x will be within 2 standard 


(20. = 2-4 _ 9/5) = 10) 
16 


deviations (20x=22016=2(5)=10) of the 


mystery mean J. 
Therefore, in about 95% of samples, the value of the mystery mean , 
will be within 2 standard deviations (20x =22016=2(5)=10) 

_ — 9.20, _ = 
(20; = 22. = 2(5) = 10) ; 


fx &, 
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(unknown) 


—-— Values of ¥ -———> 


When the estimate of a parameter is reported as an interval of values, it is 
called an interval estimate, or confidence interval. 


DEFINITION Confidence interval 


A confidence interval gives an interval of plausible values for a 
parameter based on sample data. 


Plausible does not mean the same thing as possible. You could argue 
that just about any value of a parameter is possible. Plausible means that 
we shouldn’t be surprised if any one of the values in the interval is equal to 
the value of the parameter. Based on their calculations, the class shouldn’t 
be surprised if Mr. Schiel revealed that the mystery mean was any value 
from 230.80 to 250.80. However, it would be surprising if the mystery 
mean was less than 230.80 or greater than 250.80. 


We use an interval of plausible values rather than a single point 
estimate to increase our confidence that we have a correct value for the 
parameter. Of course, as the cartoon illustrates, there is a trade-off between 
the amount of confidence we have that our estimate is correct and how 
much information the interval provides. 
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Garfield ©1999 Paws, Inc. Distributed by Andrews McMeel Syndication. Reprinted by permission. All rights reserved. 


Confidence intervals are constructed so that we know how much 
confidence we should have in the interval. The most common confidence 
level is 95%. You will learn how to interpret confidence levels shortly. 


DEFINITION Confidence level 


The confidence level C gives the overall success rate of the method 
used to calculate the confidence interval. That is, in C% of all 
possible samples, the interval computed from the sample data will 
capture the true parameter value. 


The Associated Press and the NORC Center for Public Affairs 
Research recently asked a random sample of U.S. adults how much 
financial difficulty they would experience if they had to pay an unexpected 
bill of $1000 right away. Overall, 65% of respondents admitted they would 
have “a little” or “a lot” of difficulty. A summary of the study reported that 
the 95% confidence interval for the proportion of U.S. adults who would 
admit to experiencing some financial difficulty is 0.613 to 0.687. That is, 
they are 95% confident that the interval from 0.613 to 0.687 captures the 
true proportion of all U.S. adults who would admit to experiencing some 
financial difficulty paying an unexpected bill of $1000 right away. 


AP® EXAM TIP 


When interpreting a confidence interval, make sure that you are 
describing the parameter and not the statistic. It’s wrong to say that we 
are 95% confident the interval from 0.613 to 0.687 captures the 
proportion of U.S. adults who admitted they would experience financial 
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difficulty. The “proportion who admitted they would experience 
financial difficulty” is the sample proportion, which is known to be 
0.65. The interval gives plausible values for the proportion who would 
admit to experiencing some financial difficulty if asked. 


INTERPRETING A CONFIDENCE INTERVAL 


To interpret a C% confidence interval for an unknown parameter, say, 
“We are C% confident that the interval from to captures 
the [parameter in context].” 


To create an interval of plausible values for a parameter, we need two 
components: a point estimate to use as the midpoint of the interval and a 
margin of error to account for sampling variability. The structure of a 
confidence interval is 


point estimate + margin of errorPoimt estimate + margin of error 


We can visualize a C% confidence interval like this: 
Point estimate 


Margin of error <—_—» Margin of error 


C% confidence interval for parameter 


Earlier, we learned that the 95% confidence interval for the proportion 
of all U.S. adults who would admit to experiencing some financial 
difficulty paying an unexpected bill of $1000 right away is 0.613 to 0.687. 
This interval could also be expressed as 


0.65+0.0370.65 + 0.037 


Point estimate 


Margin of error << » Margin of error 


0.60 0.65 0.70 


0.65 — 0.037 = 0.613 0.65 + 0.037 = 0.687 
95% confidence interval for p = proportion of all U.S, adults 


who would admit to experiencing some financial difficul ty 


Confidence intervals reported in the media are often presented as a point 
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estimate and a margin of error. 


DEFINITION Margin of error 


The margin of error of an estimate describes how far, at most, we 
expect the estimate to vary from the true population value. That is, in 
a C% confidence interval, the distance between the point estimate and 
the true parameter value will be less than the margin of error in C% of 
all samples. 


In addition to estimating a parameter, we can also use confidence 
intervals to assess claims about a parameter, as in the following example. 


EXAMPLE | Who will win the election? » 


Interpreting a confidence interval 


PROBLEM: Two weeks before a presidential election, a polling 
organization asked a random sample of registered voters the following 
question: “If the presidential election were held today, would you vote 
for Candidate A or Candidate B?” Based on this poll, the 95% 
confidence interval for the population proportion who favor Candidate 
A is (0.48, 0.54). 


Burlingham/Shutterstock.com 


a. Interpret the confidence interval. 


b. What is the point estimate that was used to create the interval? What 
is the margin of error? 
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c. Based on this poll, a political reporter claims that the majority of 
registered voters favor Candidate A. Use the confidence interval to 
evaluate this 


SOLUTION: 


a. We are 95% confident that the interval from 0.48 to 0.54 captures the 
true proportion of all registered voters who favor Candidate A in the 
election. 


= 0.51 


point estimate=0.48+0.542=0.51 point estimate ca 0.54 


margin of error=0.54—0.51=0.03 margin oferror = 0.54 — 0.51 = 0.03 


The point estimate is the midpoint of the interval. The margin of error 


is the distance from the point estimate to the endpoints of the interval. 


c. Because there are plausible values of p less than or equal to 0.50 in 
the confidence interval, the interval does not give convincing 
evidence that a majority (more than 50%) of registered voters favor 
Candidate A. 


Any value from 0.48 to 0.54 is a plausible value for the true 


proportion who favor Candidate A. 


FOR PRACTICE, TRY EXERCISE 5 


Interpreting Confidence Level 


What does it mean to be 95% confident? The following activity gives you 
a chance to explore the meaning of the confidence level. 


prPLe> 


ACTIVITY The Confidence Intervals applet ¢ 
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In this activity, you will use the Confidence Intervals applet to learn 
what it means to say that we are “95% confident” that our confidence 
interval captures the parameter value. 


1. Go to highschool.bfwpub.com/tps6e and launch the applet. Use the 
default settings: confidence level 95% and sample size n = 20. The 
graphs at the top of the display show the distribution of the population 
and the sampling distribution of the sample mean x £ for the sample 
size chosen. Both are centered at the population mean p. 


2. Click “Sample” to choose an SRS and display the resulting 
confidence interval. The 20 values in the sample are marked with 
yellow dots. The confidence interval is displayed as a horizontal line 
segment with a dot representing the sample mean x = in the middle 
of the interval. 


3. Did the interval capture the population mean p (what the applet calls a 
“hit”)? Click “Sample” a total of 10 times. How many of the intervals 
captured the population mean pl? Note: So far, you have used the 
applet to take 10 SRSs, each of size n = 20. Be sure you understand 
the difference between sample size and the number of samples taken. 


4. Reset the applet. Click “Sample 25” twice to choose 50 SRSs and 
display the confidence intervals based on those samples. How many 
intervals captured the parameter p/? Keep clicking “Sample 25” and 
observe the value of “Percent hit.” What do you notice? 
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— Population Distribution “iy 
Confidence Level (C): 95 — - Sampling Distribution : \ 


Sample Size (n): 20 


=a 
SAMPLE 
SAMPLE 25 


Hit! 48 Total: 50 


Percent hit: 96% 
RESET 
—— 


5. Repeat Step 4 using a 90% confidence level. 
6. Repeat Step 4 using an 80% confidence level. 


7. Summarize what you have learned about the relationship between 
confidence level and “Percent hit” after taking many samples. 


We will investigate the effect of changing the sample size later. 


As the activity confirms, the confidence level is the overall capture 
rate if the method is used many times. 


INTERPRETING A CONFIDENCE LEVEL 


To interpret a confidence level C, say, “If we were to select many 


random samples from a population and construct a C% confidence 
interval using each sample, about C% of the intervals would capture the 
[parameter in context].” 
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Let’s revisit the presidential election poll to practice interpreting a 
confidence level. 


EXAMPLE | Another look at the election poll » 


Interpreting a confidence level 


PROBLEM: Two weeks before a presidential election, a polling 
organization asked a random sample of registered voters the following 
question: “If the presidential election were held today, would you vote 
for Candidate A or Candidate B?” Based on this poll, the 95% 
confidence interval for the population proportion who favor Candidate 
A is (0.48, 0.54). Interpret the confidence level. 


SOLUTION: 


If we were to select many random samples of registered voters and 
construct a 95% confidence interval using each sample, about 95% of 
the intervals would capture the true proportion of all registered voters 
who favor Candidate A in the election. 


Remember that interpretations of confidence level are about the method 
used to construct the interval—not one particular interval. In fact, we can 


interpret confidence levels before data are collected! 


FOR PRACTICE, TRY EXERCISE 11 


In the preceding example, there are only two possibilities: 


. The interval from 0.48 to 0.54 captures the population proportion p. Our 
random sample was one of the many samples for which the difference 
between p and p/? is less than the margin of error. When using a 95% 
confidence level, about 95% of samples result in a confidence interval 
that captures p. 


’, The interval from 0.48 to 0.54 does not capture the population 
proportion p. Our random sample was one of the few samples for which 
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the difference between p and p/P is greater than the margin of error. 
When using a 95% confidence level, only about 5% of all samples result 
in a confidence interval that fails to capture p. 


Without conducting a census, we cannot know whether our sample is 
one of the 95% for which the interval captures p or whether it is one of the 
unlucky 5% that does not. The statement that we are “95% confident” is 
shorthand for saying, “We got these numbers using a method that gives 
correct results for 95% of samples.” 


AP® EXAM TIP 


On a given problem, you may be asked to interpret the confidence 
interval, the confidence level, or both. Be sure you understand the 
difference: the confidence interval gives a set of plausible values for the 
parameter and the confidence level describes the overall capture rate of 
the method. 


oOo The confidence level does not tell us the probability that a 
particular confidence interval captures the population parameter. 
Once a particular confidence interval is calculated, its endpoints are fixed. 
And because the value of a parameter is also a constant, a particular 
confidence interval either includes the parameter (probability = 1) or 
doesn’t include the parameter (probability = 0). As Figure 8.1 (on the next 
page) illustrates, no individual 95% confidence interval has a 95% 
probability of capturing the true parameter value. 
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— Population Distribution ; 
Confidence Level (C): 95 ~- - Sampling Distribution ak 


Sample Size (n): 20 
= 


SAMPLE 
Each of the black 


intervals captures 
ut with probability = 1 


SAMPLE 25 


Hit: 48 Total: 50 
Percent hit: 96% 


RESET 


FIGURE 8.1 Image from the Confidence Intervals applet showing that the 
probability a particular 95% confidence interval captures the true parameter value 
is either 0 or 1 (and not 0.95). 


CHECK YOUR UNDERSTANDING 


The Pew Research Center and Smithsonian magazine recently 
quizzed a random sample of 1006 U.S. adults on their knowledge of 
science.? One of the questions asked, “Which gas makes up most of the 
Earth’s atmosphere: hydrogen, nitrogen, carbon dioxide, or oxygen?” A 
95% confidence interval for the proportion who would correctly answer 
nitrogen is 0.175 to 0.225. 


1. Interpret the confidence interval. 


2. Interpret the confidence level. 
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3. Calculate the point estimate and the margin of error. 


4. If people guess one of the four choices at random, about 25% should 
get the answer correct. Does this interval provide convincing 
evidence that less than 25% of all U.S. adults would answer this 
question correctly? Explain your reasoning. 


What Affects the Margin of Error? 


Why settle for 95% confidence when estimating an unknown parameter? 
Do larger random samples yield “better” intervals? The Confidence 
Intervals applet will shed some light on these questions. 


ACTIVITY Exploring margin of error with the Confidence 


prPle> 


Intervals applet < 


In this activity, you will use the applet to explore the relationship 
between the confidence level, the sample size, and the margin of error. 


Part 1: Adjusting the Confidence Level 


1. Go to highschool.bfwpub.com/tps6e and launch the Confidence 
Intervals applet. Use the default settings: confidence level 95% and 
sample size n = 20. Click “Sample 25” to select 25 SRSs and make 25 
confidence intervals. 
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— Population Distribution “iy 
Confidence Level (C): 95 - - Sampling Distribution : \ 


Sample Size (n): 20 


= 
SAMPLE 


SAMPLE 25 


Hit! 23 Total: 25 
Percent hit: 92% 


RESET e 


. Change the confidence level to 99%. What happens to the length of 


the confidence intervals? 


. Now change the confidence level to 90%. What happens to the length 


of the confidence intervals? 


. Finally, change the confidence level to 80%. What happens to the 


length of the confidence intervals? 


. Summarize what you learned about the relationship between the 


confidence level and the margin of error for a fixed sample size. 


Part 2: Adjusting the Sample Size 


6. 


Reset the applet to the default settings: confidence level 95% and 
sample size n = 20. Click “Sample 25” to select 25 SRSs and make 25 
confidence intervals. 


. Using the slider, increase the sample size to n = 100. What do you 


notice about the length of the confidence intervals? 


. Using the slider, increase the sample size to n = 200. What do you 
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notice about the length of the confidence intervals? 


9. Summarize what you learned about the relationship between the 
sample size and the margin of error for a fixed confidence level. 


0. Does increasing the sample size increase the capture rate (percent 
hit)? Use the applet to investigate. 


As the activity illustrates, the price we pay for greater confidence is a 
wider interval. If we’re satisfied with 80% confidence, then our interval of 
plausible values for the parameter will be much narrower than if we insist 
on 90%, 95%, or 99% confidence. For example, here is an 80% confidence 
interval and a 95% confidence interval for Mr. Schiel’s mystery mean. 
Unfortunately, intervals constructed at an 80% confidence level will 
capture the true value of the parameter much less often than intervals that 
use a 95% confidence level. 


80% confidence 


e 95% confidence 


T T T T T T T T T T T T T 
230 232 234 236 238 240 242 244 246 248 250 252 254 


The activity also shows that we can get a more precise estimate of a 
parameter by increasing the sample size. Larger samples generally yield 
narrower confidence intervals at any confidence level. However, larger 
samples don’t affect the capture rate and cost more time and money to 
obtain. 


DECREASING THE MARGIN OF ERROR 


In general, we prefer an estimate with a small margin of error. The 
margin of error gets smaller when: 


e The confidence level decreases. To obtain a smaller margin of error 
from the same data, you must be willing to accept less confidence. 


e The sample size n increases. In general, increasing the sample size n 
reduces the margin of error for any fixed confidence level. 


To see why these facts are true, let’s look a bit more closely at the 
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method Mr. Schiel’s class used to calculate a confidence interval for his 
mystery mean. They started with a point estimate of x =240.80. 

& = 240.80. Then they added and subtracted 2 standard deviations to get 
the interval of plausible values from 230.80 to 250.80. 


We could rewrite this interval as 


point estimatetmargin of errorx +20x 240.80+22016 
point estimate + margin of error 
me Se: aoe 
20 


— 


if 1 6 


240.80 + 2 


This leads to the more general formula for a confidence interval: 


Statistic + (critical value) - (standard deviation of statistic) 
statistic + (critical value) - (standard deviation of statistic) 

The critical value depends on both the confidence level C and the 
sampling distribution of the statistic. Mr. Schiel’s class used a critical 
value of 2 to be 95% confident. If they wanted to be 99.7% confident, they 
could have gone 3 standard deviations in each direction. Greater 
confidence requires a larger critical value. 


DEFINITION Critical value 


The critical value is a multiplier that makes the interval wide enough 
to have the stated capture rate. 


The margin of error also depends on the standard deviation of the 
statistic. As you learned in Chapter 7, the sampling distribution of a 
Statistic will have a smaller standard deviation when the sample size is 
larger. This is why the margin of error decreases as you increase the 
sample size. 


WHAT THE MARGIN OF ERROR DOESN’T ACCOUNT FOR 
When we calculate a confidence interval, we include the margin of error 
because we expect the value of the point estimate to vary somewhat from 
the parameter. However, the margin of error accounts for only the 
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variability we expect from random sampling. It does not account for 
practical difficulties, such as undercoverage and nonresponse in a sample 
survey. These problems can produce estimates that are much farther from 
the parameter than the margin of error would suggest. Remember this 
unpleasant fact when reading the results of an opinion poll or other sample 
survey. rl The margin of error does not account for any sources of 


bias in the data collection process. 


EXAMPLE | What’s your GPA? (») 


Factors that affect the margin of error 


OJO Images Ltd/Alamy 


PROBLEM: As part of a project about response bias, Ellery surveyed a 
random sample of 25 students from her school. One of the questions in 
the survey required students to state their GPA aloud. Based on the 
responses, Ellery said she was 90% confident that the interval from 3.14 
to 3.52 captures the mean GPA for all students at her school.4 


a. Explain what would happen to the length of the interval if the 
confidence level were increased to 99%. 


b. How would a 90% confidence interval based on a sample of size 200 
compare to the original 90% interval? 


c. Describe one potential source of bias in Ellery’s study that is not 
accounted for by the margin of error. 


SOLUTION: 
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a. The confidence interval would be wider because increasing the 
confidence level increases the margin of error. 


To increase the confidence level (capture rate), we need to use a 


larger critical value, which increases the margin of error. 


b. The confidence interval would be narrower because increasing the 
sample size decreases the margin of error. 


c. The margin of error doesn’t account for the fact that many students 
might lie about their GPAs when having to respond without 
anonymity. The mean GPA for all students might be even less than 
3.14! 


Increasing the sample size decreases the standard deviation of the 


sampling distribution of the sample mean (assuming the standard 


deviation of the sample doesn’t change). 


FOR PRACTICE, TRY EXERCISE 19 


Section 8.1 Summary 


e To estimate an unknown population parameter, start with a statistic that 
will provide a reasonable guess. The chosen statistic is a point 
estimator for the parameter. The specific value of the point estimator 
that we use gives a point estimate for the parameter. 


e A confidence interval gives an interval of plausible values for an 
unknown population parameter. The interval is computed from the data 
and has the form 


When calculating a confidence interval, it is common to use the form 


Statistic + (critical value) : (standard deviation of statistic) 
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statistic + (critical value) - (standard deviation of statistic) 


e To interpret a C% confidence interval, say, “We are C% confident that 
the interval from to captures the [parameter in context].” Be 
sure that your interpretation describes a parameter and not a statistic. 


e The confidence level C is the success rate (capture rate) of the method 
that produces the interval. If you use 95% confidence intervals often, 
about 95% of your intervals will capture the true parameter value. You 
don’t know whether a particular 95% confidence interval calculated 
from a set of data actually captures the true parameter value. 


e Other things being equal, the margin of error of a confidence interval 
gets smaller as: 


» the confidence level C decreases; 
. the sample size n increases. 


e Remember that the margin of error for a confidence interval only 
accounts for chance variation, not other sources of error like 
nonresponse and undercoverage. 


Section 8.1 Exercises 


In Exercises 1-4, identify the point estimator you would use to estimate the 
parameter and calculate the value of the point estimate. 


1. pg 495 |») Got shoes? How many pairs of shoes, on average, do 
Ne] y 


female teens have? To find out, an AP® Statistics class selected an 
SRS of 20 female students from their school. Then they recorded the 
number of pairs of shoes that each student reported having. Here are 
the data: 


50 26 26 31 97 19 24 22 23 38 
13 50 13 34 23 30 49 13 15 o1 


2. Got shoes? The class in Exercise 1 wants to estimate the variability 
in the number of pairs of shoes that female students have by 
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estimating the population standard deviation o. 


Going to the prom Tonya wants to estimate the proportion of seniors 
in her school who plan to attend the prom. She interviews an SRS of 
50 of the 750 seniors in her school and finds that 36 plan to go to the 
prom. 


Reporting cheating What proportion of students are willing to report 
cheating by other students? A student project put this question to an 
SRS of 172 undergraduates at a large university: “You witness two 
students cheating on a quiz. Do you go to the professor?” Only 19 


answered “Yes.”2 


pg 499 |») Prayer in school A New York Times/CBS News Poll 


asked a random sample of U.S. adults the question “Do you favor an 
amendment to the Constitution that would permit organized prayer in 
public schools?” Based on this poll, the 95% confidence interval for 
the population proportion who favor such an amendment is (0.63, 
0.69). 


a. Interpret the confidence interval. 


b. What is the point estimate that was used to create the interval? 
What is the margin of error? 


c. Based on this poll, a reporter claims that more than two-thirds of 
U.S. adults favor such an amendment. Use the confidence interval 
to evaluate this claim. 


Losing weight A Gallup poll asked a random sample of U.S. adults, 
“Would you like to lose weight?” Based on this poll, the 95% 
confidence interval for the population proportion who want to lose 
weight is (0.56, 0.62).® 


a. Interpret the confidence interval. 


b. What is the point estimate that was used to create the interval? 
What is the margin of error? 


c. Based on this poll, Gallup claims that more than half of U.S. adults 
want to lose weight. Use the confidence interval to evaluate this 
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Za 


10. 


claim. 


Bottling cola A particular type of diet cola advertises that each can 
contains 12 ounces of the beverage. Each hour, a supervisor selects 
10 cans at random, measures their contents, and computes a 95% 
confidence interval for the true mean volume. For one particular hour, 
the 95% confidence interval is 11.97 ounces to 12.05 ounces. 


a. Does the confidence interval provide convincing evidence that the 
true mean volume is different than 12 ounces? Explain your 
answer. 


b. Does the confidence interval provide convincing evidence that the 


true mean volume is 12 ounces? Explain your answer. 


Fun size candy A candy bar manufacturer sells a “fun size” version 
that is advertised to weigh 17 grams. A hungry teacher selected a 
random sample of 44 fun size bars and found a 95% confidence 
interval for the true mean weight to be 16.945 grams to 17.395 grams. 


a. Does the confidence interval provide convincing evidence that the 
true mean weight is different than 17 grams? Explain your answer. 


b. Does the confidence interval provide convincing evidence that the 


true mean weight is 17 grams? Explain your answer. 


Shoes The AP® Statistics class in Exercise 1 also asked an SRS of 20 
boys at their school how many pairs of shoes they have. A 95% 
confidence interval for ».G—pB=the true difference 

Hic — Hp = the true difference in the mean number of pairs of shoes 
for girls and boys is 10.9 to 26.5. 


a. Interpret the confidence interval. 


b. Does the confidence interval give convincing evidence of a 
difference in the true mean number of pairs of shoes for boys and 
girls at the school? Explain your answer. 


Lying online Many teens have posted profiles on sites such as 
Facebook. A sample survey asked random samples of teens with 
online profiles if they included false information in their profiles. Of 
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11. 


12, 
13. 


14. 


15. 


170 younger teens (ages 12 to 14) polled, 117 said “Yes.” Of 317 
older teens (ages 15 to 17) polled, 152 said “Yes.”2 A 95% 
confidence interval for pY—pO=the true difference 

py — po = the true difference jn the proportions of younger teens and 
older teens who include false information in their profile is 0.120 to 
0.297. 


a. Interpret the confidence interval. 


b. Does the confidence interval give convincing evidence of a 
difference in the true proportions of younger and older teens who 
include false information in their profiles? Explain your answer. 

pg 501 |») More prayer in school Refer to Exercise 5. Interpret 


the confidence level. 
More weight loss Refer to Exercise 6. Interpret the confidence level. 


Household income The 2015 American Community Survey 
estimated the median household income for each state. According to 
ACS, the 90% confidence interval for the 2015 median household 
income in Arizona is $51,492 + $431. 


a. Interpret the confidence interval. 


b. Interpret the confidence level. 


More income The 2015 American Community Survey estimated the 
median household income for each state. According to ACS, the 90% 
confidence interval for the 2015 median household income in New 
Jersey is $72,222 + $610. 


a. Interpret the confidence interval. 


b. Interpret the confidence level. 


How confident? The figure shows the result of taking 25 SRSs from 
a Normal population and constructing a confidence interval for the 
population mean using each sample. Which confidence level—80%, 
90%, 95%, or 99%—do you think was used? Explain your reasoning. 
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16. How confident? The figure shows the result of taking 25 SRSs from 
a Normal population and constructing a confidence interval for the 
population mean using each sample. Which confidence level—80%, 
90%, 95%, or 99%—do you think was used? Explain your reasoning. 


17. Explaining confidence A 95% confidence interval for the mean body 
mass index (BMI) of young American women is 26.8 + 0.6. Discuss 
whether each of the following explanations is correct, based on that 
information. 


a. Weare confident that 95% of all young women have BMI between 
26.2 and 27.4. 


b. We are 95% confident that future samples of young women will 
have mean BMI between 26.2 and 27.4. 


c. Any value from 26.2 to 27.4 is believable as the true mean BMI of 
young American women. 
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d. If we take many samples, the population mean BMI will be 


e. 


between 26.2 and 27.4 in about 95% of those samples. 


The mean BMI of young American women cannot be 28. 


18. Explaining confidence The admissions director for a university 
found that (107.8, 116.2) is a 95% confidence interval for the mean 
IQ score of all freshmen. Discuss whether each of the following 
explanations is correct, based on that information. 


a. 


There is a 95% probability that the interval from 107.8 to 116.2 
contains pl. 


There is a 95% chance that the interval (107.8, 116.2) contains x — 


This interval was constructed using a method that produces 
intervals that capture the true mean in 95% of all possible samples. 


If we take many samples, about 95% of them will contain the 
interval (107.8, 116.2). 


The probability that the interval (107.8, 116.2) captures p is either 
0 or 1, but we don’t know which. 


19. pg 505 |») Prayer in school Refer to Exercise 5. 


a. 


Explain what would happen to the length of the interval if the 
confidence level were increased to 99%. 


. How would a 95% confidence interval based on double the sample 


size compare to the original 95% interval? 


The news article goes on to say: “The theoretical errors do not take 
into account additional errors resulting from the various practical 
difficulties in taking any survey of public opinion.” List some of 
the “practical difficulties” that may cause errors which are not 
included in the +3 percentage point margin of error. 


20. Losing weight Refer to Exercise 6. 


a. 


Explain what would happen to the length of the interval if the 
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confidence level was decreased to 90%. 


b. How would a 95% confidence interval based on triple the sample 
size compare to the original 95% interval? 


c. As Gallup indicates, the 3 percentage point margin of error for this 
poll includes only sampling variability (what they call “sampling 
error’). What other potential sources of error (Gallup calls these 
“nonsampling errors”) could affect the accuracy of the 95% 
confidence interval? 


21. California’s traffic People love living in California for many 
reasons, but traffic isn’t one of them. Based on a random sample of 
572 employed California adults, a 90% confidence interval for the 
average travel time to work for all employed California adults is 23 


minutes to 26 minutes.2 


a. Interpret the confidence level. 


b. Name two things you could do to reduce the margin of error. What 
drawbacks do these actions have? 


c. Describe how nonresponse might lead to bias in this survey. Does 
the stated margin of error account for this possible bias? 


22. Employment in California Each month the government releases 
unemployment statistics. The stated unemployment rate doesn’t 
include people who choose not to be employed, such as retirees. 
Based on a random sample of 1000 California adults, a 99% 
confidence interval for the proportion of all California adults 
employed in the workforce is 0.532 to 0.612.2 


a. Interpret the confidence level. 


b. Name two things you could do to reduce the margin of error. What 
drawbacks do these actions have? 


c. Describe how untruthful answers might lead to bias in this survey. 
Does the stated margin of error account for this possible bias? 


Multiple Choice Select the best answer for Exercises 23-26. 
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Exercises 23 and 24 refer to the following setting. A researcher plans to 
use a random sample of houses to estimate the mean size (in square feet) 
of the houses in a large population. 


23. The researcher is deciding between a 95% confidence level and a 
99% confidence level. Compared with a 95% confidence interval, a 
99% confidence interval will be 


a. narrower and would involve a larger risk of being incorrect. 
b. wider and would involve a smaller risk of being incorrect. 

c. narrower and would involve a smaller risk of being incorrect. 
d. wider and would involve a larger risk of being incorrect. 


e. wider and would have the same risk of being incorrect. 


24. After deciding on a 95% confidence level, the researcher is deciding 
between a sample of size n = 500 and a sample of size n = 1000. 
Compared with using a sample size of n = 500, a confidence interval 
based on a sample size of n = 1000 will be 


a. narrower and would involve a larger risk of being incorrect. 
b. wider and would involve a smaller risk of being incorrect. 

c. narrower and would involve a smaller risk of being incorrect. 
d. wider and would involve a larger risk of being incorrect. 


e. narrower and would have the same risk of being incorrect. 


25. Ina poll conducted by phone, 
I. Some people refused to answer questions. 
II. People without telephones could not be in the sample. 


III. Some people never answered the phone in several calls. 


Which of these possible sources of bias is included in the +2% margin 
of error announced for the poll? 


a. I only 
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b. Il only 
c. III only 
d. I, Il, and III 


e. None of these 


26. You have measured the systolic blood pressure of an SRS of 25 
company employees. A 95% confidence interval for the mean systolic 
blood pressure for the employees of this company is (122, 138). 
Which of the following statements is true? 


a. 95% of the sample of employees have a systolic blood pressure 
between 122 and 138. 


b. 95% of the population of employees have a systolic blood pressure 
between 122 and 138. 


c. If the procedure were repeated many times, 95% of the resulting 
confidence intervals would contain the population mean systolic 
blood pressure. 


d. If the procedure were repeated many times, 95% of the time the 
population mean systolic blood pressure would be between 122 
and 138. 


e. If the procedure were repeated many times, 95% of the time the 
sample mean systolic blood pressure would be between 122 and 
138. 


Recycle and Review 


27. Power lines and cancer (4.2, 4.3) Does living near power lines cause 
leukemia in children? The National Cancer Institute spent 5 years and 
$5 million gathering data on this question. The researchers compared 
638 children who had leukemia with 620 who did not. They went into 
the homes and measured the magnetic fields in children’s bedrooms, 
in other rooms, and at the front door. They recorded facts about 
power lines near the family home and also near the mother’s 
residence when she was pregnant. Result: No association between 
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leukemia and exposure to magnetic fields of the kind produced by 
power lines was found22 


a. Was this an observational study or an experiment? Justify your 
answer. 


b. Does this study prove that living near power lines doesn’t cause 
cancer? Explain your answer. 


28. Sisters and brothers (3.1, 3.2) How strongly do physical 
characteristics of sisters and brothers correlate? Here are data on the 
heights (in inches) of 11 adult pairs:44 


Brother 71 68 66 67 70 71 70 73 72 #65 66 
Sister 69 64 6 63 6 62 65 64 66 59 62 


a. Construct a scatterplot using brother’s height as the explanatory 
variable. Describe what you see. 


b. Use technology to compute the least-squares regression line for 
predicting sister’s height from brother’s height. 


c. Interpret the slope in context. 


d. Calculate and interpret the residual for the first pair listed in the 
table. 
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SECTION 8.2 Estimating a Population Proportion 


LEARNING TARGETS By the end of the section, you should be able to: 


e State and check the Random, 10%, and Large Counts conditions for 
constructing a confidence interval for a population proportion. 


e Determine the critical value for calculating a C% confidence 
interval for a population proportion using a table or technology. 


e Construct and interpret a confidence interval for a population 
proportion. 


e Determine the sample size required to obtain a C% confidence 
interval for a population proportion with a specified margin of error. 


In Section 8.1, we saw that a confidence interval can be used to estimate 
an unknown population parameter. We are often interested in estimating 
the proportion p of some outcome in a population. Here are some 
examples: 
e What proportion of U.S. adults are unemployed right now? 
e What proportion of high school students have cheated on a test? 
e What proportion of pine trees in a national park are infested with 
beetles? 
e What proportion of college students pray daily? 
e What proportion of a company’s laptop batteries last as long as the 
company claims? 
This section shows you how to construct and interpret a confidence 


interval for a population proportion. The following activity gives you a 
feel for what lies ahead. 


ACTIVITY The beads 


Before class, your teacher prepared a large population of different- 
colored beads and put them into a container. In this activity, you and 
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your team will create an interval estimate for the actual proportion of 
beads in the population that have a particular color (e.g., red). 


1. Asa class, discuss how to use the cup provided to get a simple 
random sample of beads from the container. 


2. Have a student take an SRS of beads. Separate the beads into two 
groups: those that are red and those that aren’t. Count the number of 
beads in each group. 


3. Determine the point estimate p/? for the unknown population 
proportion p of red beads in the container. 


4. In teams of 3 or 4 students, calculate a 95% confidence interval for 
the true proportion of red beads p. Hint: You may want to refer to 
Section 7.2 about the sampling distribution of a sample proportion. 


5. Compare the results with those of the other teams in the class. 


billnoll/Getty Images 


Constructing a Confidence Interval for p 


When Mr. Buckley’s class did the preceding activity, the random sample 
of 251 beads they selected included 107 red beads and 144 other beads. 
Starting with the general formula for a confidence interval from Section 
ol: 


point estimate=statistict+margin of error(critical value) - 
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(standard deviation of statistic) 
point estimate + margin of error 


= statistic + (critical value) - (standard deviation of statistic) 


they determined the values to substitute into the formula using what they 
learned about the sampling distribution of the sample proportion p/? in 
Section 7.2. 
e Statistic: The class decided to use pA=107/251=0.426 
Pp = 107/251 = 0.426 because p/P is an unbiased estimator of p. 
e Critical value: The class decided to use critical value = 2 based on the 
68—95-99.7 rule for Normal distributions. 


e Standard deviation of statistic: Remembering that the standard deviation 
[p—p) 


of the sampling distribution of p/P is op\=p(1-p)n,”? ~V~» the 
class decided to use p\=0.426P = 9-426 in the formula to get 
/ 0.426(1—0.426) 


= 0.031. 


0.426(1-0.426)251=0.031.V 25 


+— Values of p —— 


The class’s 95% confidence interval is 
0.426=0.426++2(0.031)0.062=(0.364, 0.488) 
0.426 + 2(0.031) 
= 0.426 + 0.062 = (0.364, 0.488) 


The class is 95% confident that the interval from 0.364 to 0.488 captures 
the true proportion of red beads in Mr. Buckley’s container. 


The interval constructed by Mr. Buckley’s class is nearly correct. Here 
is the exact formula for a one-sample z interval for a population 
proportion. 
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ONE-SAMPLE z INTERVAL FOR A POPULATION 
PROPORTION 


When the conditions are met, a C% confidence interval for the 
unknown proportion p is 


pitz*pA(1-p\jn 


where z* is the critical value for the standard Normal curve with C% of 
its area between —z* and z*. 


CONDITIONS FOR ESTIMATING p There are three conditions that 
must be met for this formula to be valid—one for each of the three 
components in the formula. 


4 ~ x p(1 — fp) 
pee V n 


Let’s discuss them one at a time. 


4 p(1 — p) 


. The Random Condition and p4P When our data come from a random 
sample, we can make an inference about the population from which the 
sample was selected. If the data come from a convenience sample or 
voluntary response sample, we should have no confidence that the 
resulting value of p/P is a good estimate of p. To be sure that p/P isa 
valid point estimate, we check the Random condition: The data come 
from a random sample from the population of interest. 


Another important reason for random selection is to introduce 
chance into the data-production process. We can model chance behavior 
with a probability distribution, like the sampling distributions of Chapter 
7. Our method of calculation assumes that the data come from an SRS 
of size n from the population of interest. Other types of random samples 
(e.g., stratified or cluster) might be preferable to an SRS in a given 
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setting, but they require more complex calculations than the ones we’ |l 
use. When an example, exercise, or AP® Statistics exam item refers to a 
“random sample” without saying “stratified” or “cluster,” you can 
assume the sample is an SRS. 


The Large Counts Condition and z* When Mr. Buckley’s class used 
the 68—95—99.7 rule to determine the critical value for their confidence 
interval, they were assuming that the sampling distribution of p\P was 
approximately Normal. If the distribution of p/? is approximately 
Normal, then p? will be within 2 standard deviations of p in about 
95% of samples. This means the value of p will be within 2 standard 
deviations of p/P in about 95% of samples. Thus, using a critical value 
of 2 will result in approximately 95% confidence. 


From what we learned in Chapter 7, we can use the Normal 
approximation to the sampling distribution of p\P as long as np = 10 
and n(1 — p) = 10. In practice, of course, we don’t know the value of p. 
If we did, we wouldn’t need to construct a confidence interval for it! In 
large random samples, p/? will be close to p. So we replace p by p/P 
checking the Large Counts condition:np\>10"P 2 10 and n(1-p/)=10. 
n(1—p) > 10. 

When the Large Counts condition is met, we can use a Normal 
distribution to calculate the critical value z* for any confidence level. 
You’ll learn how to do this soon. 


—  _ [BO =) 
ene r n 


| pa-p) 
The 10% Condition and pA(i-pA)nV "As you learned in 
Chapter 7, the formula for the standard deviation of the sampling 
distribution of p/P? assumes that individual observations are 
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independent. However, when we’re sampling without replacement from 
a (finite) population, the observations are not independent because 
knowing the outcome of one trial helps us predict the outcome of future 
trials. Whenever we are sampling without replacement—which is nearly 
always—we need to check the 10% condition: n < 0.10N, where n is the 
sample size and N is the population size. 


When the 10% condition is met, the standard deviation of the 
sampling distribution of p/? is 
_ /pU-p 
Oo; 4! 
op’=p(i-pn’ Von 


Because we don’t know the value of p, we replace it with the sample 
proportion p/P? as we did with the Large Counts condition. The 
resulting quantity is called the standard error (SE) of the sample 
proportion pip: 


DEFINITION Standard error 


When the standard deviation of a statistic is estimated from data, 
the result is called the standard error of the statistic. 


SEp\=p\(1-p\)n ” 


o- 7 
Some books refer to op ? as the standard error of p\P and to what we call the 


standard error as the estimated standard error. 


Like the standard deviation, the standard error describes how much the 
sample proportion p/P typically varies from the population proportion p 
in repeated SRSs of size n. 

Here is a summary of the three conditions for constructing a 
confidence interval for p. 


CONDITIONS FOR CONSTRUCTING A CONFIDENCE 
INTERVAL ABOUT A PROPORTION 
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e Random: The data come from a random sample from the population 
of interest. 


© 10%: When sampling without replacement, n < 0.10N. 


e Large Counts: Both np\”? and n(i-pyr( ~ P) are at least 10. 


Let’s verify that the conditions were met for the interval calculated by 
Mr. Buckley’s class. 


EXAMPLE |The beads (>) 
Checking conditions 


Studioshots/Alamy 


PROBLEM: Mr. Buckley’s class wants to construct a confidence 
interval for p = the true proportion of red beads in the container, which 
includes over 3000 beads. Recall that the class’s sample of 251 beads 
had 107 red beads and 144 other beads. Check if the conditions for 
constructing a confidence interval for p are met. 


SOLUTION: 


e Random: The class took a random sample of 251 beads from the 
container. / 
°o 10%: 251 beads is less than 10% of 3000. “ 


e Large Counts: 
np=251(107251)=107>10 
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andn(1—p)=251(1-107251)=251(144251)=144>10 
np = 251 (2) — 107 > 10 and 


251 


n(1 —p) = 251 (1 7 i) = 251 (#4) = 144>10 


/ 
FOR PRACTICE, TRY EXERCISE 29 


Notice that npAnp and n(i-pryr( ~ P) are the number of successes 
and failures in the sample. In the preceding example, we could address the 
Large Counts condition simply by saying, “The numbers of successes 
(107) and failures (144) in the sample are both at least 10.” 


WHAT HAPPENS IF ONE OF THE CONDITIONS IS VIOLATED? 
If the data come from a voluntary response or convenience sample, there’s 
no point in constructing a confidence interval for p. Violation of the 
Random condition severely limits our ability to make any inference 
beyond the data at hand. 


Simulation studies have shown that a variation of our method for calculating a 95% 


confidence interval for p can result in closer to a 95% capture rate in the long run, 
especially for small sample sizes. This simple adjustment, first suggested by Edwin 
Bidwell Wilson in 1927, is sometimes called the plus four estimate. Just pretend we 
have four additional observations, two of which are successes and two of which are 
failures. Then calculate the “plus four interval” using the plus four estimate in place 


of p\P and sample size n + 4 in our usual formula. 


The figure shows a screen shot from the Confidence Intervals for 
Proportions applet at the book’s website, highschool.bfwpub.com/tps6e. 
We set n = 20 and p = 0.25. The Large Counts condition is not met 
because np = 20(0.25) = 5. We used the applet to generate 1000 random 
samples and construct 1000 95% confidence intervals for p. Only 902 of 
those 1000 intervals contained p = 0.25, a capture rate of 90.2%. When the 
Large Counts condition is violated, the capture rate will almost always be 
less than the one advertised by the confidence level when the method is 
used many times. 
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0 .10 20 30 .40 .550 .60 .70 .80 .90 1.0 
Population Proportion (p): 0.25 


Confidence Level (C): 95 
Sst 


Sample Size (n): 20 
———ail 


SAMPLE 


Hit: 902 Total: 1000 
Percent hit: 0.902 


RESET 


0 10 20 30 40 50 .60 .70 80 90 1.0 


Violating the 10% condition means that we are sampling a large 
fraction of the population, which should be a good thing! Unfortunately, 
the formula we use for the standard deviation of p\? gives a value that is 
too large when the 10% condition is violated. Confidence intervals based 
on this formula are longer than they need to be. If many 95% confidence 
intervals for a population proportion are constructed in this way, more than 
95% of them will capture p. The actual capture rate is almost always 
greater than the reported confidence level when the 10% condition is 
violated. 


CALCULATING CRITICAL VALUES How do we get the critical 
value z* for our confidence interval? If the Large Counts condition is met, 
we can use a Normal curve. For their 95% confidence interval in the beads 
activity, Mr. Buckley’s class used a critical value of 2. Based on the 68— 
95—99.7 rule for Normal distributions, they figured that p? will be within 
2 standard deviations of p in about 95% of all samples. Thus, p should be 
within 2 standard deviations of p’? in about 95% of all samples. 
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We can get a more precise critical value from Table A or a calculator. 
As Figure 8.2 shows, the central 95% of the standard Normal distribution 
is marked off by 2 points, z* and —z*. We use the * to remind you that this 
is a critical value, not a standardized score that has been calculated from 


data. 
Standard 
Normal curve 


jf 


Area = ().95 


Area = ().025 Area = ().025 


2 () Z 
FIGURE 8.2 Finding the critical value z* for a 95% confidence interval starts by 
labeling the middle 95% under a standard Normal curve and calculating the area in 


each tail. 


Because of the symmetry of the Normal curve, the area in each tail is 
0.05 / 2 = 0.025. Once you know the tail areas, there are two ways to 
calculate the value of z*: 


e Using Table A: Search the body of Table A to find the point —z* with 
area 0.025 to its left. The entry z = -1.96 is what we are looking for, so 
z* = 1.96. 


zZ 05 06 07 
-2.0 0202 0197 0192 
=e 0256 Vrs S | 0244 
-1.8 0322 0314 0307 


e Using technology: The command invNorm(area:0.025, mean:0, SD:1) 
gives Z = —1.960, so z* = 1.960. 


Now we can officially calculate a 95% confidence interval using the 
data from Mr. Buckley’s class: 


p\+z*p\(1-p)n=0.426+1.960.426(1-0.426)251=0.426+0.061=(0.365, 
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, /p(1—p) [0.426(1—0.426) 
+ 2*4/ = 0.426 + 1.96, / —————__—- 

wae a \ 251 
0.487) = 0.426 + 0.061 = (0.365, 0.487) 


Mr. Buckley’s class is 95% confident that the interval from 0.365 to 
0.487 captures the true proportion of red beads in his container. Notice that 
the margin of error is slightly smaller for this interval than when the class 
used 2 for the critical value. 

To find a level C confidence interval, we need to catch the central C% 
under the standard Normal curve. Here’s an example that shows how to 
get the critical value z* for a different confidence level and use it to 
calculate a confidence interval. 


EXAMPLE | Read any good books lately? » 


Calculating a critical value and confidence interval 


PROBLEM: According to a 2016 Pew Research Center report, 73% of 
American adults have read a book in the previous 12 months. This 
estimate was based on a random sample of 1520 American adults.42 
Assume the conditions for inference are met. 


a. Determine the critical value z* for a 90% confidence interval for a 
proportion. 


b. Construct a 90% confidence interval for the proportion of all 
American adults who have read a book in the previous 12 months. 


c. Interpret the interval from part (b). 


Lisa Solonynko/Alamy 
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SOLUTION: 


a. Using Table A: z* = 1.64 or z* = 1.65 


Using technology: The command invNorm (area:0.05, mean:0, 
SD:1) gives z = —1.645, so z* = 1.645. 


b. 0.73£1.6450.73(1-0.73)1520=0.73+0.019=(0.711,0.749) 


0.73 + 1.6454/ 


= 0.73 + 0.019 
= (0.711, 0.749) 


Area = 0.0 
Area = 0.05 lalate 


-Z*= -1.645 0 Z*= 1.645 


c. We are 90% confident that the interval from 0.711 to 0.749 captures 
p = the true proportion of American adults who have read a book in 
the previous 12 months. 


FOR PRACTICE, TRY EXERCISE 35 


CHECK YOUR UNDERSTANDING 


(\ 


y Sleep Awareness Week begins in the spring with the release of the 
National Sleep Foundation’s annual poll of U.S. sleep habits and ends 
with the beginning of daylight savings time, when most people lose an 
hour of sleep.t¥ In the foundation’s random sample of 1029 U.S. adults, 
48% reported that they “often or always” got enough sleep during the 
last 7 nights. 
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1. Identify the parameter of interest. 

2. Check if the conditions for constructing a confidence interval for p 
are met. 

3. Find the critical value for a 99% confidence interval. Then calculate 
the interval. 


4. Interpret the interval in context. 


Putting It All Together: The Four-Step Process 


Taken together, the examples about Mr. Buckley’s class and the beads 
activity show you how to get a confidence interval for an unknown 
population proportion p. Because there are many details to remember 
when constructing and interpreting a confidence interval, it is helpful to 
group them into four steps. 


CONFIDENCE INTERVALS: A FOUR-STEP PROCESS 


State: State the parameter you want to estimate and the 
confidence level. 

Plan: Identify the appropriate inference method and check 
conditions. 
Do: If the conditions are met, perform calculations. 
Conclude: Interpret your interval in the context of the problem. 


The next example illustrates the four-step process in action. 


EXAMPLE | Distracted walking (») fi. 
4 STEP 


Constructing and interpreting a confidence interval 
for p 
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mimagephotography 
/Shutterstock.com 


PROBLEM: A recent poll of 738 randomly selected cell-phone users 
found that 170 of the respondents admitted to walking into something or 
someone while talking on their cell phone./4 Construct and interpret a 
95% confidence interval for the proportion of all cell-phone users who 
would admit to walking into something or someone while talking on 
their cell phone. 


SOLUTION: 


STATE: 95% CI for p = the true proportion of all cell-phone users who 
would admit to walking into something or someone while talking on 
their cell phone. 


STATE: State the parameter you want to estimate and the confidence 


level. 


PLAN: One-sample z interval for p. 


PLAN: Identify the appropriate inference method and check conditions. 


e Random: Random sample of 738 cell-phone users. / 


o 10%: It is reasonable to assume that 738 is less than 10% of all 
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cell-phone users. / 


e Large Counts: The number of successes (170) and the number of 
failures (738 — 170 = 568) are both at least 10. / 


Remember that np\"P is the number of successes and n(1-p‘) 


n(1 — p) is the number of failures: 


np=738(170738)=170n(1-p’)=738(568738)=568 


170 
5 = 738 ( —— ) =170 
= (a) 


| 568 
1—$) = 738 = 568 
aw) (= 


DO:p=170/738=0.2300.230+1.9600.230(1-0.230)738=0.230+0.030= 
DO : p = 170/738 = 0.230 

0.230 + 1.960 / ——— 

= 0.230 + 0.030 


(0.200, 0.260) = (0.200, 0.260) 


DO: If the conditions are met, perform calculations. 


CONCLUDE: We are 95% confident that the interval from 0.200 to 
0.260 captures p = the true proportion of all cell-phone users who would 
admit to walking into something or someone while talking on their cell 
phone. 


CONCLUDE: Interpret your interval in the context of the problem. 


Make sure your conclusion is about the population proportion (users who 


would admit) and not the sample proportion (those who admitted). 


FOR PRACTICE, TRY EXERCISE 41 
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AP® EXAM TIP 


If a free response question asks you to construct and interpret a 
confidence interval, you are expected to do the entire four-step process. 
That includes clearly defining the parameter, identifying the procedure, 
and checking conditions. 


rt) Remember that the margin of error in a confidence interval 


only accounts for sampling variability! There are other sources of error 
that are not taken into account. As is the case with many surveys, we are 
forced to assume that respondents answer truthfully. If they don’t, then we 
shouldn’t be 95% confident that our interval captures the truth. Other 
problems like nonresponse and question wording can also affect the results 
of a survey. Lesson: Sampling beads is much easier than sampling people! 


Your calculator will handle the “Do” part of the four-step process, as 
the following Technology Corner illustrates. 


17. Technology Corner |CONSTRUCTING A 
CONFIDENCE INTERVAL FOR A POPULATION 
PROPORTION |) 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


The TI-83/84 can be used to construct a confidence interval for an 
unknown population proportion. We’|l demonstrate using the previous 
example. Of n = 738 cell-phone users surveyed, X = 170 admitted to 
walking into something or someone while talking on their cell phone. 
To construct a confidence interval: 


e Press [STAT], then choose TESTS and 1-PropZInt. 


e When the 1-PropZInt screen appears, enter X = 170, n = 738, and 
confidence level = 0.95. Note: The value you enter for X is the 
number of successes (not the proportion of successes) and must be an 
integer. 
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e Highlight “Calculate” and press ENTER |, The 95% confidence 
interval for p is reported, along with the sample proportion p/? and 
the sample size, as shown here. 


x: 178 

n: 738 
C-Level:8.95 
Calculate 


NORMAL FLOAT AUTO REAL RADIAN MP n 


(0.19997,0. 26073) 
P=8,. 2303523035 
n=738 


AP® EXAM TIP 


You may use your calculator to compute a confidence interval on the 


AP® Statistics exam. But there’s a risk involved. If you just give the 
calculator answer with no work, you’ |! get either full credit for the “Do” 
step (if the interval is correct) or no credit (if it’s wrong). If you opt for 
the calculator-only method, be sure to complete the other three steps, 
including identifying the procedure (e.g., one-sample z interval for p) 
and give the interval in the Do step (e.g., 0.19997 to 0.26073). 


Choosing the Sample Size 


In planning a study, we may want to choose a sample size that allows us to 
estimate a population proportion within a given margin of error. The 
formula for the margin of error (ME) in the confidence interval for p is 


x 


1—p 


* 


er ee ME = 24 
ME=z*p\(1-p/)n 
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To calculate the sample size, substitute values for ME, z*, and p/P , and 
solve for n. Unfortunately, we won’t know the value of p/P until after the 
study has been conducted. This means we have to guess the value of p/P 
when choosing n. Here are two ways to do this: 


. Use a guess for p\P based on a pilot (preliminary) study or past 
experience with similar studies. 


’. Use p\=0.5P = 9-5 as the guess. The margin of error ME is largest when 
p=0.5P = 9.5, so this guess is conservative. If we get any other p/P? 
when we do our study, the margin of error will be smaller than planned. 


Once you have a guess for p?, the formula for the margin of error can be 
solved to give the required sample size n. 


SAMPLE SIZE FOR DESIRED MARGIN OF ERROR 
WHEN ESTIMATING p 


To determine the sample size n that will yield a C% confidence interval 
for a population proportion p with a maximum margin of error ME, 
solve the following inequality for n: 


Zz 
z*p\(1-p/)n<ME \ 
where p/P is a guessed value for the sample proportion. The margin of 
error will always be less than or equal to ME if you use p\=0.5P = 9.9, 


Here’s an example that shows you how to determine the sample size. 


EXAMPLE | Customer satisfaction >» 


Determining sample size 
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wbritten/Getty Images 


PROBLEM: A company has received complaints about its customer 
service. The managers intend to hire a consultant to carry out a survey 
of customers. Before contacting the consultant, the company president 
wants some idea of the sample size that she will be required to pay for. 
One value of interest is the proportion p of customers who are satisfied 
with the company’s customer service. She decides that she wants the 
estimate to be within 3 percentage points (0.03) at a 95% confidence 
level. How large a sample is needed? 


SOLUTION: 


1.964) 
1.960.5(1 — 0.5)n<0.03 | 


We have no idea about the true proportion p of satisfied customers, so 


we decide to use pA=0.5P = 0.5 as our guess. 


/0.5(1—0.5) _ 0.03 
0.5(1-0.5)n<0.031.96 \ n ~ 1.96 


Divide both sides by 1.96. 


0.5(1-0.5) & ) : 


n 1.96 


0.5(1 — 0.5)n<(0.031.96)2 
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Square both sides. 


0.5(1 - 0.5)<n(0.031.96)2 


Multiply both sides by n. 


03 
1.9 


n>1067.111 nm > 1067.111 


Make sure to follow the inequality when rounding your answer. 


The sample needs to include at least 1068 customers. 
FOR PRACTICE, TRY EXERCISE 49 
Why not round to the nearest whole number—in this case, 1067? 
Because a smaller sample size will result in a larger margin of error, 
possibly more than the desired 3 percentage points for the poll. In general, 


we round to the next highest integer when solving for sample size to make 
sure the margin of error is less than or equal to the desired value. 


CHECK YOUR UNDERSTANDING 
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y Refer to the preceding example about the company’s customer 
satisfaction survey. 


1. In the company’s prior-year survey, 80% of customers surveyed said 
they were satisfied. Using this value as a guess for p/?, find the 
sample size needed for a margin of error of at most 3 percentage 
points with 95% confidence. How does this compare with the 
required sample size from the example? 


2. What if the company president demands 99% confidence instead of 
95% confidence? Would this require a smaller or larger sample size, 
assuming everything else remains the same? Explain your answer. 


Section 8.2 Summary 


e The conditions for constructing a confidence interval about a population 
proportion p are as follows: 


: Random: The data come from a random sample from the population 
of interest. 


© 10%: When sampling without replacement, n < 0.10N. 


. Large Counts: Both np/ and n(1—pA)"P and (1 —P) are at least 10. 
That is, the number of successes and the number of failures in the 
sample are both at least 10. 


e When the conditions are met, the C% one-sample z interval for p is 


[p(l—p 
Fiz) p(1—p) 
ptz*p\(1—p)n Von 


where z* is the critical value for the standard Normal curve with C% of 
its area between —z* and z*. 


e When we use the value of pip to estimate the standard deviation of the 
sampling distribution of p/?, the result is the standard error of p‘: 
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a _ ,/p(l-p) 
Pp: SE; = V = . 


SEp/=p(1-p)n. 
e When asked to construct and interpret a confidence interval, follow the 
four-step process: 


STATE: State the parameter you want to estimate and the 
4 STEP 
confidence level. 


PLAN: Identify the appropriate inference method and check 
conditions. 
DO: If the conditions are met, perform calculations. 
CONCLUDE: Interpret your interval in the context of the problem. 


e The sample size needed to obtain a confidence interval with a maximum 
margin of error ME for a population proportion involves solving 


| p(1—p) 


24] < ME 
z*p\(1-p\jn<ME Von 


for n, where p/P is a guessed value for the sample proportion, and z* is 
the critical value for the confidence level you want. Use p\=0.5? = 9-5 
if you don’t have a good idea about the value of p.?: 


8.2 Technology Corner | » 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


17. Constructing a confidence interval for a population 
proportion Page 518 


Section 8.2 Exercises 


For Exercises 29 to 32, check whether each of the conditions is met for 
calculating a confidence interval for the population proportion p. 


29. pg 513 Rating school food Latoya wants to estimate the 
Nae 8 


proportion of the seniors at her boarding school who like the cafeteria 
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food. She interviews an SRS of 50 of the 175 seniors and finds that 
14 think the cafeteria food is good. 


30. High tuition costs Glenn wonders what proportion of the students at 
his college believe that tuition is too high. He interviews an SRS of 
50 of the 2400 students and finds 38 of those interviewed think tuition 
is too high. 


31. Salty chips A quality control inspector takes a random sample of 25 
bags of potato chips from the thousands of bags filled in an hour. Of 
the bags selected, 3 had too much salt. 


32. Whelks and mussels The small round holes you often see in 
seashells were drilled by other sea creatures, who ate the former 
dwellers of the shells. Whelks often drill into mussels, but this 
behavior appears to be more or less common in different locations. 
Researchers collected whelk eggs from the coast of Oregon, raised 
the whelks in the laboratory, then put each whelk in a container with 
some delicious mussels. Only 9 of 98 whelks drilled into a mussel.42 


33. The 10% condition When constructing a confidence interval for a 
population proportion, we check that the sample size is less than 10% 
of the population size. 


a. Why is it necessary to check this condition? 


b. What happens to the capture rate if this condition is violated? 


34. The Large Counts condition When constructing a confidence 
interval for a population proportion, we check that both np and 


n(1—p)" andn(1—p) are at least 10. 
a. Why is it necessary to check this condition? 
b. What happens to the capture rate if this condition is violated? 


35. pg 516 |) Selling online According to a recent Pew Research 


Center report, many American adults have made money by selling 
something online. In a random sample of 4579 American adults, 914 
reported that they earned money by selling something online in the 
previous year 16 Assume the conditions for inference are met. 
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36. 


37. 


38. 


39. 


a. Determine the critical value z* for a 98% confidence interval for a 
proportion. 


b. Construct a 98% confidence interval for the proportion of all 
American adults who would report having earned money by selling 
something online in the previous year. 


c. Interpret the interval from part (b). 


Reporting cheating What proportion of students are willing to report 
cheating by other students? A student project put this question to an 
SRS of 172 undergraduates at a large university: “You witness two 
students cheating on a quiz. Do you go to the professor?” Only 19 
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answered “Yes. Assume the conditions for inference are met. 


a. Determine the critical value z* for a 96% confidence interval for a 
proportion. 


b. Construct a 96% confidence interval for the proportion of all 
undergraduates at this university who would go to the professor. 


c. Interpret the interval from part (b). 


More online sales Refer to Exercise 35. Calculate and interpret the 
standard error of p/P for these data. 


More cheating Refer to Exercise 36. Calculate and interpret the 
standard error of p/P for these data. 


Going to the prom Tonya wants to estimate what proportion of her 
school’s seniors plan to attend the prom. She interviews an SRS of 50 
of the 750 seniors in her school and finds that 36 plan to go to the 
prom. 


a. Identify the population and parameter of interest. 


b. Check conditions for constructing a confidence interval for the 
parameter. 


c. Construct a 90% confidence interval for p. 


d. Interpret the interval in context. 
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40. Student government The student body president of a high school 
claims to know the names of at least 1000 of the 1800 students who 
attend the school. To test this claim, the student government advisor 
randomly selects 100 students and asks the president to identify each 
by name. The president successfully names only 46 of the students. 


a. Identify the population and parameter of interest. 


b. Check conditions for constructing a confidence interval for the 
parameter. 


c. Construct a 99% confidence interval for p. 


d. Interpret the interval in context. 


41. pg 517 |») Video games A Pew Research Center report on gamers 


and gaming estimated that 49% of U.S. adults play video games on a 
computer, TV, game console, or portable device such as a cell phone. 
This estimate was based on a random sample of 2001 U.S. adults. 
Construct and interpret a 95% confidence interval for the proportion 


of all U.S. adults who play video games.28 


42. September 11 A recent study asked U.S. adults to name 10 historic 
events that occurred in their lifetime that have had the greatest impact 
on the country. The most frequently chosen answer was the 
September 11, 2001, terrorist attacks, which was included by 76% of 
the 2025 randomly selected U.S. adults. Construct and interpret a 
95% confidence interval for the proportion of all U.S. adults who 
would include the 9/11 attacks on their list of 10 historic events. 


43. Age and video games Refer to Exercise 41. The study also estimated 
that 67% of adults aged 18-29 play video games, but only 25% of 
adults aged 65 and older play video games. 


a. Explain why you do not have enough information to give 
confidence intervals for these two age groups separately. 


b. Do you think a 95% confidence interval for adults aged 18-29 
would have a larger or smaller margin of error than the estimate 
from Exercise 41? Explain your answer. 
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44. 


45. 


46. 


47. 


Age and September 11 Refer to Exercise 42. The study also reported 
that 86% of millennials included 9/11 in their top-10 list and 70% of 
baby boomers included 9/11. 


a. Explain why you do not have enough information to give 
confidence intervals for millennials and baby boomers separately. 


b. Do you think a 95% confidence interval for baby boomers would 


have a larger or smaller margin of error than the estimate from 
Exercise 42? Explain your answer. 


Food fight A 2016 survey of 1480 randomly selected U.S. adults 
found that 55% of respondents agreed with the following statement: 
“Organic produce is better for health than conventionally grown 


produce.” 2 


a. Construct and interpret a 99% confidence interval for the 
proportion of all U.S. adults who think that organic produce is 
better for health than conventionally grown produce. 


b. Does the interval from part (a) provide convincing evidence that a 


majority of all U.S. adults think that organic produce is better for 
health? Explain your answer. 


Three branches According to a recent study by the Annenberg 
Foundation, only 36% of adults in the United States could name all 
three branches of government. This was based on a survey given to a 
random sample of 1416 U.S. adults.22 


a. Construct and interpret a 90% confidence interval for the 
proportion of all U.S. adults who could name all three branches of 
government. 


b. Does the interval from part (a) provide convincing evidence that 


less than half of all U.S. adults could name all three branches of 
government? Explain your answer. 


Prom totals Use your interval from Exercise 39 to construct and 
interpret a 90% confidence interval for the total number of seniors 
planning to go to the prom. 
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48. 


49. 


50. 


ol. 


Student body totals Use your interval from Exercise 40 to construct 
and interpret a 99% confidence interval for the total number of 
students at the school that the student body president can identify by 
name. Then use your interval to evaluate the president’s claim. 


pg 520 |») School vouchers A small pilot study estimated that 


44% of all American adults agree that parents should be given 
vouchers that are good for education at any public or private school of 
their choice. 


a. How large a random sample is required to obtain a margin of error 
of at most 0.03 with 99% confidence? Answer this question using 
the pilot study’s result as the guessed value for p/P. 


b. Answer the question in part (a) again, but this time use the 


conservative guess p\=0.5P = 9.5, By how much do the two 
sample sizes differ? 


Can you taste PTC? PTC is a substance that has a strong bitter taste 
for some people and is tasteless for others. The ability to taste PTC is 
inherited. About 75% of Italians can taste PTC, for example. You 
want to estimate the proportion of Americans who have at least one 
Italian grandparent and who can taste PTC. 


a. How large a sample must you test to estimate the proportion of 
PTC tasters within 0.04 with 90% confidence? Answer this 
question using the 75% estimate as the guessed value for p/P. 


b. Answer the question in part (a) again, but this time use the 


conservative guess p\=0.5P = 9.5, By how much do the two 
sample sizes differ? 


Starting a nightclub A college student organization wants to start a 
nightclub for students under the age of 21. To assess support for this 
proposal, they will select an SRS of students and ask each respondent 
if he or she would patronize this type of establishment. What sample 
size is required to obtain a 90% confidence interval with a margin of 
error of at most 0.04? 


52. Election polling Gloria Chavez and Ronald Flynn are the candidates 
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53. 


4. 


for mayor in a large city. We want to estimate the proportion p of all 
registered voters in the city who plan to vote for Chavez with 95% 
confidence and a margin of error no greater than 0.03. How large a 
random sample do we need? 


Teens and their TV sets According to a Gallup Poll report, 64% of 
teens aged 13 to 17 have TVs in their rooms. Here is part of the 
footnote to this report: 


These results are based on telephone interviews with a randomly 
selected national sample of 1028 teenagers in the Gallup Poll Panel 
of households, aged 13 to 17. For results based on this sample, one 
can say ... that the maximum error attributable to sampling and other 
random effects is +3 percentage points. In addition to sampling error, 
question wording and practical difficulties in conducting surveys can 
introduce error or bias into the findings of public opinion polls.24 


a. We omitted the confidence level from the footnote. Use what you 
have learned to estimate the confidence level, assuming that Gallup 
took an SRS. 


b. Give an example of a “practical difficulty” that could lead to bias 
in this survey. 


Gambling and the NCAA Gambling is an issue of great concern to 
those involved in college athletics. Because of this concern, the 
National Collegiate Athletic Association (NCAA) surveyed randomly 
selected student athletes concerning their gambling-related 
behaviors.22 Of the 5594 Division I male athletes who responded to 
the survey, 3547 reported participation in some gambling behavior. 
This includes playing cards, betting on games of skill, buying lottery 
tickets, betting on sports, and similar activities. A report of this study 
cited a 1% margin of error. 


a. The confidence level was not stated in the report. Use what you 
have learned to estimate the confidence level, assuming that the 
NCAA took an SRS. 


b. The study was designed to protect the anonymity of the student 
athletes who responded. As a result, it was not possible to calculate 
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the number of students who were asked to respond but did not. 
How does this fact affect the way that you interpret the results? 


Multiple Choice Select the best answer for Exercises 55—58. 


55. A Gallup poll found that only 28% of American adults expect to 
inherit money or valuable possessions from a relative. The poll’s 
margin of error was +3 percentage points at a 95% confidence level. 
This means that 


a. the poll used a method that gets an answer within 3% of the truth 
about the population 95% of the time. 


b. the percent of all adults who expect an inheritance must be 
between 25% and 31%. 


c. if Gallup takes another poll on this issue, the results of the second 
poll will lie between 25% and 31%. 


d. there’s a 95% chance that the percent of all adults who expect an 
inheritance is between 25% and 31%. 


e. Gallup can be 95% confident that between 25% and 31% of the 
sample expect an inheritance. 


56. Refer to Exercise 55. Suppose that Gallup wanted to cut the margin of 
error in half from 3 percentage points to 1.5 percentage points. How 
should they adjust their sample size? 


a. Multiply the sample size by 4. 
b. Multiply the sample size by 2. 
c. Multiply the sample size by 1/2. 
d. Multiply the sample size by 1/4. 


e. There is not enough information to answer this question. 


57. Most people can roll their tongues, but many can’t. The ability to roll 
the tongue is genetically determined. Suppose we are interested in 
determining what proportion of students can roll their tongues. We 
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test a simple random sample of 400 students and find that 317 can roll 
their tongues. The margin of error for a 95% confidence interval for 
the true proportion of tongue rollers among students is closest to 
which of the following? 


a. 0.0008 
b. 0.02 
c. 0.03 
d. 0.04 
e. 0.05 


58. A newspaper reporter asked an SRS of 100 residents in a large city 
for their opinion about the mayor’s job performance. Using the results 
from the sample, the C% confidence interval for the proportion of all 
residents in the city who approve of the mayor’s job performance is 
0.565 to 0.695. What is the value of C? 


a. 82 
b. 86 
c. 90 
d. 95 
e. 99 


Review and Recycle 


59. Oranges (6.1, 7.3) A home gardener likes to grow various kinds of 
citrus fruit. One of his mandarin orange trees produces oranges whose 
circumferences follow a Normal distribution with mean 21.1 cm and 
standard deviation 1.8 cm. 


a. What is the probability that a randomly selected orange from this 
tree has a circumference greater than 22 cm? 


b. What is the probability that a random sample of 20 oranges from 
this tree has a mean circumference greater than 22 cm? 
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60. More oranges (1.2, 2.2) The gardener in the previous exercise 
randomly selects 20 mandarin oranges from the tree and counts the 
number of seeds in each orange. Here are the data: 


3.4669 11 11 12 13 13 14 14 16 17 22 23 23 24 28 30 


a. Graph the data using a dotplot. 


b. Based on your graph, is it plausible that the number of seeds from 
oranges on this tree follows a distribution that is approximately 
Normal? Explain your answer. 
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SECTION 8.3 Estimating a Population Mean 


LEARNING TARGETS. By the end of the section, you should be able to: 


e Determine the critical value for calculating a C% confidence 
interval for a population mean using a table or technology. 


State and check the Random, 10%, and Normal/Large Sample 


conditions for constructing a confidence interval for a population 
mean. 


Construct and interpret a confidence interval for a population mean. 


Determine the sample size required to obtain a C% confidence 
interval for a population mean with a specified margin of error. 


Inference about a population proportion usually arises when we study 
categorical variables. We learned how to construct and interpret 
confidence intervals for a population proportion p in Section 8.2. To 
estimate a population mean, we have to record values of a quantitative 
variable for a sample of individuals. It makes sense to try to estimate the 
mean amount of sleep that students at a large high school got last night but 
not their mean eye color! In this section, we’ll examine confidence 
intervals for a population mean p. 


The Problem of Unknown o 


In Section 8.2, we used this formula for a confidence interval for a 
population proportion: 


/p(1—p) 
n 


pt 2z*4/ 
p\tz*p\(1-pA)n- \ 


In more general terms, this is 


point estimate+ margin of error= statistict+ (critical value) 
(standard deviation of the statistic) 
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point estimate -+ margin of error 


= statistic + (critical value)(standard deviation of the statistic) 


The formula sheet on the AP® Statistics exam includes the general formula stated 


here. However, some people prefer to use 


Statistic + (critical value)(standard error of the statistic) 
statistic + (critical value) (standard error of the statistic) 


because we are usually estimating the standard deviation with data from a sample. 


A confidence interval for a population mean has a formula with the 
same structure. Using the sample mean x = as the point estimate for the 
population mean p and o/n?/V™ as the standard deviation of the sampling 
distribution of x % gives 

Co 


5+ 2” 
x +z*on Vv 


S 


mr 


This interval is called a one-sample z interval for a population mean. 
Unfortunately, if we don’t know the true value of p, we rarely know the 
true value of o either. We can use s, as an estimate for o, but things don’t 


work out as nicely as we might like. Let’s explore why this is true. 


pePle> 


ACTIVITY Confidence interval BINGO! ¢ 


In this activity, you will investigate the problem caused by replacing o 
with s, when calculating a confidence interval for p, and how to fix it. 
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photowind/Shutterstock 


A farmer wants to estimate the mean weight (in grams) of all 
tomatoes grown on her farm. To do so, she will select a random sample 
of 4 tomatoes, calculate the mean weight (in grams), and use the sample 
mean x = to create a 99% confidence interval for the population mean 
pt. Suppose that the weights of all tomatoes on the farm are 
approximately Normally distributed with a mean of 100 grams anda 
standard deviation of 20 grams. 


Let’s use an applet to simulate taking an SRS of n = 4 tomatoes and 
calculating a 99% confidence interval for p using three different 
methods. 


Method 1 (assuming o is known) 


Eto a 4257622. 
x +z*on=x +2.576204 vn V4 


1. Launch the Simulating Confidence Intervals for a Population 
Parameter applet at www.rossmanchance.com/applets. Choose 
“Means” from the drop-down menu and leave the other menus as 
“Normal” and “z with sigma.” Then enter p = 100, o = 20, n= 4, and 
intervals = 1. Set the confidence level = 99%, as shown in the screen 
shot. 
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Means 


Normal 


z with sigma 


Intervals 1 


Sample 


Conf level 99 


Recalculate 


2. Press “Sample.” The applet will select an SRS of n = 4 tomatoes and 
calculate a confidence interval for p. This interval will be displayed as 
a horizontal line. The vertical line identifies p = 100. If the interval 
captures p/ = 100, the interval will be green. If the interval misses p = 
100, the interval will be red. 


100 
Means v 
Normal v 
z with sigma v 
u | 100 
o 20 
n 4 
Intervals 1 
Sample 
Conf level 99 % 
Recalculate 


Intervals containing 
1/1 = 100.0% 
Running Total 
1/1 = 100.0% 


Sort 


Reset 


60 80 100 120 140 


3. Press “Sample” many times, shouting out “BINGO!” whenever you 
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get an interval that misses p = 100 (i.e., a red interval). Stop when 
your teacher calls time. 


4. How well did Method 1 work? Compare the running total in the lower 
left corner with the stated confidence level of 99%. 


Method 2 (using s, as an estimate for 0) 


E+ 2° l= F + 2.576— 
x +z*sxn=x +2.576sx4 = — 


1. Press the “Reset” button in the lower left. Then, in the third drop- 
down menu, choose “z with s.” This replaces o with s, in the formula 


for the confidence interval. Keep everything else the same. 
2. Press “Sample” many times, shouting out “BINGO!” whenever you 


get an interval that misses p = 100 (i.e., a red interval). Stop when 
your teacher calls time. 


3. Did Method 2 work as well as Method 1? Discuss with your 
classmates. 


4. Now change the number of intervals to 50 and press “Sample” 20 
times, for a total of more than 1000 intervals. Compare the running 
total in the lower left corner with the stated confidence level of 99%. 
What do you notice about the length of the intervals that missed? 

To increase the capture rate of the intervals to 99%, we need to 
make the intervals longer. We can do this by using a different critical 
value, called a t* critical value. You’ ll learn how to calculate this 
number soon. 


Method 3 (using s, as an estimate for o and a ¢* critical value instead of 


a z* critical value) 


x +t*sxn=x +???sx4 yn J/4 


1. Press the “Reset” button in the lower left. Then, in the third drop- 
down menu, choose “t” and change “Intervals” to 1. Keep everything 
else the same. 
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2. Press “Sample” many times, shouting out “BINGO!” whenever you 
get an interval that misses p = 100 (i.e., a red interval). Stop when 
your teacher calls time. 


3. Did Method 3 work better than Method 2? How does it compare to 
Method 1? Discuss with your classmates. 


4. Now change the number of intervals to 50 and press “Sample” 20 
times, for a total of at least 1000 intervals. Compare the running total 
in the lower left corner with the stated confidence level of 99%. What 
do you notice about the length of the intervals compared to Method 
Pale 


Figure 8.3 shows the results of repeatedly constructing confidence 
intervals using a z* critical value and the sample standard deviation s,, as 
described in Method 2 of the preceding activity. Of the 1000 intervals 
constructed, only 925 (that’s 92.5%) captured the population mean. That’s 
far below our desired 99% confidence level! 


Even though X wasn’t far 
from yw, this interval 

o 20 missed because the value 
nl4 of s, was much smaller 
than o. 


u 100 


Intervals 100 


Sample 


Conf level 99 


Recalculate 


Intervals containing p 
90/100 = 90.0% 
Running Total 
925/1000 = 92.5% 


Sort 
Reset 


FIGURE 8.3 Screen shot from the Simulating Confidence Intervals for a 
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Population Parameter applet at www.rossmanchance.com/applets, showing the 
capture rate of a “99%” confidence interval when using a z* critical value with the 
sample standard deviation s,. 


What went wrong? The intervals that missed (those in red) came from 
samples with small standard deviations s, and from samples in which the 


sample mean x © was far from the population mean p. In those cases, 
multiplying sx/48x/V4 by z* = 2.576 didn’t produce long enough intervals 
to reach p = 100. To achieve a 99% capture rate, we need to multiply by a 
larger critical value. But what critical value should we use? 


CALCULATING t* CRITICAL VALUES In the activity, you 
discovered that Method 3 produces confidence intervals that capture p at 
the advertised confidence level. That is, when constructing “99%” 


a « Sx 


= zit—, 
confidence intervals using the formula x +t*sxn, Vv" about 99% of 
the intervals capture the population mean p. This interval is called a one- 
sample t interval for a population mean. 


The critical value in this interval is denoted t* because it comes from a 
t distribution, not the standard Normal distribution. The critical value ¢t* 
has the same interpretation as z*: it measures how many standard 
deviations we need to extend from the point estimate to get the desired 
level of confidence. There is a different t distribution for each sample size. 
We specify a particular t distribution by giving its degrees of freedom (df). 
When we perform inference about a population mean p using at 
distribution, the appropriate degrees of freedom are found by subtracting 1 
from the sample size n, making df =n — 1. 


Figure 8.4 shows two different t distributions, along with the standard 
Normal distribution. Because the t distributions are wider than the standard 
Normal distribution, t* critical values will always be larger than z* critical 
values for a specified level of confidence. As you learned in the “Bingo” 
activity, we need to use a critical value larger than z* to compensate for 
the variability introduced by using the sample standard deviation s, as an 


estimate for the population standard deviation o. As the degrees of 
freedom increase, the t distributions approach the standard Normal 
distribution. This makes sense because the value of s, will typically be 
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closer to o as the sample size increases. 


t distributions have 
more area in the tails 
than the standard 

Normal distribution. 


—- (,2 degrees 
of freedom 
ww t,9 degrees 
of freedom 
— Standard 
Normal 


FIGURE 8.4 Density curves for the t distributions with 2 and 9 degrees of 
freedom and the standard Normal distribution. All are symmetric with center 0. 
The ¢ distributions have more variability than the standard Normal distribution, but 
approach the standard Normal distribution as df increases. 


You will learn more about t distributions in Chapter 9. For now, we 
will focus on how to calculate the critical value t* for various sample sizes 
and confidence levels. 


Table B in the back of the book gives t* critical values for the t 
distributions. Each row in the table contains critical values for the t 
distribution whose degrees of freedom (df) appear at the left of the row. To 
make the table easy to use, several of the more common confidence levels 
C are given at the bottom of the table. 


The t distribution and the t inference procedures were invented by William S. Gosset 


(1876-1937). Gosset worked for the Guinness brewery, and his goal in life was to 
make better beer. He used his new t procedures to find the best varieties of barley 
and hops. Gosset’s statistical work helped him become head brewer. Because Gosset 
published under the pen name “Student,” you will often see the t distribution called 


“Student’s t” in his honor. 


THE ONE-SAMPLE t INTERVAL FOR A POPULATION 
MEAN 
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When the conditions are met, a C% confidence interval for the 
unknown mean p is 


zit 
x +Ht*sxn Vv 


where ¢* is the critical value for the t distribution with n — 1 degrees of 
freedom (df) and C% of the area between —t* and ¢*. 


When you use Table B to determine the correct value of t* for a given 
confidence interval, all you need to know are the confidence level C and 
the degrees of freedom (df). In the activity, we calculated 99% confidence 
intervals with n= 4, sodf=4-1=3. 


Tail probability p 
df 02 01 005 0025 
1 15.89 31.82 63.66 127.3 
2 4.849 6.965 9.925 14.09 
3 3.482 4541 | 5841 7.453 
00 2.054 2.326 2.576 2.807 
96% 98% 99% 99.5% 


Confidence level C 


According to the Table B excerpt in the margin, for 99% confidence 
and 3 degrees of freedom, t* = 5.841. That is, the interval should extend 
5.841 standard deviations on both sides of the point estimate to have a 
capture rate of 99%. 

Unfortunately, ‘Table B doesn’t include every possible df. If the correct 
df isn’t listed, use the greatest df available that is less than the correct df. 
“Rounding up” to a larger df will result in confidence intervals that are too 
narrow. The intervals won’t be wide enough to include the true population 
value as often as suggested by the confidence level. 


EXAMPLE | Finding ¢* critical values (>) 
Using Table B 
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PROBLEM: What critical value t* from Table B should be used in 
constructing a confidence interval for the population mean in each of the 
following settings? 


a. A 95% confidence interval based on an SRS of size n = 12 


b. A 90% confidence interval from a random sample of 48 observations 


SOLUTION: 
at = 2201 
Tail probability p 
df 05 025 02 01 
10 1.812 2.228 ase 2.764 
11 1.796 Pea a Deve wat: 
12 1.782 2.179 2.303 Deel 
00 1.645 1.960 2.054 DIg0G 
90% 95% 96% 98% 


Confidence level C 


In Table B, we consult the row corresponding to df = 12 -1= 11. 


We move across that row to the entry that is directly above 95% 
confidence level on the bottom of the chart. 


b. t* = 1.684 
Tail probability p 

df 10 05 025 02 
30 1.310 1.697 2.042 2.147 
40 1.303 ices 2.021 ee 
50 1.299 1.676 2.009 2.109 
oo 1.282 1.645 1.960 2.054 

80% 90% 95% 96% 


Confidence level C 


| With 48 observations, we want to find the t* critical value for df = 48 | 
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— 1 = 47 and 90% confidence. There is no df = 47 row in Table B, so 
we use the more conservative df = 40. 


FOR PRACTICE, TRY EXERCISE 61 


The bottom row of Table B gives z* critical values. That’s because t 
distributions approach the standard Normal distribution as the degrees of 
freedom approach infinity. 


Because t* is larger than z* for a given sample size and confidence 
level, intervals calculated with ¢* will have a higher capture rate than 
intervals using z*. When the conditions are met, the actual percent of 
intervals that contain the population mean should be equal to the stated 
confidence level. Using t* rather than z* solves the problem created by 
using the sample standard deviation s, as an estimate for the population 


standard deviation o. 


Technology will quickly produce t* critical values for any sample size. 


18. Technology Corner |USING INVERSE t >» 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


Most newer TI-84 calculators allow you to find critical values t* using 
the inverse t command. Let’s use the inverse t command to find the 
critical values in parts (a) and (b) of the example. 


e Press (DISTR) and choose invT(. 


e For part (a), we need to find the critical value for 95% confidence, so 
we want an area of 0.025 in each tail. 


OS 2.55 or later: In the dialog box, enter these values: area: 0.025, 
df: 11, choose Paste, and then press ENTER. Note: For the inverse t 
command, the area always refers to area to the left. 


Older Os: Complete the command invT(0.025,11) and press 
[ENTER| 


e For part (b), we need 90% confidence, so we want an area of 0.05 in 
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each tail. Use the command invT(0.05,47). 


AL FLOR D REAL DIAN MP 


invT(@.@25,11) 
-2. 200985143 


invT(@.05,.47) 
-1.677926664 


Note that the t critical values are t* = 2.201 and t* = 1.678, 
respectively. The critical value for part (b) is slightly smaller here, 
because we were able to use df = 47 rather than rounding down to df = 


40. 


Now that you’ve learned how to calculate a t* critical value, it’s time 
to make a simple observation. Inference for proportions uses z; inference 
for means uses t. That’s one reason why distinguishing categorical from 


quantitative variables is so important. 


CHECK YOUR UNDERSTANDING 


(\ 


\" 


y Use Table B to find the critical value t* that you would use for a 
confidence interval for a population mean p in each of the following 
settings. If possible, check your answer with technology. 


1. A 96% confidence interval based on a random sample of 22 
observations 
2. A 99% confidence interval from an SRS of 71 observations 


Conditions for Estimating py 


As with proportions, you must check some important conditions before 
constructing a confidence interval for a population mean. The first two 
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conditions should be familiar by now. The Random condition is crucial for 
doing inference. If the data don’t come from a random sample, you can’t 
draw conclusions about a larger population. When sampling without 
replacement, the 10% condition ensures that our formula for the standard 
deviation is approximately correct. 


The third condition is different, however. When calculating a 
confidence interval for a population proportion, we check the Large 
Counts condition to ensure it is appropriate to use the standard Normal 
distribution to calculate the z* critical value. When calculating a 
confidence interval for a population mean, we check the Normal/Large 
Sample condition to ensure it is appropriate to use a t distribution to 
calculate the ¢* critical value. 


For the t* critical value to produce confidence intervals with a capture 
rate that is equal to the confidence level, the population distribution must 
be Normal. If we are told that the population is Normally distributed, then 
the condition has been met. Unfortunately, it is rare that we actually know 
a population distribution is Normal. When the population shape is 
unknown, there are two ways to satisfy the Normal/Large Sample 
condition. 


e If the sample size is small (n < 30), graph the sample data and ask, “Is it 
plausible that these data came from a Normal population?” If there is no 
strong skewness or outliers in the data, then the answer is “Yes.” 
Remember to include the graph of sample data when checking the 
Normal/Large Sample condition in this way. 


e If the sample size is large (n => 30), using a ¢t* critical value will produce 
confidence intervals with a capture rate that is approximately equal to 
the confidence level, even when the population distribution is not 
Normal. 

Here is an image from the Simulating Confidence Intervals for a 
Population Parameter applet at www.rossmanchance.com/applets, 
showing the capture rate of a 95% one-sample t interval when selecting 
1000 samples of size n = 50 from a population that is strongly skewed to 
the right (an exponential distribution). 

The capture rate isn’t quite 95%, but it’s fairly close. The agreement 
between the stated confidence level and actual capture rate will improve 
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by increasing the sample size or by sampling from a population that is 
closer to Normal. 


Means 
Exponential 
t 
un 0.5 
n 50 
Intervals 100 


Sample 


Conf level 95 
Recalculate 
Intervals containing py 
96/100 = 96.0% 


Running Total 
943/1000 = 94.3% 


Sort 


Reset 


CONDITIONS FOR CONSTRUCTING A CONFIDENCE 
INTERVAL ABOUT A MEAN 


e Random: The data come from a random sample from the population 
of interest. 


© 10%: When sampling without replacement, n < 0.10N. 


Normal/Large Sample: The population has a Normal distribution or 
the sample size is large (n > 30). If the population distribution has 
unknown shape and n < 30, use a graph of the sample data to assess 
the Normality of the population. Do not use t procedures if the graph 
shows strong skewness or outliers. 


Here are some examples of checking the Normal/Large Sample 
condition. 
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EXAMPLE |GPAs, wood, and SATs (>) 
Checking the Normal/Large Sample condition 


Ken Cavanagh/Alamy 


PROBLEM: Determine if the Normal/Large Sample condition is met in 
each of the following settings. 


a. To estimate the average GPA of students at your school, you 
randomly select 50 students. Here is a histogram of their GPAs: 


10) 


Frequency 


GPA 


b. How much force does it take to pull wood apart? The stemplot shows 
the force (in pounds) required to pull a piece of Douglas fir apart for 
each of 20 randomly selected pieces. 


c. Suppose you want to estimate the mean SAT Math score at a large 
high school. Here is a boxplot of the SAT Math scores for a random 
sample of 20 students at the school: 
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Key: 31/3 = 313 pounds of 
force required to pulla 
piece of Douglas fir apart 


WWWW PPMP PN PO 
WN OOONAUEW 


259 
99 
033677 
0236 
— 
400 500 600 700 S00 
SAT Math score 


SOLUTION: 


a. Yes; the sample size is large (50>30).(29 = 30). 


In addition, it is also plausible that the sample came from a Normal 


population because the histogram doesn’t have strong skewness or 
outliers. 


b. No; the stemplot is strongly skewed to the left with possible low 
outliers and n=20<30.n = 20 < 30. 


It is not plausible that this sample came from a Normal population 
because of the strong skewness and outliers in the stemplot. 


c. Yes; even though n=20<30,” = 29 < 30, the boxplot is only 
moderately skewed to the right and there are no outliers. 


It is plausible that this sample came from a Normal population 


because the boxplot doesn’t show strong skewness or outliers. 


FOR PRACTICE, TRY EXERCISE 63 


In the preceding example, we used a histogram, a stemplot, and a 
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boxplot to address the Normal/Large Sample condition. You can also use 
dotplots and Normal probability plots to assess Normality. Each of these 
graphs has strengths and weaknesses, so we recommend following the 
advice of your teacher when choosing a graph to use. 


In some cases, it is challenging to determine if the skewness in a graph 
should be considered “strong.” One way to judge skewness is to compare 
the distance from the maximum to the median and from the median to the 
minimum. Let’s take a closer look at the stemplot from part (b) and the 
boxplot from part (c) in the preceding example. 


e In the stemplot that records the force required for pulling a piece of 
wood apart, maximum — median = 336 — 319.5 = 16.5 and median — 
minimum = 319.5 — 230 = 89.5. The half of the stemplot with smaller 
values is more than 5 times as long as the half of the stemplot with 
larger values. 


e Inthe SAT Math example, maximum — median * 775 — 525 = 250 and 
median — minimum * 525 — 375 = 150. The right half of the boxplot 
(everything greater than the median, including the whisker) is less than 
twice as long as the left half. 


Key: 31|3 = 313 pounds of 
force required to pulla 
piece of Douglas fir apart 
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The stemplot is quite a bit more skewed than the boxplot. 
Unfortunately, there is no accepted rule of thumb for identifying strong 
skewness. For that reason, we have chosen the data sets in examples and 
exercises to avoid borderline cases. 


| AP® EXAM TIP | 
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If a question on the AP® Statistics exam asks you to construct and 
interpret a confidence interval, all the conditions should be met. 
However, you are still required to state the conditions and show 
evidence that they are met—including a graph if the sample size is 
small and the data are provided. 


Constructing a Confidence Interval for p 


When the conditions are met, a C% confidence interval for the unknown 
mean p! is 
8 
et+¢°— 


x +t*sxn vn 


Because we don’t know the value of the population standard deviation o, 


Sz 


we replace it with the sample standard deviation s,. The value sxnv” is 


called the standard error of the sample meanx =, or just the standard error 
of the mean: 


SBe =—2. 
SEx =sxn vn 


Like the standard deviation of x £, the standard error of x % describes 
how much the sample mean x = typically varies from the population mean 
pi in repeated SRSs of size n. 


As with proportions, some books refer to the standard deviation of the sampling 


distribution of x 2 as the standard error and what we call the standard error of the 
mean as the estimated standard error. The standard error of the mean is often 
abbreviated SEM. 


The following example illustrates how to construct and interpret a 
confidence interval for a population mean. By now, you should recognize 
the four-step process. 
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EXAMPLE |More books (>) fi. 
4 STEP 
A one-sample t interval for jp 


Niloo/Shutterstock.com 


PROBLEM: In the previous section, you learned that about 73% of 
American adults claim to have read a book in the previous 12 months. 
This was based on a 2016 Pew Research Center study that interviewed a 
random sample of 1520 American adults. The same study also reported 
that the average number of books read by the members of the sample 
(including those who reported reading 0 books) was x =12@ = 12 
books with a standard deviation of s, = 18 books.?2 


a. Construct and interpret a 95% confidence interval for the mean 
number of books read by all Americans in the previous 12 months. 


b. In 2011, Pew reported that American adults read an average of 14 
books in the previous 12 months. Does your interval from part (a) 
provide convincing evidence that the 2016 mean is different than 14 
books? Explain your answer. 


SOLUTION: 


a. STATE: 95% CI for p: = the true mean number of books read by all 
American adults in the previous 12 months. 
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Make sure to do the four-step process! 


PLAN: One-sample t interval for p 
e Random: Random sample of 1520 American adults / 
° 10%: 1520 is less than 10% of all American adults. / 


e Normal/Large Sample: The sample size is large (n=1520230). 
(n = 1520 > 30). y 

DO: Using Table B: df = 1000 and t* = 1.962 

Using technology: df = 1519 and t* = 1.962 


12+1.962181520=12+0.91=(11.09, 12.91) 
12 + 1.962 = = 12+0.91 = (11.09, 12.91) 


4/1520 


x +t*Sxn 


CONCLUDE: We are 95% confident that the interval from 11.09 books 
to 12.91 books captures p = the true mean number of books read by all 
American adults in the previous 12 months. 


b. Yes; because 14 is not in the confidence interval, 14 is not a 
plausible value for the mean number of books read by all American 
adults in the previous 12 months. 


Make sure your conclusion is in context, includes units (books), and is 


about a population mean. 


FOR PRACTICE, TRY EXERCISE 69 


In this example, the sample standard deviation (s, = 18) is greater than 


the sample mean (x =12% = 12). If the population were Normally 
distributed, we would expect the minimum to be about 2 or 3 standard 
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deviations below the mean. However, because the minimum of 0 books is 
not even 1 standard deviation below the mean, we know the population 
distribution is skewed to the right. Furthermore, Pew reported that the 
median number of books read by the members of the sample was 4 books. 
This is quite a bit less than the mean, again suggesting that the population 
distribution is skewed to the right. Of course, because of the large sample 
size, we don’t need to worry about the shape of the population distribution 
when checking the Normal/Large Sample condition. 


Here is another example, this time with the actual data values and a 
smaller sample size. 


fi 


EXAMPLE |Video screen tension » BE 


4 STEP 
Constructing a confidence interval for p 


Pgiam/Getty Images 


PROBLEM: A manufacturer of high-resolution video terminals must 
control the tension on the mesh of fine wires that lies behind the surface 
of the viewing screen. Too much tension will tear the mesh, and too 
little will allow wrinkles. The tension is measured by an electrical 
device with output readings in millivolts (mV). Some variation is 
inherent in the production process. Here are the tension readings from a 
random sample of 20 screens from a single day’s production: 


269.5 297.0 269.6 283.3 304.8 280.4 233.5 257.4 317.5 327.4 
264.7 307.7 310.0 343.3 328.1 342.6 338.8 340.1 374.6 336.1 


Construct and interpret a 90% confidence interval for the mean tension 
of all the screens produced on this day. 
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SOLUTION: 


STATE: 90% CI for p: = the true mean tension of all the video terminals 
produced this day. 


Remember to do the four-step process! 


PLAN: One-sample t interval for p 
e Random: Random sample of 20 screens produced that day 


o 10%: Assume that 20 is less than 10% of all video terminals 
produced that day. “ 


e Normal/Large Sample: The dotplot does not show strong skewness or 
outliers. / 


Screen tension (mY) 


Because there is no strong skewness or outliers in the sample, it is 


plausible that the population distribution of screen tension is Normal. 


DO: x =306.32 mV and sx=36.21 mV= = 306.32 mV ands, = 36.21mV 


With df=19, t*=1.729306.32+1.72936.2120=306.32+14.00=(292.32, 
With df = 19, ¢* = 1.729 


306.32 + 1.729 
320.32) = 306.32 + 14.00 = (292.32, 320.32) 


When raw data are provided, use your calculator to find the mean and 
standard deviation of the sample. 


x +t*Sxn 
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CONCLUDE: We are 90% confident that the interval from 292.32 mV 
to 320.32 mV captures p = the true mean tension in the entire batch of 
video terminals produced that day. 


Make sure your conclusion is in context, includes units (mV), and is 


about a population mean. 


FOR PRACTICE, TRY EXERCISE 73 


AP® EXAM TIP 


It is not enough just to make a graph of the data on your calculator 
when assessing Normality. You must sketch the graph on your paper to 
receive credit. You don’t have to draw multiple graphs—any 
appropriate graph will do. 


As you probably guessed, your calculator will compute a one-sample t 
interval for a population mean from sample data or summary statistics. 


19. Technology Corner |CONSTRUCTING A 
CONFIDENCE INTERVAL FOR A POPULATION MEAN 


Ne 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


Confidence intervals for a population mean using t distributions can be 
constructed on the TI-83/84, avoiding the use of Table B. Here is a brief 
summary of the techniques when you have the actual data values and 
when you have only numerical summaries: 


1. Using summary statistics (see the “More books” example, page 535) 
e Press [STAT] arrow over to TESTS, and choose Tinterval.... 


e On the TInterval screen, adjust your settings as shown and choose 
Calculate. 
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NORMAL FLOAT AUTO REAL RADIAN HP fl NORMAL FLOAT AUTO REAL RADIAN MP tl 


TIinterval TInterval 
Inet: Data (11.094,12. 986) 
x:12 


x=12 
Sx:18 Sx=18 
n:1520 n=1528 
C-Level:8.95 
Calculate 


2. Using raw data (see the “Video screen tension” example, page 536) 


e Enter the 20 video screen tension readings in list L1. Proceed to 
the TInterval screen as in Step 1, but choose Data as the input 
method. Then adjust your settings as shown and calculate the 
interval. 

NORHAL FLOAT AUTO REAL RADIAN HP HH NORMAL FLOAT AUTO REAL RADIAN HP fl 


Tinterval 
Inet: BE) Stats (292.32,320.32) 
List:Li xX=306.32 
Frea:i Sx=36. 20928349 


C-Level:0.9 n=20 


Calculate 


CHECK YOUR UNDERSTANDING 


Biologists studying the healing of skin wounds measured the rate at 
which new cells closed a cut made in the skin of an anesthetized newt. 


Here are data from a random sample of 18 newts, measured in 


micrometers (millionths of a meter) per hour:24 


29 27 34 40 22 28 14 35 26 35 12 30 23 18 11 22 23 33 


Calculate and interpret a 95% confidence interval for the mean healing 
rate [. 
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Choosing the Sample Size 


A wise user of statistics never plans data collection without thinking about 
the analysis at the same time. You can arrange to have both high 
confidence and a small margin of error by taking enough observations. 
When the population standard deviation o is unknown and conditions are 
met, the C% confidence interval for is 


where ¢* is the critical value for confidence level C and degrees of 
freedom df = n- 1. The margin of error (ME) of the confidence interval is 


oe i 


To determine the sample size for a desired margin of error, it makes 
sense to set the expression for ME less than or equal to the specified value 
and solve the inequality for n. There are two problems with this approach: 


.. We don’t know the sample standard deviation s, because we haven’t 


produced the data yet. 


’, The critical value t* depends on the sample size n that we choose. 


The second problem is more serious. To get the correct value of t*, we 
need to know the sample size. But that’s what we’re trying to find! There 
is no easy solution to this problem. 


We recommend that you come up with a reasonable estimate for the 
population standard deviation o from a similar study that was done in the 
past or from a small-scale pilot study. By pretending that o is known, we 
can use the one-sample z interval for p: 

Co 


@+2*— 


x +z*on di 


nN 


Using the appropriate standard Normal critical value z* for confidence 
level C, we can solve for n using 


2*—_< ME 


z*on<ME Vv" 
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Here is a summary of this strategy. 


SAMPLE SIZE FOR A DESIRED MARGIN OF ERROR 
WHEN ESTIMATING p 


To determine the sample size n that will yield a C% confidence interval 
for a population mean with a specified margin of error ME: 


e Get a reasonable value for the population standard deviation o from 
an earlier or pilot study. 


Find the critical value z* from a standard Normal curve for 
confidence level C. 


Set the expression for the margin of error to be less than or equal to 
ME and solve for n: 


z*-_< ME 
z*onsME Vv" 


The procedure is best illustrated with an example. 


EXAMPLE | How many monkeys? (>) 


Determining sample size from margin of error 


Richard Nowitz/Medical Images USA 


PROBLEM: Researchers would like to estimate the mean cholesterol 
level y of a particular variety of monkey that is often used in laboratory 
experiments. They would like their estimate to be within 1 milligram per 
deciliter (mg/dl) of the true value of p at a 95% confidence level. A 
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previous study involving this variety of monkey suggests that the 
standard deviation of cholesterol level is about 5 mg/dl. What is the 
minimum number of monkeys the researchers will need to test to get a 
satisfactory estimate? 


SOLUTION: 
1.96 <1 


1.96-5<1,/n 
(1.96-5)? <n 
1.965n<1 96-04 < n 


For 95% confidence, z* = 1.96. Use o = 5 as our best guess for the 


standard deviation. Set the expression for the margin of error to be at 


most 1 and solve for n. 


1.96 <1 

1.96-5<1,/n 

(1.96.5)? <n 
1.96: 5<1n 96.04 < n 


Multiply both sides by nv. 


1.96— <1 
1.96-5<1/n 
(1.96 -5)? <n 


(1.96: 5)2<n 96.04 < n 


Square both sides. 
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1.96—- <4 
1.96-5<1/n 
(1.96-5)? <n 


96.04<n 96.04 = n 


Remember that your answer should be an integer and it must follow the 


inequality sign. 


The researchers would need at least 97 monkeys to estimate the 
cholesterol levels to their satisfaction. 


FOR PRACTICE, TRY EXERCISE 77 


Taking observations costs time and money. The required sample size 
may be impossibly expensive. Notice that it is the size of the sample that 
determines the margin of error. The size of the population does not 
influence the sample size we need. This is true as long as the population is 
much larger than the sample. 

o@ Although we assumed a value for the population standard 


deviation o to calculate the required sample size, you should not 
assume a value for o when calculating a confidence interval for jp. 
When calculating a confidence interval for p/, the sample size is known so 
there is no need to use an assumed value of o. 

There are other methods of determining sample size that do not require 
us to use a known value of the population standard deviation o. These 
methods are beyond the scope of this text. Our advice: consult with a 
Statistician when planning your study! 


CHECK YOUR UNDERSTANDING 


(\ 


\ 


y Administrators at your school want to estimate how much time 
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students spend on homework, on average, during a typical week. They 
want to estimate p at the 90% confidence level with a margin of error of 
at most 30 minutes. A pilot study indicated that the standard deviation 
of time spent on homework per week is about 154 minutes. How many 
students need to be surveyed to meet the administrators’ goal? 


Section 8.3 Summary 


Confidence intervals for the mean p of a population are based on the 
sample mean x ©. If we know o (which is very rare), we use a z* 
critical value and the standard Normal distribution to calculate a one- 
sample z interval for p. 


In practice, we usually don’t know o. Replacing the standard deviation 
of the sampling distribution of x (ox =o/n)” (oz =0//n) by the 
standard error of x (SEx =sx/n)” (SEZ = sz//n) requires use of the 
t distribution with n — 1 degrees of freedom (df) rather than the standard 
Normal distribution when calculating a confidence interval for a 
population mean. 


The conditions for constructing a confidence interval for a population 
mean are 


= Random: The data come from a random sample from the population 
of interest. 


© 10%: When sampling without replacement, n < 0.10N. 


« Normal/Large Sample: The population has a Normal distribution or 
the sample size is large (n > 30). If the population distribution has 
unknown shape and n < 30, use a graph of the sample data to assess 
the Normality of the population. Do not use t procedures if the graph 
shows strong skewness or outliers. 


When conditions are met, a C% confidence interval for the mean 1 is 
given by the one-sample t interval: 
+ 52 


f 


Faxt 
x +t*sxn 
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The critical value t* is chosen so that the t curve with n — 1 degrees of 
freedom has C% of the area between —t* and t*. Use Table B or 
technology to calculate ¢*. 


e Follow the four-step process—State, Plan, Do, Conclude— 
~~ whenever you are asked to construct and interpret a confidence 
interval for a population mean. Remember: inference for 
proportions uses z; inference for means uses t. 


e The sample size needed to obtain a confidence interval with 
approximate margin of error ME for a population mean involves solving 
Co 


7 _< ME 
z*on<ME Vv" 


for n, where the standard deviation o is a reasonable value from a 
previous or pilot study, and z* is the critical value for the level of 
confidence we want. 


8.3 Technology Corners | » 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 
18. Using inverse t Page 531 


19. Constructing a confidence interval for a population mean 


Page 537 


Section 8.3 Exercises 


61. pg 530 |») Critical values What critical value t* from Table B 


should be used for a confidence interval for the population mean in 
each of the following situations? 


a. A 95% confidence interval based on n = 10 randomly selected 
observations 


b. A 99% confidence interval from an SRS of 20 observations 
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c. A 90% confidence interval based on a random sample of 77 
individuals 


62. Critical values What critical value t* from Table B should be used 


63. 


64. 


for a confidence interval for the population mean in each of the 
following situations? 


a. A 90% confidence interval based on n = 12 randomly selected 
observations 


b. A 95% confidence interval from an SRS of 30 observations 


c. A 99% confidence interval based on a random sample of size 58 


pg 533 |p) Weeds among the corn Velvetleaf is a particularly 


annoying weed in cornfields. It produces lots of seeds, and the seeds 
wait in the soil for years until conditions are right for sprouting. How 
many seeds do velvetleaf plants produce? The histogram shows the 
counts from a random sample of 28 plants that came up in a cornfield 
when no herbicide was used.22 Determine if the Normal/Large 
Sample condition is met in this context. 


8 


6 


Frequency 


i = -# 


0 I 2 3 q 5 6 7 8 
Number of seeds (thousands) 
Judy is interested in the reading level of a medical journal. She 
records the length of a random sample of 100 words. The histogram 
displays the distribution of word length for her sample. Determine if 
the Normal/Large Sample condition is met in this context. 
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Frequency 


Word length 


65. Check them all Determine if the conditions are met for constructing 
a confidence interval for the population mean in each of the following 
settings. 


a. How much time do students at your school spend on the Internet? 
You collect data from the 32 members of your AP® Statistics class 
and calculate the mean amount of time that these students spent on 
the Internet yesterday. 


b. Is the real-estate market heating up? To estimate the mean sales 
price, a realtor in a large city randomly selected 100 home sales 
from the previous 6 months in her city. These sales prices are 
displayed in the boxplot. 


! | ] T | 
0) 200 400) 600 S800 LOOO 


Sales price ($1000s) 
66. Check them all Determine if the conditions are met for constructing 
a confidence interval for the population mean in each of the following 
Settings. 


a. We want to estimate the average age at which U.S. presidents have 
died. So we obtain a list of all U.S. presidents who have died and 
their ages at death. 


b. Do teens text more than they call? To find out, an AP® Statistics 
class at a large high school collected data on the number of text 
messages and calls sent or received by each of 25 randomly 
selected students. The boxplot displays the difference (Texts — 
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67. 


68. 


69. 


70. 


Calls) for each student. 


a 


T T T T T T T | 
2() 0 20 40) 60 SO 100 120 


Difference (Texts — Calls) 
Blood pressure A medical study finds that x =114.9% = 114.9 and s, 
= 9.3 for the seated systolic blood pressure of the 27 randomly 


selected adults. What is the standard error of the mean? Interpret this 
value in context. 


Travel time to work A study of commuting times reports the travel 
times to work of a random sample of 20 employed adults in New 
York State. The mean is x _=31.25% = 31.25 minutes and the standard 
deviation is s, = 21.88 minutes. What is the standard error of the 


mean? Interpret this value in context. 


pg 535 | ®) Bone loss by nursing mothers Breast-feeding mothers 


secrete calcium into their milk. Some of the calcium may come from 
their bones, so mothers may lose bone mineral. Researchers measured 
the percent change in bone mineral content (BMC) of the spines of 47 
randomly selected mothers during three months of breast-feeding.2© 
The mean change in BMC was —3.587% and the standard deviation 
was 2.506%. 


a. Construct and interpret a 99% confidence interval to estimate the 
mean percent change in BMC in the population of breast-feeding 
mothers. 


b. Based on your interval from part (a), do these data give convincing 


evidence that nursing mothers lose bone mineral, on average? 
Explain your answer. 


Reading scores in Atlanta The Trial Urban District Assessment 
(TUDA) is a government-sponsored study of student achievement in 
large urban school districts. TUDA gives a reading test scored from 0 
to 500. A score of 243 is a “basic” reading level and a score of 281 is 
“proficient.” Scores for a random sample of 1470 eighth-graders in 
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71. 


72s 


7a 


74. 


Atlanta had a mean of 240 with standard deviation of 42.17.22 


a. Construct and interpret a 99% confidence interval for the mean 
reading test score of all Atlanta eighth-graders. 


b. Based on your interval from part (a), is there convincing evidence 


that the mean reading test score for all Atlanta eighth-graders is 
less than the basic level? Explain your answer. 


America’s favorite cookie Ann and Tori wanted to estimate the 
average weight of an Oreo cookie to determine if it was less than 
advertised (34 grams for 3 cookies). They selected a random sample 
of 36 cookies and found the weight of each cookie (in grams). The 
mean weight was x =11.3921% = 11.3921 grams with a standard 
deviation of s, = 0.0817 grams.78 Construct and interpret a 90% 


confidence interval for the true mean weight of an Oreo cookie. 


Fruit fly thorax lengths Fruit flies are used frequently in genetic 
research because of their quick reproductive cycle. The length of the 
thorax (in millimeters) was measured for each fly in a random sample 
of 49 male fruit flies. The mean length was x =0.8004 mm , 

< = 0.8004 mm , with a standard deviation of s, = 0.0782 mm.22 
Construct and interpret a 90% confidence interval for the true mean 
thorax length of a male fruit fly. 


pg 536 |) Pepperoni pizza Melissa and Madeline love pepperoni 


pizza, but sometimes they are disappointed with the small number of 
pepperonis on their pizza. To investigate, they went to their favorite 
pizza restaurant at 10 random times during the week and ordered a 
large pepperoni pizza.2¥ Here are the number of pepperonis on each 
pizza: 


47 36 25 a7 46 36 49 oe 32 34 


Construct and interpret a 95% confidence interval for the true mean 
number of pepperonis on a large pizza at this restaurant. 


Catching goldfish for school Carly and Maysem plan to be 
preschool teachers after they graduate from college. To prepare for 
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75. 


76. 


77. 


78. 


snack time, they want to know the mean number of goldfish crackers 
in a bag of original-flavored goldfish. To estimate this value, they 
randomly selected 12 bags of original-flavored goldfish and counted 
the number of crackers in each bag.24 Here are their data: 


317) 330) «3250-3323 332) 337) 324) 342) «330 «6349 335333 


Construct and interpret a 95% confidence interval for the true mean 
number of crackers in a bag of original- flavored goldfish. 


A plethora of pepperoni? Refer to Exercise 73. 


a. Explain why it was necessary to inspect a graph of the sample data 
when checking the Normal/Large Sample condition. 


b. According to the manager of the restaurant, there should be an 


average of 40 pepperonis on a large pizza. Based on the interval, is 
there convincing evidence that the average number of pepperonis 
is less than 40? Explain your answer. 


A school of fish Refer to Exercise 74. 


a. Explain why it was necessary to inspect a graph of the sample data 
when checking the Normal/Large Sample condition. 


b. According to the packaging, there are supposed to be 330 goldfish 


in each bag of crackers. Based on the interval, is there convincing 
evidence that the average number of goldfish is less than 330? 
Explain your answer. 


pg 539 |») Estimating BMI The body mass index (BMI) of all 


American young women is believed to follow a Normal distribution 
with a standard deviation of about 7.5. How large a sample would be 
needed to estimate the mean BMI p in this population to within +1 
with 99% confidence? 


The SAT again High school students who take the SAT Math exam a 
second time generally score higher than on their first try. Past data 
suggest that the score increase has a standard deviation of about 50 
points. How large a sample of high school students would be needed 
to estimate the mean change in SAT score to within 2 points with 
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95% confidence? 


79. Willows in Yellowstone Writers in some fields summarize data by 
giving x © and its standard error rather than x % and s,. Biologists 
studying willow plants in Yellowstone National Park reported their 
results in a table with columns labeled x +SEZ + SE The table entry 
for the heights of willow plants (in centimeters) in one region of the 
park was 61.55 + 19.03.22 The researchers measured a total of 23 
plants. 


a. Find the sample standard deviation s, for these measurements. 


b. A hasty reader believes that the interval given in the table is a 95% 
confidence interval for the mean height of willow plants in this 
region of the park. Find the actual confidence level for the given 
interval. 


80. Blink When two lights close together blink alternately, we “see” one 
light moving back and forth if the time between blinks is short. What 
is the longest interval of time between blinks that preserves the 
illusion of motion? Ask subjects to turn a knob that slows the blinking 
until they “see” two lights rather than one light moving. A report 
gives the results in the form “mean plus or minus the standard error of 
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the mean.”22 Data for 12 subjects are summarized as 251 + 45 (in 


milliseconds). 


a. Find the sample standard deviation s,, for these measurements. 


b. A hasty reader believes that the interval given in the report is a 
95% confidence interval for the population mean. Find the actual 
confidence level for the given interval. 


Multiple Choice Select the best answer for Exercises 81—84. 


81. One reason for using a t distribution instead of the standard Normal 
distribution to find critical values when calculating a level C 
confidence interval for a population mean is that 


a. z can be used only for large samples. 
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b. z requires that you know the population standard deviation o. 


c. z requires that you can regard your data as an SRS from the 
population. 


d. z requires that the sample size is less than 10% of the population 
size. 


e. az critical value will lead to a wider interval than a t critical value. 


82. You have an SRS of 23 observations from a large population. The 


83. 


84. 


distribution of sample values is roughly symmetric with no outliers. 
What critical value would you use to obtain a 98% confidence 
interval for the mean of the population? 


a. 2,177 
b. 2.183 
C: 2.326 
d. 2.500 
e. 2.508 


A quality control inspector will measure the salt content (in 
milligrams) in a random sample of bags of potato chips from an hour 
of production. Which of the following would result in the smallest 
margin of error in estimating the mean salt content p? 


a. 90% confidence; n = 25 
b. 90% confidence; n = 50 
c. 95% confidence; n = 25 
d. 95% confidence; n = 50 


e. n= 100 at any confidence level 


Scientists collect data on the blood cholesterol levels (milligrams per 
deciliter of blood) of a random sample of 24 laboratory rats. A 95% 
confidence interval for the mean blood cholesterol level p is 80.2 to 
89.8. Which of the following would cause the most worry about the 
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validity of this interval? 

a. There is a clear outlier in the data. 

b. A stemplot of the data shows a mild right skew. 

c. You do not know the population standard deviation o. 
d. The population distribution is not exactly Normal. 


e. None of these are a problem when using a ¢ interval. 
Recycle and Review 


85. Watching TV (6.1, 7.3) Choose a young person (aged 19 to 25) at 
random and ask, “In the past seven days, how many days did you 
watch television?” Call the response X for short. Here is the 


probability distribution for x:34 


Days 0 1 a 2 4 5 6 7 
Probability 0.04 0.03 0.06 0.08 0.09 0.08 0.05 27? 


a. What is the probability that X = 7? Justify your answer. 


b. The mean of the random variable X is jy = 5.44 and the standard 


deviation is ox = 2.14. Interpret these values. 


c. Suppose that you asked 100 randomly selected young people (aged 
19 to 25) to respond to the question and found that the mean x # 
of their responses was 4.96. Would this result surprise you? Justify 
your answer. 


86. Price cuts (4.2) Stores advertise price reductions to attract customers. 
What type of price cut is most attractive? Experiments with more than 
one factor allow insight into interactions between the factors. A study 
of the attractiveness of advertised price discounts had two factors: 
percent of all foods on sale (25%, 50%, 75%, or 100%) and whether 
the discount was stated precisely (e.g., as in “60% off”) or as a range 
(as in “40% to 70% off”). Subjects rated the attractiveness of the sale 
on a scale of 1 to 7. 
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. List the treatments for this experiment, assuming researchers will 
use all combinations of the two factors. 


. Describe how you would randomly assign 200 volunteer subjects 
to treatments. 


. Explain the purpose of the random assignment in part (b). 


. The figure shows the mean ratings for the eight treatments formed 
from the two factors.22 Based on these results, write a careful 
description of how percent on sale and precise discount versus 
range of discounts influence the attractiveness of a sale. 
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_Mé _. FRAPPY! FREE RESPONSE 


AP® PROBLEM, 
YAY! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, concise 
response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you 
use, because you will be scored on the correctness of your methods 
as well as on the accuracy and completeness of your results and 
explanations. 


Members at a popular fitness club currently pay a $40 per month 
membership fee. The owner of the club wants to raise the fee to $50 
but is concerned that some members will leave the gym if the fee 
increases. To investigate, the owner plans to survey a random sample 
of the club members and construct a 95% confidence interval for the 
proportion of all members who would quit if the fee was raised to 
$50. 


a. Explain the meaning of “95% confidence” in the context of 
the study. 


b. After the owner conducted the survey, he calculated the 
confidence interval to be 0.18 + 0.075. Interpret this interval 
in the context of the study. 


c. According to the club’s accountant, the fee increase will be 
worthwhile if fewer than 20% of the members quit. According 
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to the interval from part (b), can the owner be confident that 
the fee increase will be worthwhile? Explain. 


d. One of the conditions for calculating the confidence interval 
in part (b) is that npA>10" = 10 and n(1-p)>10. 
n(1—p) > 10. Explain why it is necessary to check this 
condition. 


After you finish, you can view two example solutions on the book’s 
website (highschool.bfwpub.com/tps6e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 


“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide you with a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own 
score. 
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Chapter 8 Review 


Section 8.1: Confidence Intervals: The Basics 


In this section, you learned that a point estimate is the single best guess for 
the value of a population parameter. You also learned that a confidence 
interval provides an interval of plausible values for a parameter. To 
interpret a confidence interval, say, “We are C% confident that the interval 
from ___ to___ captures the [parameter in context],” where C is the 
confidence level of the interval. 


The confidence level C describes the percentage of confidence 
intervals that we expect to capture the value of the parameter in repeated 
sampling. To interpret a C% confidence level, say, “If we took many 
samples and used them to construct C% confidence intervals, about C% of 
those intervals would capture the [parameter in context].” 


Confidence intervals are formed by including a margin of error on 
either side of the point estimate. The size of the margin of error is 
determined by several factors, including the confidence level C and the 
sample size n. Increasing the sample size n makes the standard deviation 
of our estimate smaller, decreasing the margin of error. Increasing the 
confidence level C makes the margin of error larger, to ensure that the 
capture rate of the interval increases to C%. Remember that the margin of 
error only accounts for sampling variability—it does not account for any 
bias in the data collection process. 


Section 8.2: Estimating a Population Proportion 


In this section, you learned how to construct and interpret confidence 
intervals for a population proportion. Several important conditions must be 
met for this type of confidence interval to be valid. First, the data used to 
calculate the interval must come from a random sample from the 
population of interest (the Random condition). When the sample is taken 
without replacement from the population, the sample size should be less 
than 10% of the population size (the 10% condition). Finally, the observed 
number of successes np’? and observed number of failures n(1—p’) 
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n(1—P) must both be at least 10 (the Large Counts condition). 
The formula for calculating a confidence interval for a population 
proportion is 
.», [BU-B) 


priz*pA(i-pryn Von 


where p/P is the sample proportion, z* is the critical value, and n is the 
sample size. The value of z* is based on the confidence level C. To find z*, 
use Table A or technology to determine the values of z* and —z* that 
capture the middle C% of the standard Normal distribution. 


The four-step process (State, Plan, Do, Conclude) is perfectly suited 
for problems that ask you to construct and interpret a confidence interval. 
You should state the parameter you are estimating and the confidence 
level, plan your work by naming the type of interval you will use and 
checking the appropriate conditions, do the calculations, and make a 
conclusion in the context of the problem. You can use technology for the 
Do step, but make sure that you identify the procedure you are using and 
type in the values correctly. 


Finally, an important part of planning a study is determining the size of 
the sample to be selected. The necessary sample size is based on the 
confidence level, the proportion of successes, and the desired margin of 
error. To calculate the minimum sample size, solve the following 
inequality for n, where pip is a guessed value for the sample proportion: 


—y 


.. | P(l—p) 
z*p\(1—pA)n<ME._ Vn 


< ME 


If you do not have an approximate value of p\P from a previous study ora 
pilot study, use p\=0.5P = 9-5 to determine the sample size that will yield 
a value less than or equal to the desired margin of error. 


Section 8.3: Estimating a Population Mean 

In this section, you learned how to construct and interpret confidence 
intervals for a population mean. The Random and 10% conditions are the 
same as those for proportions. There’s one new condition for means: the 
population must be Normally distributed or the sample size must be at 
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least 30 (the Normal/Large Sample condition). If the population 
distribution’s shape is unknown and the sample size is less than 30, graph 
the sample data and check for strong skewness or outliers. If there is no 
strong skewness or outliers, it is reasonable to assume that the population 
distribution is approximately Normal. 

The formula for calculating a confidence interval for a population 
mean is 


= . 32 
zt t*— 
x +t*sxn vn 


where x = is the sample mean, ¢* is the critical value, s, is the sample 
standard deviation, and n is the sample size. We use a ¢ critical value 
instead of a z critical value when the population standard deviation is 
unknown—which is almost always the case. The value of t* is based on 
the confidence level C and the degrees of freedom (df = n — 1). To find ¢t*, 
use Table B or technology to determine the values of t* and —t* that 
capture the middle C% of the appropriate t distribution. 


You also learned how to estimate the sample size when planning a 
study, as in Section 8.2. To calculate the minimum sample size, solve the 
following inequality for n, where o is a guessed value for the population 
standard deviation: 


z*-—_< ME 
z*on<ME V2 


Confidence interval for p Confidence interval for py 

Name (TI- One-sample z interval for p (1- One-sample t interval for p 
83/84) PropZ Int) (TInterval) 
Formula pA+z*p\(1-pA)n zit sz 

‘ [pQ-P) x +t*sxn vn 

pxrz \ 

Tn 

Conditions 


e Random: The datacome  e Random: The data come 
from arandom sample from from arandom sample from 
the population of interest. the population of interest. 


o 10%: When sampling o 10%: When sampling 
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without replacement, n < 


0.10N. 


without replacement, n < 
0.10N. 


e Large Counts: Both np\np e Normal/Large Sample: 


and n(i-pryr(1 —P) are at 
least 10. That is, the number 


of successes and the 
number of failures in the 


sample are both at least 10. 


What Did You Learn? 


Learning Target 


Identify an appropriate point 
estimator and calculate the 
value of a point estimate. 


Interpret a confidence 
interval in context. 


Determine the point estimate 
and margin of error from a 
confidence interval. 


Use a confidence interval to 
make a decision about the 
value of a parameter. 


Interpret a confidence level 
in context. 


Describe how the sample size 
and confidence level affect 


Section 
8.1 


8.1 


8.1 


8.1 


8.1 


8.1 
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The population has a 
Normal distribution or the 
sample size is large (n = 
30). If the population 
distribution has unknown 
shape and n < 30, use a 
graph of the sample data to 
assess the Normality of the 
population. Do not use t 
procedures if the graph 
shows strong skewness or 
outliers. 


Relevant 
Related Chapter 
Example on Review 
Page(s) Exercise(s) 
495 R8.3 
499 R8.4, R8.5 
499 R8.2 
499 R8.5 
501 R8.2 
505 R8.8 


the margin of error. 


Explain how practical issues 
like nonresponse, 
undercoverage, and response 
bias can affect the 
interpretation of a confidence 
interval. 


State and check the Random, 
10%, and Large Counts 
conditions for constructing a 
confidence interval for a 
population proportion. 


Determine the critical value 
for calculating a C% 
confidence interval for a 
population proportion using a 
table or technology. 
Construct and interpret a 
confidence interval for a 
population proportion. 


Determine the sample size 
required to obtain a C% 
confidence interval for a 
population proportion with a 
specified margin of error. 


Determine the critical value 
for calculating a C% 
confidence interval for a 
population mean using a 
table or technology. 


State and check the Random, 
10%, and Normal/Large 
Sample conditions for 
constructing a confidence 
interval for a population 
mean. 


Construct and interpret a 
confidence interval for a 
population mean. 


8.1 


8.2 


8.2 


8.2 


8.2 


8.3 


8.3 


8.3 
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905 


913 


516 


916, 517 


920 


930, 531 


933 


935, 536 


Determine the sample size 
required to obtain a C% 
confidence interval for a 
population mean with a 
specified margin of error. 


8.3 


1177 


939 


Chapter 8 Review Exercises 


These exercises are designed to help you review the important ideas and 
methods of the chapter. 


R8.1 |) It’s critical Find the appropriate critical value for constructing a 


confidence interval in each of the following settings. 


a. 


Estimating a population proportion p at a 94% confidence level 
based on an SRS of size 125 


Estimating a population mean p/ at a 99% confidence level based 
on an SRS of size 58 


R8.2 |») Batteries A company that produces AA batteries tests the 


lifetime of a random sample of 30 batteries using a special device 
designed to imitate real-world use. Based on the testing, the 
company makes the following statement: “Our AA batteries last an 
average of 430 to 470 minutes, and our confidence in that interval is 
95%.”28 


a. 


Determine the point estimate, margin of error, standard error, and 
sample standard deviation. 


A reporter translates the statistical announcement into “plain 
English” as follows: “95% of this company’s AA batteries last 
between 430 and 470 minutes.” Comment on this interpretation. 


Your friend, who has just started studying statistics, claims that 
there is a 95% probability that the mean lifetime will fall between 
430 and 470 minutes. Do you agree? Explain your reasoning. 


Give a Statistically correct interpretation of the confidence level 
that could be published in a newspaper report. 


R8.3 |p) We love football! A Gallup poll conducted telephone 


interviews with a random sample of adults aged 18 and older. Data 
were obtained for 1000 people. Of these, 370 said that football is 
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their favorite sport to watch on television. 


a. Define the parameter p in this setting. 


What point estimator will you use to estimate p? What is the 
value of the point estimate? 


Do you believe that the value of the point estimate is equal to the 
value of p? Explain your answer. 


R8.4 |») Running red lights A random digit dialing telephone survey of 


R8.5 


880 drivers asked, “Recalling the last ten traffic lights you drove 
through, how many of them were red when you entered the 
intersections?” Of the 880 respondents, 171 admitted that at least 
one light had been red.24 


a. 


Construct and interpret a 95% confidence interval for the 
population proportion. 


Nonresponse is a practical problem for this survey—only 21.6% 
of calls that reached a live person were completed. Another 
practical problem is that people may not give truthful answers. 
What is the likely direction of the bias: Do you think more or 
fewer than 171 of the 880 respondents really ran a red light? 
Why? Are these sources of bias included in the margin of error? 


») Engine parts A random sample of 16 of the more than 200 auto 


engine crankshafts produced in one day was selected. Here are 
measurements (in millimeters) of a critical component on these 
crankshafts: 


224.120 224.001 224.017 223.982 223.989 223.961 223.960 224.089 
223.987 223.976 223.902 223.980 224.098 224.057 223.913 223.999 


a. Construct and interpret a 95% confidence interval for the mean 


length of this component on all the crankshafts produced on that 
day. 


The mean length is supposed to be p = 224 mm but can drift 
away from this target during production. Does your interval from 
part (a) suggest that the mean has drifted from 224 mm? Explain 
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your answer. 
R8.6 |) Do you go to church? The Gallup Poll plans to ask a random 


sample of adults whether they attended a religious service in the last 
7 days. How large a sample would be required to obtain a margin of 
error of at most 0.01 in a 99% confidence interval for the population 
proportion who would say that they attended a religious service? 


R8.7 |) Good wood? A lab supply company sells pieces of Douglas fir 
4 inches long and 1.5 inches square for force experiments in science 
classes. From experience, the strength of these pieces of wood 
follows a distribution with standard deviation 3000 pounds. You 
want to estimate the mean load needed to pull apart these pieces of 
wood to within 1000 pounds with 95% confidence. How large a 
sample is needed? 

R8.8 |») It’s about ME Explain how each of the following would affect 
the margin of error of a confidence interval, if all other things 
remained the same. 

a. Increasing the confidence level 
b. Quadrupling the sample size 


R8.9 |) t time When constructing confidence intervals for a population 


mean, we almost always use critical values from a t distribution 
rather than the standard Normal distribution. 


a. When is it necessary to use a t critical value rather than a z 
critical value when constructing a confidence interval for a 
population mean? 


b. For a particular level of confidence, explain what happens to the t 
critical values as the degrees of freedom increase. 
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Chapter 8 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T8.1 The Gallup Poll interviews 1600 people. Of these, 18% say that 
they jog regularly. The news report adds: “The poll had a margin 
of error of plus or minus three percentage points at a 95% 
confidence level.” You can safely conclude that 


a. 


95% of all Gallup Poll samples like this one give answers 
within +3% of the true population value. 


the percent of the population who jog is certain to be between 
15% and 21%. 


c. 95% of the population jog between 15% and 21% of the time. 


we can be 95% confident that the sample proportion is captured 
by the confidence interval. 


if Gallup took many samples, 95% of them would find that 
18% of the people in the sample jog. 


T8.2 The weights (in pounds) of three adult males are 160, 215, and 
195. What is the standard error of the mean for these data? 


a. 
b. 


Cc. 


d. 


e. 


190 

27.84 
22,73 
16.07 
13.13 


T8.3 In preparing to construct a one-sample t interval for a population 
mean, suppose we are not sure if the population distribution is 
Normal. In which of the following circumstances would we not be 
safe constructing the interval based on an SRS of size 14 from the 
population? 


a. A stemplot of the data is roughly bell-shaped. 
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b. A histogram of the data shows slight skewness. 


c. A boxplot shows that the values above the median are much 
more variable than the values below the median. 


d. The sample standard deviation is large. 
e. The sample standard deviation is small. 


T8.4 Many television viewers express doubts about the validity of 
certain commercials. In an attempt to answer their critics, Timex 
Group USA wishes to estimate the true proportion p of all 
consumers who believe what is shown in Timex television 
commercials. What is the smallest number of consumers that 
Timex can survey to guarantee a margin of error of 0.05 or less at 
a 99% confidence level? 


a. 550 
b. 600 
c. 650 
d. 700 
e. 750 


T8.5 You want to compute a 90% confidence interval for the mean of a 
population with an unknown population standard deviation. The 
sample size is 30. What critical value should you use for this 
interval? 


a. 1.645 
b. 1.699 
c. 1.697 
d. 1.96 
e. 2.045 


T8.6 A radio talk show host with a large audience is interested in the 
proportion p of adults in his listening area who think the drinking 
age should be lowered to eighteen. To find this out, he poses the 
following question to his listeners: “Do you think that the drinking 
age should be reduced to eighteen in light of the fact that 18-year- 
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olds are eligible for military service?” He asks listeners to go to his 
website and vote “Yes” if they agree the drinking age should be 
lowered and “No” if not. Of the 100 people who voted, 70 
answered “Yes.” Which of the following conditions are violated? 


I. Random 
II. 10% 
Ill. Large Counts 
a. I only 
b. Il only 
c. If only 
d. I and II only 
e. I, II, and III 


T8.7 A 90% confidence interval for the mean p of a population is 
computed from a random sample and is found to be 90 + 30. 
Which of the following could be the 95% confidence interval 
based on the same data? 


a. 90+ 21 
b. 90 + 30 
c. 90 + 39 
d. 90 + 70 


e. Without knowing the sample size, any of the above answers 
could be the 95% confidence interval. 


T8.8 Suppose we want a 90% confidence interval for the average 
amount spent on books by freshmen in their first year at a major 
university. The interval is to have a margin of error of at most $2. 
Based on last year’s book sales, we estimate that the standard 
deviation of the amount spent will be close to $30. The number of 
observations required is closest to which of the following? 


a, 25 
b. 30 
C...225 
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d. 609 
e. 865 


T8.9 A telephone poll of an SRS of 1234 adults found that 62% are 
generally satisfied with their lives. The announced margin of error 
for the poll was 3%. Does the margin of error account for the fact 
that some adults do not have telephones? 


a. Yes; the margin of error accounts for all sources of error in the 
poll. 


b. Yes; taking an SRS eliminates any possible bias in estimating 
the population proportion. 


c. Yes; the margin of error accounts for undercoverage but not 
nonresponse. 


d. No; the margin of error accounts for nonresponse but not 
undercoverage. 


e. No; the margin of error only accounts for sampling variability. 


T8.10 A Census Bureau report on the income of Americans says that with 
90% confidence the median income of all U.S. households in a 
recent year was $57,005 with a margin of error of $742. Which of 
the following is the most appropriate conclusion? 


a. 90% of all households had incomes in the interval $57,005 + 
$742. 


b. We can be sure that the median income for all households in the 
country lies in the interval $57,005 + $742. 


c. 90% of the households in the sample interviewed by the Census 
Bureau had incomes in the interval $57,005 + $742. 


d. The Census Bureau got the result $57,005 + $742 using a 
method that will capture the true median income 90% of the 
time when used repeatedly. 


e. 90% of all possible samples of this same size would result in a 
sample median that falls within $742 of $57,005. 


Section II: Free Response Show all your work. Indicate clearly the 
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methods you use, because you will be graded on the correctness of your 
methods as well as on the accuracy and completeness of your results and 
explanations. 


T8.11 The U.S. Forest Service is considering additional restrictions on the 
number of vehicles allowed to enter Yellowstone National Park. To 
assess public reaction, the service asks a random sample of 150 
visitors if they favor the proposal. Of these, 89 say “Yes.” 


a. Construct and interpret a 99% confidence interval for the 
proportion of all visitors to Yellowstone who favor the 
restrictions. 


b. Based on your work in part (a), is there convincing evidence 
that more than half of all visitors to Yellowstone National Park 
favor the proposal? Justify your answer. 


T8.12 Many people have asked the question, but few have been patient 
enough to collect the data. How many licks does it take to get to 
the center of a Tootsie Pop? After some intense research, a 
researcher revealed a 95% confidence interval for the mean number 
of licks to be 317.64 licks to 394.56 licks.28 


a. Interpret the confidence interval. 


b. Calculate the point estimate and margin of error used to 
construct the interval. 


c. Name two things the researcher could do to decrease the margin 
of error. Discuss a drawback of each. 


T8.13 A milk processor monitors the number of bacteria per milliliter in 
raw milk received at the factory. A random sample of 10 one- 
milliliter specimens of milk supplied by one producer gives the 
following data: 


9370 4890 5100 4500 5260 5150 4900 4760 4700 4870 


Construct and interpret a 90% confidence interval for the 
population mean J. 
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Chapter 9 Testing a Claim 


Bill Gozansky/Alamy 
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Chapter 9 AP® Statistics Practice Test 
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INTRODUCTION 


Confidence intervals are one of the two most common methods of 
Statistical inference. You can use a confidence interval to estimate a 
population parameter, like the proportion of all U.S. adults who exercise 
regularly or the true mean amount of time that students at a large 
university spent on social media yesterday. 


What if we want to test a claim about a parameter? For instance, the 
U.S. Bureau of Labor Statistics claims that the national unemployment rate 
in March 2017 was 4.5%. A citizens’ group suspects that the actual rate 
was higher. The second common method of inference, called a 
significance test, allows us to weigh the evidence in favor of or against a 
particular claim. 


DEFINITION Significance test 


A significance test is a formal procedure for using observed data to 
decide between two competing claims (called hypotheses). The 
claims are usually statements about a parameter, like the population 
proportion p or the population mean p. 


A significance test is sometimes referred to as a test of significance, a hypothesis test, 


or a test of hypotheses. 


Here is an activity that illustrates the reasoning of a significance test. 


ACTIVITY I’m a great free-throw shooter! 


In this activity, you and your classmates will perform a simulation to 
test a claim about a population proportion. 

A basketball player claims to make 80% of the free throws that he 
attempts. That is, he claims p=0.80P ~ 0.80 where p is the true 
proportion of free throws he will make in the long run. We suspect that 
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he is exaggerating and that p<0.80? < 9-80, 

Suppose the player shoots 50 free throws and makes 32 of them. His 
sample proportion of made shots is p\=32/50=0.64? = 32/50 = 0.64 
This result gives some evidence that the player really makes less than 
80% of his free throws in the long run. But do we have convincing 
evidence that p<0.80?? < 9-80? Or is it plausible that an 80% shooter 
will have a performance this poor by chance alone? You can use a 
simulation to find out. 


otnaydur/Shutterstock 


1. Using the pie chart provided by your teacher, label the 80% region 
“made shot” and the 20% region “missed shot.” Straighten out one of 
the ends of a paper clip so that there is a loop on one side and a 
pointer on the other. On a flat surface, place a pencil through the loop, 
and put the tip of the pencil on the center of the pie chart. Then flick 
the paper clip and see where the pointed end lands: made shot or 
missed shot. 


2. Flick the paper clip a total of 50 times, and count the number of times 
that the pointed end lands in the “made shot” region. 


3. Compute the sample proportion p’? of made shots in your simulation 
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from Step 2. Plot this value on the class dotplot drawn by your 
teacher. 


4. Repeat Steps 2 and 3 as needed to get at least 40 trials of the 
simulation for your class. 


5. Based on the class’s simulation results, how likely is it for an 80% 
shooter to make 64% or less when he shoots 50 free throws? 


6. Based on your answer to Question 5, does the observed p/=0.64 
p = 0.64 result give convincing evidence that the player is 
exaggerating? Or is it plausible that an 80% shooter can have a 
performance this poor by chance alone? 


In the activity, the shooter made only 32 of 50 free-throw attempts 
(p\=0.64)(P = 0.64) There are two possible explanations for why he made 
less than 80% of his shots: 


(p = 0.80) and his bad performance 


. The player’s claim is true (p=0.80) 
happened by chance alone. 


’. The player’s claim is false (p<0.80)(P < 0.80) That is, the population 


proportion is less than 0.80 so the sample result is not an unlikely 
outcome. 

If explanation 1 is plausible, then we don’t have convincing evidence 
that the shooter is exaggerating—his poor performance could have 
occurred purely by chance. However, if it is unlikely for an 80% shooter to 
get a proportion of 0.64 or less in 50 attempts, then we can rule out 
explanation 1. 

We used software to simulate 400 sets of 50 shots, assuming that the 
player is really an 80% shooter. Figure 9.1 shows a dotplot of the results. 
Each dot on the graph represents the sample proportion p’? of made shots 
in one set of 50 attempts. 
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In 400 sets of 50 shots, 
there were only 3 sets 


when our shooter made 
less than or equal to 
the observed p = 0.64. 


13 ele i ee 
\ 4 T T 

0.6 4 ().7 0.8 0.9 
p = 0.64 


p = sample proportion of made shots 


FIGURE 9.1 Dotplot of the simulated sampling distribution of p/ 
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, the proportion of free throws made by an 80% shooter in a sample of 50 shots. 


The simulation shows that it would be very unlikely for an 80% free- 
throw shooter to make 32 or fewer out of 50 free throws (p/<0.64) 


(p < 0.64) just by chance. This gives us convincing evidence that the 


player is less than an 80% shooter. 


Section 9.1 focuses on the underlying logic of significance tests. Once 
the foundation is laid, we consider the implications of using these tests to 
make decisions—about everything from free-throw shooting to the 
effectiveness of a new drug. In Section 9.2, we present the details of 
performing a test about a population proportion. Section 9.3 shows how to 
test a claim about a population mean. Along the way, we examine the 
connection between confidence intervals and significance tests. 
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SECTION 9.1 Significance Tests: The Basics 


LEARNING TARGETS By the end of the section, you should be able to: 


State appropriate hypotheses for a significance test about a 
population parameter. 


Interpret a P-value in context. 


Make an appropriate conclusion for a significance test. 


Interpret a Type I error and a Type II error in context. Give a 
consequence of each error in a given setting. 


As noted in the definition, a significance test starts with a careful 
statement of the claims we want to compare. Let’s take a closer look at 
how to state these claims. 


Stating Hypotheses 


In the free-throw shooter activity, the player claims that his long-run 
proportion of made free throws is p=0.80? = 9-80. This is the claim we 
seek evidence against. We call it the null hypothesis, abbreviated Ho. 
Usually, the null hypothesis is a statement of “no difference.” For the free- 


throw shooter, no difference from what he claimed gives HO:p=0.80 
H, o:-p= 0.80. 


Remember: The null hypothesis is the dull hypothesis! 


The claim we hope or suspect to be true instead of the null hypothesis 
is called the alternative hypothesis. We abbreviate the alternative 
hypothesis as Ha/4«. In this case, we suspect the player might be 
exaggerating, so our alternative hypothesis is Ha: p<0.80Ha : p < 0.80, 


DEFINITION Null hypothesis HO, Alternative 
hypothesis HaHa 
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The claim that we weigh evidence against in a significance test is 
called the null hypothesis (H0).(#0) - 


The claim that we are trying to find evidence for is the alternative 
hypothesis (Ha).(#<) - 


Some people insist that all three possibilities—greater than, less than, and equal to— 


should be accounted for in the hypotheses. For the free-throw shooter example, 
because the alternative hypothesis is Ha: p<0.80,/4 ai? p< 0.80, they would write 
the null hypothesis as Ho:p>0.80./4 0: p= 0.80. ty spite of the mathematical 
= 0.80 


appeal of covering all three cases, we use only the value p=0.80P = 


calculations. So we’ll stick with Ho:p=0.80. 0: p= 0.80. 


in our 


In the free-throw shooter example, our hypotheses are 
HO: p=0.80/0 : p = 0.80 
Ha: p<0.804a :p < 0.80 


where p is the true proportion of free throws he will make in the long run. 
The alternative hypothesis is one-sided (p<0.80)(P < 0.80) because we 
suspect the player makes less than 80% of his free throws. If you suspect 
that the true value of a parameter may be either greater than or less than 


the null value, use a two-sided alternative hypothesis. 


DEFINITION One-sided and two-sided alternative 
hypothesis 


The alternative hypothesis is one-sided if it states that a parameter is 
greater than the null value or if it states that the parameter is less than 
the null value. 


The alternative hypothesis is two-sided if it states that the parameter 
is different from the null value (it could be either greater than or less 
than). 


The null hypothesis has the form HO:parameter=null value 
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Ho : parameter = null value A one-sided alternative hypothesis has one of 
the forms Ha: parameter<null value/fa : parameter < null value oy Ha: 
parameter>null value/a : parameter > null value, A two-sided alternative 
hypothesis has the form Ha: parameter#null value 
H, : parameter # null value To determine the correct form of Ha/c, read 
the problem carefully. 

It is common to refer to a significance test with a one-sided alternative 
hypothesis as a one-sided test or one-tailed test and to a test with a two- 
sided alternative hypothesis as a two-sided test or two-tailed test. 


EXAMPLE | Juicy pineapples » 
Stating hypotheses 


PROBLEM: At the Hawaii Pineapple Company, managers are 
interested in the size of the pineapples grown in the company’s fields. 
Last year, the mean weight of the pineapples harvested from one large 
field was 31 ounces. A different irrigation system was installed in this 
field after the growing season. Managers wonder if this change will 
affect the mean weight of pineapples grown in the field this year. 


State appropriate hypotheses for performing a significance test. Be sure 
to define the parameter of interest. 


SOLUTION: 
Ho:p=3140 7p= 31 
Ha:p43140 :pF Sl 


where p = the true mean weight of all pineapples grown in the field this 
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year. 


Because managers wonder if the mean weight of this year’s pineapples 


will differ from last year’s mean weight of 31 ounces, the alternative 
hypothesis is two-sided. 


FOR PRACTICE, TRY EXERCISE 1 


oOo The hypotheses should express the belief or suspicion we have 
before we see the data. It is cheating to look at the data first and then 
frame the alternative hypothesis to fit what the data show. For example, 
the data for the pineapple study showed that x =31.935f = 31.935 ounces 
for arandom sample of 50 pineapples grown in the field this year. You 
should not change the alternative hypothesis to Ha:p>314a (> 31 after 
looking at the data. 


AP® EXAM TIP 


Hypotheses always refer to a population, not to a sample. Be sure to 


state HOM0 and Hae in terms of population parameters. It is never 
correct to write a hypothesis about a sample statistic, such as HO: 
p=0.80H0 : P = 0.80 or Ha:x 431Ma : = # 31, 


CHECK YOUR UNDERSTANDING 


‘ 


. 


y For each of the following settings, state appropriate hypotheses for 
performing a significance test. Be sure to define the parameter of 
interest. 


1. According to the National Sleep Foundation, 85% of teens are getting 
too little sleep on school nights. Jannie wonders whether this result 
holds in her large high school. She asks an SRS of 100 students at the 
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school how much sleep they get on a typical night. In all, 75 of the 
students are getting less than the recommended amount of sleep. 


2. As part of its marketing campaign for the 2010 census, the U.S. 
Census Bureau advertised “10 questions, 10 minutes—that’s all it 
takes.” On the census form itself, we read, “The U.S. Census Bureau 
estimates that, for the average household, this form will take about 10 
minutes to complete, including the time for reviewing the instructions 
and answers.” We suspect that the time it takes to complete the form 
may be longer than advertised. 


Interpreting P-values 


It may seem strange to you that we state a null hypothesis and then try to 
find evidence against it. Maybe it would help to think about how a 
criminal trial works in the United States. The defendant is “innocent until 
proven guilty.” That is, the null hypothesis is innocence and the 
prosecution must offer convincing evidence against this hypothesis and in 
favor of the alternative hypothesis: guilt. That’s exactly how significance 
tests work, although in statistics we deal with evidence provided by data 
and use a probability to say how strong the evidence is. 

In the free-throw shooter activity at the beginning of the chapter, a 
player who claimed to make 80% of his free throws made only 
p\=32/50=0.64? = 32/50 = 0.64 in a random sample of 50 free throws. 
This is evidence against the null hypothesis that p=0.80? ~ 0.80 and in 
favor of the alternative hypothesis p<0.80? < 9-80. But is the evidence 
convincing? To answer this question, we want to know how likely it is for 
an 80% shooter to make 64% or less by chance alone in a random sample 
of 50 attempts. This probability is called a P-value. 


DEFINITION P-value 


The P-value of a test is the probability of getting evidence for the 
alternative hypothesis Ha//o as strong or stronger than the observed 
evidence when the null hypothesis HO/20 is true. 
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T Ty-T 


0.6 4 


p = 0.64 
p = sample proportion of made shots 


In 400 sets of 50 shots, 
there were only 3 sets 
when our shooter made 
less than or equal to 
the observed p = 00.64. 
a) ttttiiiiiil.. 
0.7 0.8 


0.9 


Small P-values give convincing evidence for Ha/4« because they say 
that the observed result is unlikely to occur when HO/40 is true. Large P- 
values fail to give convincing evidence for Ha/« because they say that the 
observed result is likely to occur by chance alone when H0/40 is true. 

We used simulation to estimate the P-value for our free-throw shooter: 
3/400=0.00753/ 400 = 9.0075 How do we interpret this P-value? 
Assuming that the player makes 80% of his free throws in the long run, 
there is about a 0.0075 probability of getting a sample proportion of 0.64 
or less just by chance in a set of 50 shots. 

We’ll show you how to calculate P-values later. For now, let’s focus 
on interpreting them. 


EXAMPLE | Healthy bones (>) 


Interpreting a P-value 
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Martin Shields/Alamy 


PROBLEM: Calcium is a vital nutrient for healthy bones and teeth. 
The National Institutes of Health (NIH) recommends a calcium intake of 
1300 milligrams (mg) per day for teenagers. The NIH is concerned that 
teenagers aren’t getting enough calcium, on average. Is this true? 
Researchers decide to perform a test of 


Ho : p = 1300 

HO: p=1300Ha: p1<1300 Ma : # < 1300 
where i is the true mean daily calcium intake in the population of 
teenagers. They ask a random sample of 20 teens to record their food 
and drink consumption for 1 day. The researchers then compute the 
calcium intake for each student. Data analysis reveals that x =1198 
& = 1198 mg and sx=4115 = 411 mg. Researchers performed a 
significance test and obtained a P-value of 0.1404. 


a. Explain what it would mean for the null hypothesis to be true in this 
setting. 


b. Interpret the P-value. 
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SOLUTION: 


a. If Ho:p=13004 0? # = 1300 jg true, then the mean daily calcium 
intake in the population of teenagers is 1300 mg. 


b. Assuming that the mean daily calcium intake in the teen population 
is 1300 mg, there is a 0.1404 probability of getting a sample mean of 
1198 mg or less just by chance in a random sample of 20 teens. 


FOR PRACTICE, TRY EXERCISE 9 


Remember: A P-value measures the strength of evidence for the 
alternative hypothesis (and against the null hypothesis). In the preceding 
example, the sample mean was x =1198% = 1198 mg. This result gives 
some evidence for Ha: u<13004a : # < 1300 because 1198<1300 
1198 < 1300, The P-value is the probability of getting evidence for Ha/a 
as strong or stronger than the observed result when H0/40 is true. We can 
write the P-value as a conditional probability. For the “Healthy bones” 


example, the P-value =P(x <1198| 1=1300)=0.1404 
= P(E < 1198] ~ = 1300) = 0.1404 


When Ha/Za is two-sided (parameter“null value), 
(parameter  nullvalue) , values of the sample statistic less than or greater 
than the null value both count as evidence for Ha/4«. Suppose we want to 
perform a test of HO: p=0.5/40 : p = 9-5 versus Ha: p#0.5 a ‘pF 0.5 
based on an SRS with a sample proportion of p=0.65P = 0.65. This result 
gives some evidence for Ha: pz0.5 4a :P # 9-5 because 0.6540.5 
0.65 # 0.5 In this case, evidence for Ha/e as strong or stronger than the 
observed result includes any value of pip greater than or equal to 0.65 as 
well as any value of p/P less than or equal to 0.35. Why? Because 
p=0.35? = 9-35 is just as different from the null value of p=0.5? = 9-5 as 
p=0.65? = 9-65. For this scenario, the P-value is equal to the conditional 


probability P(p’<0.35 or pA20.65] p=0.5)? (P < 9.35 or p > 0.65) p = 0.5). 


Making Conclusions 


The final step in performing a significance test is to draw a conclusion 
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about the competing claims being tested. We make a decision based on the 
strength of the evidence in favor of the alternative hypothesis (and against 
the null hypothesis) as measured by the P-value. 


jgroup/Getty Images 


e Ifthe observed result is unlikely to occur by chance alone when Hoo 
is true (small P-value), we will “reject HO/0 .” 


e Ifthe observed result is not unlikely to occur by chance alone when HO 
Ap is true (large P-value), we will “fail to reject Hof.” 


This wording may seem unusual at first, but it’s consistent with what 
happens in a criminal trial. Once the jury has weighed the evidence against 
the null hypothesis of innocence, they return one of two verdicts: “guilty” 
(reject HO0 ) or “not guilty” (fail to reject Hoo). A not-guilty verdict 
doesn’t guarantee that the defendant is innocent, just that there’s not 
convincing evidence of guilt. Likewise, a fail-to-reject Hoo decision ina 
significance test doesn’t guarantee that HO/0 is true. 


HOW TO MAKE A CONCLUSION IN A SIGNIFICANCE 
TEST 


e If the P-value is small, reject Hoo and conclude that there is 


convincing evidence for HaHa (in context). 


e If the P-value is not small, fail to reject Hoo and conclude that 
there is not convincing evidence for Hae (in context). 
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In the free-throw shooter activity, the estimated P-value was 0.0075. 
Because the P-value is small, we reject HO: p=0.80/0 : p = 9.80, We have 
convincing evidence that the player makes fewer than 80% of his free 
throws in the long run. 

For the teen calcium study, the P-value was 0.1404. Because the P- 
value is not small, we fail to reject HO: u=130040 : # = 1300 We don’t 
have convincing evidence that teens are getting less than 1300 mg of 
calcium per day, on average. 

How small does a P-value have to be for us to reject HO? In 
Chapter 4 , we suggested that you use a boundary of 5% when determining 
whether a result is statistically significant. That is equivalent to saying, 
“View a P-value less than 0.05 as small.” Choosing this boundary value 
means we require evidence for Hata so strong that it would happen less 
than 5% of the time just by chance when H0/40 is true. 

Sometimes it may be preferable to use a different boundary value— 
like 0.01 or 0.10—when drawing a conclusion in a significance test. We 
will explain why shortly. The chosen boundary value is called the 
significance level. We denote it by a@, the Greek letter alpha. 


DEFINITION Significance level 


The significance level a@ is the value that we use as a boundary for 
deciding whether an observed result is unlikely to happen by chance 
alone when the null hypothesis is true. 


When we use a fixed significance level «@ to draw a conclusion in a 
significance test, here are the two possibilities: 


P-value<a — reject HO = convincing evidence for Ha (in context) 
P-value < a — reject Hj — convincing evidence for H, (in context) 


P-value>a — fail to reject HO — not convincing evidence for Ha (in context) 
P-value > a — fail to reject Hyp — not convincing evidence for H, (in context) 


If the P-value is less than a@, we say that the result is “statistically 
significant at the a=a = level.” 
Significance at the a=0.05a = 0.05 level is often expressed by the 
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statement “The results were significant (P<0.05)(P < 0.05) » Here, P 
stands for the P-value. The P-value is more informative than a statement of 
significance because it describes the strength of evidence for the 
alternative hypothesis. For example, both an observed result with P=0.03 
P = 0.03 and an observed result with P=0.0003P = 0.0003 are significant 
at the a=0.05a = 0.05 level. But the P-value of 0.0003 gives much 
stronger evidence against HO/0 and in favor of Ha/o than the P-value of 
0.03. 


EXAMPLE | Better batteries (>) 


Making conclusions 


DSGpro/Getty Images 


PROBLEM: A company has developed a new deluxe AAA battery that 
is supposed to last longer than its regular AAA battery.t However, these 
new batteries are more expensive to produce, so the company would like 
to be convinced that they really do last longer. Based on years of 
experience, the company knows that its regular AAA batteries last for 
30 hours of continuous use, on average. The company selects an SRS of 
15 deluxe AAA batteries and uses them continuously until they are 
completely drained. The sample mean lifetime is x =33.93f = 33.93 
hours. A significance test is performed using the hypotheses 


HO: u=3010 ‘p= 30 
Ha: u>304a > pe > 30 


where i is the true mean lifetime (in hours) of the deluxe AAA 
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batteries. The resulting P-value is 0.0717. What conclusion would you 
make at the a=0.05a = 0.05 level? 


SOLUTION: 


Because the P-value of 0.0717>a=0.05,9-0717 > a = 0.05, we fail to 
reject HO#Z0. We don’t have convincing evidence that the true mean 
lifetime of the company’s deluxe AAA batteries is greater than 30 
hours. 


What does the P-value of 0.0717 tell us? Assuming H0:y=30 


Ho : 4 = 30 ig true, there is a 0.0717 probability of getting a sample 
mean of 33.9 hours or more in a random sample of 15 batteries. 


FOR PRACTICE, TRY EXERCISE 15 


Beginning users of significance tests generally find it easier to compare 
a P-value to a significance level than to interpret the P-value correctly in 
context. For that reason, we will include stating a significance level as a 
required part of every significance test. We’ll also ask you to explain what 
a P-value means in a variety of settings. Just remember that the P-value 
measures the strength of evidence for the alternative hypothesis and 
against the null hypothesis. 


Be careful how you write conclusions when the P-value is large. Don’t 
conclude that the null hypothesis is true just because we didn’t find 
convincing evidence for the alternative hypothesis. For example, it would 
be incorrect to conclude that the company’s deluxe AAA batteries last 
exactly 30 hours, on average. We found some evidence that the new 
batteries last longer, but the evidence wasn’t convincing enough to reject 
Ho. oOo Never “accept H04o” or conclude that HO is true! That 


would be like the jury in a criminal trial declaring the defendant 


on | 


“innocent” when they really mean “not guilty 


This is consistent with what you learned in Chapter 8: a confidence interval that 


includes the value 30 would include many other plausible values for p also. 
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AP® EXAM TIP 


We recommend that you follow the two-sentence structure from the 
example when writing the conclusion to a significance test. The first 
sentence should give a decision about the null hypothesis—reject HOMo 


or fail to reject H0/0—based on an explicit comparison of the P-value 
to a stated significance level. The second sentence should provide a 
statement about whether or not there is convincing evidence for Ha/a 
in the context of the problem. 


In practice, the most commonly used significance level is a=0.05 
a = 0.05, This is mainly due to Sir Ronald A. Fisher, a famous statistician 
who worked on agricultural experiments in England during the early 
twentieth century. Fisher was the first to suggest deliberately using random 
assignment in an experiment. In a paper published in 1926, Fisher wrote 
that it is convenient to draw the line at about the level at which we can say: 
“Either there is something in the treatment, or a coincidence has occurred 


such as does not occur more than once in twenty trials.”2 


When a researcher plans to draw a conclusion based on a significance 
level, oOo aa should be stated before the data are produced. Otherwise, 
a deceptive user of statistics might choose o@ after the data have been 
analyzed in an attempt to manipulate the conclusion. This is just as 
inappropriate as choosing an alternative hypothesis after looking at the 
data. 


Type I and Type II Errors 


When we draw a conclusion from a significance test, we hope our 
conclusion will be correct. But sometimes it will be wrong. There are two 
types of mistakes we can make: a Type I error or a Type II error. 


DEFINITION Type I error, Type II error 


A Type I error occurs if a test rejects HO#0 when HO is true. 
That is, the test finds convincing evidence that Ha/« is true when it 
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really isn’t. 


A Type II error occurs if a test fails to reject HO#0 when Ha/Za is 
true. That is, the test does not find convincing evidence that Ha/o is 
true when it really is. 


The possible outcomes of a significance test are summarized in Figure 
ree 


Truth about the population 


Ay true H,, true 


. a Correct 
Reject H,| Type Terror : 
Conclusion | : yP conclusion 
based on 
sample ns . Correct = 
>, Rep rrit o C meee | 
Fail to reject Hy enniliniod ype II error 


FIGURE 9.2 The two types of errors in significance tests. 


If HO is true: 


e Our conclusion is correct if we don’t find convincing evidence that Ha 
Aa js true. 


¢ We make a Type I error if we find convincing evidence that Ha/a is 
true. 


If Ha/Zo is true: 


¢ Our conclusion is correct if we find convincing evidence that Ha/o is 
true. 


e We make a Type IJ error if we do not find convincing evidence that Ha 
Ha is true 


Only one error is possible at a time, depending on the conclusion we 
make. 


Here’s a helpful reminder to keep the two types of errors straight. “Fail to” goes with 


Type II. 


It is important to be able to describe Type I and Type II errors in the 
context of a problem. Considering the consequences of each of these types 
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of error is also important, as the following example shows. 


EXAMPLE |Perfect potatoes (>) 
Type I and Type II errors 


Steve Cukrov/Alamy Stock Photo 


PROBLEM: A potato chip producer and its main supplier agree that 
each shipment of potatoes must meet certain quality standards. If the 
producer determines that more than 8% of the potatoes in the shipment 
have “blemishes,” the truck will be sent away to get another load of 
potatoes from the supplier. Otherwise, the entire truckload will be used 
to make potato chips. To make the decision, a supervisor will inspect a 
random sample of 500 potatoes from the shipment. The producer will 
then perform a test at the a=0.05a = 0.05 significance level of 


HO: p=0.08/0 : p = 0.08 
Ha: p>0.08 4a :p > 0.08 
where p = the true proportion of potatoes with blemishes in a given 


truckload. Describe a Type I and a Type IJ error in this setting, and give 
a possible consequence of each. 


SOLUTION: 


Type I error: The producer finds convincing evidence that more than 
8% of the potatoes in the shipment have blemishes, when the true 
proportion is really 0.08. 

Consequence: The potato-chip producer sends away the truckload of 
acceptable potatoes, wasting time and depriving the supplier of money. 
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Type II error: The producer does not find convincing evidence that 
more than 8% of the potatoes in the shipment have blemishes, when the 
true proportion is greater than 0.08. 

Consequence: More potato chips are made with blemished potatoes, 
which may upset customers and lead to decreased sales. 


FOR PRACTICE, TRY EXERCISE 23 


Which is more serious: a Type I error or a Type II error? That depends 
on the situation. For the potato-chip producer, a Type II error seems more 
serious because it may lead to lower-quality potato chips and decreased 
sales. 

The most common significance levels are a=0.05a = 0.05, a=0.01 
a = 0.01, and a=0.10e = 0.10, Which is the best choice for a given 
significance test? That depends on whether a Type I error or a Type II 
error is more serious. 

In the “Perfect potatoes” example, a Type I error occurs if the true 
proportion of blemished potatoes in a shipment is p=0.08? = 9-98 | but we 
get a value of the sample proportion p/P small enough to yield a P-value 
less than a=0.05a = 0.05, When HO/40 is true, this will happen 5% of the 
time just by chance. In other words, P(Type I error)=a= @. 


TYPE I ERROR PROBABILITY 


The probability of making a Type I error in a significance test is equal 
to the significance level aa. 


We can decrease the probability of making a Type I error ina 
significance test by using a smaller significance level. For instance, the 
potato-chip producer could use a=0.010 = 0.01 instead of a=0.05a = 0.05 
. But there is a trade-off between P(Type I error) and P(Type II error): as 
one increases, the other decreases. If we make it more difficult to reject HO 
Ho by decreasing a@, we increase the probability that we will not find 
convincing evidence for Ha/4« when it is true. That’s why it is important 


1209 


to consider the possible consequences of each type of error before 
choosing a significance level. 


CHECK YOUR UNDERSTANDING 


(\ 


\ 


y The manager of a fast-food restaurant wants to reduce the 
proportion of drive-thru customers who have to wait longer than 2 
minutes to receive their food after placing an order. Based on store 
records, the proportion of customers who had to wait longer than 2 
minutes was p=0.63? = 9-63. To reduce this proportion, the manager 
assigns an additional employee to drive-thru orders. During the next 
month, the manager collects a random sample of 250 drive-thru times 


as AAA ; 

and finds that p\=144250=0.576? ~ 250 aes The manager then 
performs a test of the following hypotheses at the a=0.10a = 0.10 
significance level: 

HO0:p=0.63 0 : p = 0.63 

Ha:p<0.63 Ha : Pp < 0.63 
where p=? ~ the true proportion of drive-thru customers who have to 
wait longer than 2 minutes to receive their food. 
1. Describe a Type I error and a Type II error in this setting. 
2. Which type of error is more serious in this case? Justify your answer. 


3. Based on your answer to Question 3, do you agree with the 
company’s choice of a=0.102a = 0.10? Why or why not? 


4. The P-value of the manager’s test is 0.0385. Interpret the P-value. 


Section 9.1 Summary 


e A significance test is a procedure for using observed data to decide 
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between two competing claims, called hypotheses. The hypotheses are 
often statements about a parameter, like the population proportion p or 
the population mean p. 


The claim that we weigh evidence against in a significance test is called 


the null hypothesis (Ho) (0 ) The null hypothesis has the form HO: 
parameter=null value,Ho : parameter = null value. 


The claim about the population that we are trying to find evidence for is 
the alternative hypothesis (Ha)(#4 a), 


: A one-sided alternative hypothesis has the form Ha:parameter<null 


value/la : parameter < null value oy Ha:parameter>null value 
H, : parameter > null value. 


: A two-sided alternative hypothesis has the form Ha:parameter“null 
valuefta : parameter # null value 


Often, HO/0 is a statement of no change or no difference. The 
alternative hypothesis states what we hope or suspect is true. 


The P-value of a test is the probability of getting evidence for the 
alternative hypothesis Ha/4o that is as strong as or stronger than the 
observed evidence when the null hypothesis HO/0 is true. 


Small P-values are evidence against the null hypothesis and for the 
alternative hypothesis because they say that the observed result is 
unlikely to occur when HO/40 is true. To determine if a P-value should 
be considered small, we compare it to the significance level aa. 


We make a conclusion in a significance test based on the P-value. 


« If P-value <a:< @ : Reject Hoo and conclude that there is 
convincing evidence for Hata (in context). 


» If P-value >a:= @ : Fail to reject HO#0 and conclude that there is not 
convincing evidence for Ha (in context). 


When we make a conclusion in a significance test, there are two kinds 
of mistakes we can make. 


» A Type I error occurs if we reject Hoo when it is, in fact, true. In 
other words, the data give convincing evidence for Ha/4« when the 
null hypothesis is correct. 
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. A Type II error occurs if we fail to reject HO#0 when Hao is true. 
In other words, the data don’t give convincing evidence for Hatta, 
even though the alternative hypothesis is correct. 


e The probability of making a Type I error is equal to the significance 
level aa. There is a trade-off between P(Type I error) and P(Type II 
error): as one increases, the other decreases. So it is important to 
consider the possible consequences of each type of error before 
choosing a significance level. 


Section 9.1 Exercises 


In Exercises 1-6, state appropriate hypotheses for performing a 
significance test. Be sure to define the parameter of interest. 


1. pg 555 |) No homework? Mr. Tabor believes that less than 75% 


of the students at his school completed their math homework last 
night. The math teachers inspect the homework assignments from a 
random sample of 50 students at the school. 


2. Don’t argue! A Gallup poll report revealed that 72% of teens said 
they seldom or never argue with their friends.2 Yvonne wonders 
whether this result holds true in her large high school, so she surveys 
a random sample of 150 students at her school. 


3. How much juice? One company’s bottles of grapefruit juice are 
filled by a machine that is set to dispense an average of 180 milliliters 
(ml) of liquid. A quality-control inspector must check that the 
machine is working properly. The inspector takes a random sample of 
40 bottles and measures the volume of liquid in each bottle. 


4. Attitudes The Survey of Study Habits and Attitudes (SSHA) is a 
psychological test that measures students’ attitudes toward school and 
study habits. Scores range from 0 to 200. Higher scores indicate 
better attitudes and study habits. The mean score for U.S. college 
students is about 115. A teacher suspects that older students have 
better attitudes toward school, on average. She gives the SSHA to an 
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SRS of 45 of the over 1000 students at her college who are at least 30 
years of age. 


5. Cold cabin? During the winter months, the temperatures at the 
Starneses’ Colorado cabin can stay well below freezing (32°F32 F or 
0°C0 C) for weeks at a time. To prevent the pipes from freezing, 
Mrs. Starnes sets the thermostat at 50°F.50 F. The manufacturer 
claims that the thermostat allows variation in home temperature of 
o0=3°F.¢ = 3 F. Mrs. Starnes suspects that the manufacturer is 
overstating the consistency of the thermostat. 


6. Ski jump When ski jumpers take off, the distance they fly varies 
considerably depending on their speed, skill, and wind conditions. 
Event organizers must position the landing area to allow for 
differences in the distances that the athletes fly. For a particular 
competition, the organizers estimate that the variation in distance 
flown by the athletes will be o=100 = 10 meters. An experienced 
jumper thinks that the organizers are underestimating the variation. 


In Exercises 7 and 8, explain what’s wrong with the stated hypotheses. 
Then give correct hypotheses. 


7. Stating hypotheses 


a. A change is made that should improve student satisfaction with the 
parking situation at a local high school. Before the change, 37% of 
students approve of the parking that’s provided. The null 
hypothesis HO: p>0.37Ho : p > 0.37 is tested against the 
alternative Ha: p=0.37Ha > p= 0.37, 


b. A researcher suspects that the mean birth weights of babies whose 
mothers did not see a doctor before delivery is less than 3000 
grams. The researcher states the hypotheses as 


HO: x =3000 grams/o : = 3000 grams 
Ha: x <3000 grams‘a : @ < 3000 grams 
8. Stating hypotheses 


a. A change is made that should improve student satisfaction with the 
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9. 


10. 


11. 


12. 


13. 


parking situation at your school. Before the change, 37% of 
students approve of the parking that’s provided. The null 
hypothesis HO: p\=0.37/40 : p = 0.37 is tested against the 
alternative Ha: p\>0.37Ha : Pp > 0.37, 


b. A researcher suspects that the mean birth weights of babies whose 
mothers did not see a doctor before delivery is less than 3000 
grams. The researcher states the hypotheses as 


HO: p=3000 grams/o : p = 3000 grams 
Ha: p<2999 grams/a : pp < 2999 grams 
Dg 557 | ®) No homework? Refer to Exercise 1. The math teachers 


inspect the homework assignments from a random sample of 50 
students at the school. Only 68% of the students completed their math 
homework. A significance test yields a P-value of 0.1265. 


a. Explain what it would mean for the null hypothesis to be true in 
this setting. 


b. Interpret the P-value. 


Attitudes Refer to Exercise 4. In the study of older students’ 
attitudes, the sample mean SSHA score was 125.7 and the sample 
standard deviation was 29.8. A significance test yields a P-value of 
0.0101. 


a. Explain what it would mean for the null hypothesis to be true in 
this setting. 


b. Interpret the P-value. 


How much juice? Refer to Exercise 3. The mean amount of liquid in 
the bottles is 179.6 ml and the standard deviation is 1.3 ml. A 
significance test yields a P-value of 0.0589. Interpret the P-value. 


Don’t argue Refer to Exercise 2. Yvonne finds that 96 of the 150 
students (64%) say they rarely or never argue with friends. A 
significance test yields a P-value of 0.0291. Interpret the P-value. 


Interpreting a P-value A student performs a test of HO: p=100 
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14. 


15. 


16. 


17, 


18. 


19. 


20. 


Ho : # = 100 versus Ha: p>1004 : # > 100 and gets a P-value of 
0.044. The student says, “There is a 0.044 probability of getting the 
sample result I did by chance alone.” Explain why the student’s 
explanation is wrong. 


Interpreting a P-value A student performs a test of HO: p=0.3 

Ho : p= 0.3 versus Ha: p<0.340 : P < 9.3 and gets a P-value of 
0.22. The student says, “This means there is about a 22% chance that 
the null hypothesis is true.” Explain why the student’s explanation is 
wrong. 

pg 559 |p) No homework Refer to Exercises 1 and 9. What 
conclusion would you make at the a=0.05a = 0.05 level? 


Attitudes Refer to Exercises 4 and 10. What conclusion would you 
make at the a=0.05a = 0.05 level? 


How much juice? Refer to Exercises 3 and 11. 
a. What conclusion would you make at the a=0.10e = 0.10 level? 


b. Would your conclusion from part (a) change if a 5% significance 
level was used instead? Explain your reasoning. 


Don’t argue Refer to Exercises 2 and 12. 
a. What conclusion would you make at the a=0.01@ = 0.01 level? 


b. Would your conclusion from part (a) change if a 5% significance 
level was used instead? Explain your reasoning. 


Making conclusions A student performs a test of HO: p=0.75 

Hy : p= 9.75 versus Ha: p<0.754a : P < 9-75 at the a=0.05a = 0.05 
significance level and gets a P-value of 0.22. The student writes: 
“Because the P-value is large, we accept HO/0. The data provide 
convincing evidence that the null hypothesis is true.” Explain what is 
wrong with this conclusion 


Making conclusions A student performs a test of HO: p=12 

Ho: &= 12 versus Ha: p412He * H # 12 at the a=0.05a = 0.05 
significance level and gets a P-value of 0.01. The student writes: 
“Because the P-value is small, we reject HO/0. The data prove that 
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21. 


22. 


23. 


Hao is true.” Explain what is wrong with this conclusion. 


Heavy bread? The mean weight of loaves of bread produced at the 
bakery where you work is supposed to be 1 pound. You are the 
supervisor of quality control at the bakery, and you are concerned that 
new employees are producing loaves that are too light. Suppose you 
weigh an SRS of bread loaves and find that the mean weight is 0.975 
pound. 


a. State appropriate hypotheses for performing a significance test. Be 
sure to define the parameter of interest. 


b. Explain why there is some evidence for the alternative hypothesis. 


c. The P-value for the test in part (a) is 0.0806. Interpret the P-value. 


d. What conclusion would you make at the a=0.01a = 0.01 


significance level? 


Philly fanatics? Nationally, the proportion of red cars on the road is 
0.12. A statistically minded fan of the Philadelphia Phillies (whose 
team color is red) wonders if Phillies fans are more likely to drive red 
cars. One day during a home game, he takes a random sample of 210 
cars parked at Citizens Bank Park (the Phillies home field), and 
counts 35 red cars. 


a. State appropriate hypotheses for performing a significance test. Be 
sure to define the parameter of interest. 


b. Explain why there is some evidence for the alternative hypothesis. 


c. The P-value for the test in (a) is 0.0187. Interpret the P-value. 


d. What conclusion would you make at the a=0.05a = 0.05 
significance level? 


pg 561 |») Opening a restaurant You are thinking about opening 


a restaurant and are searching for a good location. From research you 
have done, you know that the mean income of those living near the 
restaurant must be over $85,000 to support the type of upscale 
restaurant you wish to open. You decide to take a simple random 
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24. 


25. 


sample of 50 people living near one potential location. Based on the 
mean income of this sample, you will perform a test of 


H0:1=$85,000 Ho : # =$85, 000 
Ha:p>$85,000 4a : # >$85, 000 


where is the true mean income in the population of people who live 
near the restaurant.‘ Describe a Type I error and a Type II error in 
this setting, and give a possible consequence of each. 


Reality TV Television networks rely heavily on ratings of TV shows 
when deciding whether to renew a show for another season. Suppose 
a network has decided that “Miniature Golf with the Stars” will only 
be renewed if it can be established that more than 12% of U.S. adults 
watch the show. A polling company asks a random sample of 2000 
U.S. adults if they watch “Miniature Golf with the Stars.” The 
network uses the data to perform a test of 


HO0:p=0.12 Hy :p = 0.12 
Ha:p>0.12 H,:p> 0.12 


where p is the true proportion of all U.S. adults who watch the show. 
Describe a Type I error and a Type II error in this setting, and give a 
possible consequence of each. 


Awful accidents Slow response times by paramedics, firefighters, 
and policemen can have serious consequences for accident victims. In 
the case of life-threatening injuries, victims generally need medical 
attention within 8 minutes of the accident. Several cities have begun 
to monitor emergency response times. In one such city, emergency 
personnel took more than 8 minutes to arrive on 22% of all calls 
involving life-threatening injuries last year. The city manager shares 
this information and encourages these first responders to “do better.” 
After 6 months, the city manager selects an SRS of 400 calls 
involving life-threatening injuries and examines the response times. 
She then performs a test at the a=0.05a = 0.05 level of HO: p=0.22 
Hy : p= 0.22 versus Ha: p<0.22Ha > p< 0.22, where p is the true 
proportion of calls involving life-threatening injuries during this 6- 
month period for which emergency personnel took more than 8 
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26. 


Zi. 


minutes to arrive. 
a. Describe a Type I error and a Type II error in this setting. 


b. Which type of error is more serious in this case? Justify your 
answer. 


c. Based on your answer to part (b), do you agree with the manager’s 
choice of a=0.05?a = 0.05? Why or why not? 


Clean water The Environmental Protection Agency (EPA) has 
determined that safe drinking water should contain at most 1.3 
mg/liter of copper, on average. A water supply company is testing 
water from a new source and collects water in small bottles at each of 
30 randomly selected locations. The company performs a test at the 
a=0.05a = 0.05 significance level of HO: p=1.3/40 : # = 1-3 versus 
Ha: u>1.34 at # > 1.3 > where i is the true mean copper content of 
the water from the new source. 


a. Describe a Type I error and a Type II error in this setting. 


b. Which type of error is more serious in this case? Justify your 
answer. 


c. Based on your answer to part (b), do you agree with the company’s 
choice of a=0.05?a = 0.05? Why or why not? 


More lefties? In the population of people in the United States, about 
10% are left-handed. After bumping elbows at lunch with several left- 
handed students, Simon wondered if more than 10% of students at his 
school are left-handed. To investigate, he selected an SRS of 50 
students and found 8 lefties (p\=8/50=0.16)\P = 8/50 = 0.16). 


To determine if these data provide convincing evidence that more 
than 10% of the students at Simon’s school are left-handed, 200 trials 
of a simulation were conducted. Each dot in the graph shows the 
proportion of students that are left-handed in a random sample of 50 
students, assuming that each student has a 10% chance of being left 
handed. 
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0.00 0.05 0.10 0.15 0.20 


p = simulated sample proportion of lefties 


a. State appropriate hypotheses for performing a significance test. Be 
sure to define the parameter of interest. 


b. Use the simulation results to estimate the P-value of the test in part 
(a). Interpret the P-value. 


c. What conclusion would you make? 


28. Who wrote this poem? Statistics can help decide the authorship of 
literary works. Sonnets by a well-known poet contain an average of 
U=6.9H = 6.9 new words (words not used in the poet’s other works) 
and a standard deviation of o=2.70 = 2.7 words, and the number of 
new words is approximately Normally distributed. Scholars expect 
sonnets by other authors to contain more new words than this famous 
poet’s works. A new manuscript has been discovered with many new 
sonnets, and scholars are debating whether it is this poet’s work. They 
take a random sample of five sonnets from the new manuscript and 
count the number of new words in each one. The mean number of 
new words in these five sonnets is x =9.2% = 9.2. 


The following dotplot shows the results of simulating 200 random 
samples of size 5 from a Normal distribution with a mean of 6.9 and a 
standard deviation of 2.7, and calculating the mean for each sample. 
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X¥ = simulated mean number of new words 


State appropriate hypotheses for performing a significance test. Be 
sure to define the parameter of interest. 


Use the simulation results to estimate the P-value of the test in part 
(a). Interpret the P-value. 


What conclusion would you make? 


Multiple Choice Select the best answer for Exercises 29-32. 


29. Experiments on learning in animals sometimes measure how long it 
takes mice to find their way through a maze. The mean time is 18 
seconds for one particular maze. A researcher thinks that a loud noise 
will cause the mice to complete the maze faster. She measures how 
long each of 10 mice takes with a loud noise as stimulus. The 
appropriate hypotheses for the significance test are 


a. 
b. 
C. 
d. 


e. 


HO: p=18; Ha: p#18/40 : w= 18; Ha se 18, 
HO: p=18; Ha: p>1840 : # = 18; Ha : > 18, 
HO: p<18;Ha: p=1840 : # < 18; Ha se = 18, 
HO: p=18; Ha: p<18/40 : # = 18; Ha se < 18, 


HO: x =18; Ha: x <18Mo0 : @ = 18; Hy: < 18, 


Exercises 30-32 refer to the following setting. Members of the city 
council want to know if a majority of city residents supports a 1% 
increase in the sales tax to fund road repairs. To investigate, they 
survey a random sample of 300 city residents and use the results to 
test the following hypotheses: 


HO: p=0.50/40 : p = 0.50 
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30. 


31. 


Ha: p>0.50a :p> 0.50 


where p is the proportion of all city residents who support a 1% 
increase in the sales tax to fund road repairs. 


A Type I error in the context of this study occurs if the city council 


a. 


finds convincing evidence that a majority of residents supports the 
tax increase, when in reality there isn’t convincing evidence that a 
majority supports the increase. 


. finds convincing evidence that a majority of residents supports the 


tax increase, when in reality at most 50% of city residents support 
the increase. 


finds convincing evidence that a majority of residents supports the 
tax increase, when in reality more than 50% of city residents do 
support the increase. 


does not find convincing evidence that a majority of residents 
supports the tax increase, when in reality more than 50% of city 
residents do support the increase. 


does not find convincing evidence that a majority of residents 
supports the tax increase, when in reality at most 50% of city 
residents do support the increase. 


In the sample, p\=158/300=0.527P = 158/300 = 0.527 The resulting 
P-value is 0.18. What is the correct interpretation of this P-value? 


a. 


b. 


Only 18% of the city residents support the tax increase. 


There is an 18% chance that the majority of residents supports the 
tax increase. 


Assuming that 50% of residents support the tax increase, there is 
an 18% probability that the sample proportion would be 0.527 or 
greater by chance alone. 


Assuming that more than 50% of residents support the tax 
increase, there is an 18% probability that the sample proportion 
would be 0.527 or greater by chance alone. 
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e. 


Assuming that 50% of residents support the tax increase, there is 
an 18% chance that the null hypothesis is true by chance alone. 


32. Based on the P-value in Exercise 31, which of the following would be 
the most appropriate conclusion? 


a. 


Because the P-value is large, we reject HO. We have convincing 
evidence that more than 50% of city residents support the tax 
increase. 


Because the P-value is large, we fail to reject HO#0. We have 
convincing evidence that more than 50% of city residents support 
the tax increase. 


Because the P-value is large, we reject Hoo. We have convincing 
evidence that at most 50% of city residents support the tax 
increase. 


. Because the P-value is large, we fail to reject Hoo. We have 


convincing evidence that at most 50% of city residents support the 
tax increase. 


Because the P-value is large, we fail to reject HOM#0. We do not 
have convincing evidence that more than 50% of city residents 
support the tax increase. 


Recycle and Review 


33. Women in math (5.3) Of the 24,611 degrees in mathematics given 
by U.S. colleges and universities in a recent year, 70% were 
bachelor’s degrees, 24% were master’s degrees, and the rest were 
doctorates. Moreover, women earned 43% of the bachelor’s degrees, 


41% of the master’s degrees, and 29% of the doctorates.2 


a. 


a 


How many of the mathematics degrees given that year were earned 
by women? Justify your answer. 


Suppose we randomly select a person who earned a mathematics 
degree in this recent year. Are the events “Degree earned by a 
woman” and “Degree was a bachelor’s degree” independent? 
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Justify your answer using appropriate probabilities. 


c. If you choose 2 of the 24,611 mathematics degrees at random, 
what is the probability that at least 1 of the 2 degrees was earned 
by a woman? Show your work. 


34. Explaining confidence (8.2) Here is an explanation from a 
newspaper concerning one of its opinion polls. Explain what is wrong 
with the following statement. 


For a poll of 1600 adults, the variation due to sampling error is no 
more than 3 percentage points either way. The error margin is 
said to be valid at the 95% confidence level. This means that, if the 
same questions were repeated in 20 polls, the results of at least 19 
surveys would be within 3 percentage points of the results of this 
survey. 
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SECTION 9.2 Tests About a Population Proportion 


LEARNING TARGETS By the end of the section, you should be able to: 


e State and check the Random, 10%, and Large Counts conditions for 
performing a significance test about a population proportion. 


e Calculate the standardized test statistic and P-value for a test about 
a population proportion. 


e Perform a significance test about a population proportion. 


A significance test can be used to test a claim about a population 
parameter. Section 9.1 presented the reasoning of significance tests, 
including the idea of a P-value. This section describes how to perform a 
significance test about a population proportion. 


Performing a Significance Test About p 


In Section 9.1, we met a basketball player who claimed to make 80% of 
his free throws. We thought that he might be exaggerating. In a sample of 
50 free throws, the player made only 32. His sample proportion of made 
free throws was therefore 


Viorika/Getty Images 
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2 
p= a 0.64 
p4=3250=0.64 50 
This result is much lower than what he claimed. Does it provide 
convincing evidence against the player’s claim? To find out, we need to 


perform a significance test of 
HO: p=0.8040 : p = 0.80 
Ha: p<0.80a (p< 0.80 


where p = the true proportion of free throws that the shooter makes in the 
long run. 


CHECKING CONDITIONS In Chapter 8, we introduced conditions that 
should be met before we construct a confidence interval for a population 
proportion p. We called them Random, 10%, and Large Counts. These 
same conditions must be verified before carrying out a significance test. 

The Large Counts condition for proportions requires that both np and 
n(1-p)” (1 — P) be at least 10. When constructing a confidence interval for 
p, we use the sample proportion p/? in place of the unknown p to check 
this condition. As we discussed in Section 8.2, npA>10"P 2 10 and 
n(1—p)>10" (1—p) = 10 are the symbolic equivalent of saying that the 
observed counts of successes and failures in the sample are both at least 
10. Because we assume HO is true when performing a significance test, 
we use the parameter value specified by the null hypothesis (denoted p0P0 
) when checking the Large Counts condition. In this case, the Large 
Counts condition says that the expected count of successes npO”P0 and of 
failures n(1—p0)” (1 — Po) are both at least 10. 


CONDITIONS FOR PERFORMING A SIGNIFICANCE 
TEST ABOUT A PROPORTION 


e Random: The data come from a random sample from the population 
of interest. 


o 10%: When sampling without replacement, n<0.10Nn < 0.10, 


e Large Counts: Both np0”Po0 and n(1-p0)” (1— po) are at least 10. 
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If the data come from a convenience sample or a voluntary response 
sample, there’s no point carrying out a significance test for p. The same is 
true if there are other sources of bias during data collection. If the Large 
Counts condition is violated, a P-value calculated from a Normal 
distribution will not be accurate. 

Let’s check the conditions for performing a significance test of the 
virtual basketball player’s claim that p=0.80? ~ 0.80. 

e Random: The 50 shots can be viewed as a random sample from the 
population of all possible shots that the shooter takes. 

o 10%: We’re not sampling without replacement from a finite 
population (because the player can keep on shooting), so we don’t 
check the 10% condition. 

e Large Counts: Assuming H040 is true, p=0.80P = 9-80. Then np0=(50) 

(0.80)=407Po = (50) (0.80) = 40 and n(1-p0)=(50)(0.20)=10 

n(1— po) = (50) (0.20) = 10 are both at least 10, so this condition is 

met. 


EXAMPLE |Get a job! (») 


Checking conditions 


ArtWell/Shutterstock.com 


PROBLEM: According to the U.S. Census Bureau, the proportion of 
students in high school who have a part-time job is 0.25. An 
administrator at a local high school suspects that the proportion of 
students at her school who have a part-time job is less than the national 
figure. She would like to carry out a test at the a=0.05a = 0.05 
significance level of 


HO: p=0.25H0 -p= 0.25 
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Ha: p<0.25Ha sp < 0.25 
where p=? ~ the true proportion of all students at the school who have a 
part-time job. 
The administrator selects a random sample of 200 students from the 


school and finds that 39 of them have a part-time job. Check if the 
conditions for performing the significance test are met. 


SOLUTION: 


e Random? Random sample of 200 students from the school. “/ 
o 10%: 200 is less than 10% of students at a large high school. 


e Large Counts? np0 = 200(0.25) = 50>10"P0 — 200 (0.25) = 50 > 10 ang 
n(1 — pO) = 200(1 — 0.25) = 150>10 
n(1—po) = 200(1—0.25) = 150 > 10, 


Be sure to use pg, not p\P , when checking the Large Counts condition! 


FOR PRACTICE, TRY EXERCISE 35 


CALCULATIONS: STANDARDIZED TEST STATISTIC AND P- 
VALUE For the free-throw shooter, the sample proportion of made shots 
was p\=3250=0.64? = 50 = 9-64 

Because this result is less than 0.80, there is some evidence against HO: 
p=0.8040 : p = 0.80 and in favor of Ha: p<0.804« : Pp < 9.80, But do we 
have convincing evidence that the player is exaggerating? To answer this 
question, we have to know how likely it is to get a sample proportion of 
0.64 or less by chance alone when the null hypothesis is true. In other 
words, we are looking for a P-value. 

Suppose for now that the null hypothesis HO: p=0.804o : p = 0.80 js 
true. Consider the sample proportion p\P of made free throws in a random 
sample of size n=50n = 50. You learned in Section 7.2 that the sampling 
distribution of p’? will have mean 


upA=p=0.804 — P = 0.80 
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and standard deviation 


op=p(1—p)n=0.80(0.20)50=0.0566 
[p(l—p) [0-80(0.20) 


sinter age ee eee 
O; en \ 50 0.0566 


Because the Large Counts condition is met, the sampling distribution of p/ 
P will be approximately Normal. Figure 9.3 displays this distribution. We 


~ _ 32 
have added the player’s sample result, p=3250=0.64? ~ 50 nee 


0.6302 0.6868 0.7434 0.8000 0.8566 0.9132 0.9698 


p = sample proportion of 
p=0.64 made free throws 


FIGURE 9.3 Normal distribution that models the sampling distribution of the 
sample proportion p/ 
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of made shots in random samples of 50 free throws by an 80% shooter. 


To assess how far the statistic (p\=0.64)(P = 0.64) is from the null 
value of the parameter (p0=0.80) (Po ~ 0.80) we standardize the statistic: 


z=p/—p0 p 0( 1- p 0)n=0.64—-0.800.0566=—2.83 
P—P 0.64—0.80 
0.0566 oa 


| P9 (1—-P9) 
| nm 


J 


This value is called the standardized test statistic. 


DEFINITION Standardized test statistic 


A standardized test statistic measures how far a sample statistic is 
from what we would expect if the null hypothesis HO//0 were true, in 
standard deviation units. That is, 


standardized test statistic=statistic — 


parameterstandard deviation of statistic 

: . 4 statistic — parameter 
standardized test statistic =—————_——-___ 
standard deviation of statistic 


Many people refer to the value of z as the test statistic. Some also write the formula as 


test statistic=statistic — null valuestandard error of statistic 


Bhs statistic — nullvalue 
teststatistic = -—————____—_—__- 
standarderrorofstatistic 


We chose to use the terminology from the AP® Statistics exam formula sheet. 


The standardized test statistic says how far the sample result is from 
the null value, and in what direction, on a standardized scale. In this case, 
the sample proportion p\=0.64? = 9-64 of made free throws is 2.83 
standard deviations less than the null value of p=0.80? = 9.80, 


You can use the standardized test statistic to find the P-value for a 
significance test. In this case, the P-value is the probability of getting a 
sample proportion less than or equal to p\=0.64? = 9-64 by chance alone 
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when HO: p=0.80/40 : p = 9.80 is true. The shaded area in Figure 9.4(a) 
shows this probability. Figure 9.4(b) shows the corresponding area to the 
left of z=-2.832 =—2-83 in the standard Normal distribution. 


Standard 
Normal curve 


P-value = 0.0023 P-value = 0.0023 


T T T T 
0.6302) 0.6868 0.7434 0.8000 0.8566 0.9132 0.9698 


p = sample proportion of 
p = 0.64 made free throws 


FIGURE 9.4 The shaded area shows the P-value for the player’s sample 
proportion of made shots (a) on the Normal distribution that models the sampling 
distribution of p/ 
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from Figure 9.3 and (b) on the standard Normal curve. 
We can find the P-value using Table A or technology. Table A gives 


P(z<-2.83)=0.0023" (2 <—2.83) = 0.0023 The TI-83/84 command 
normalcdf(lower: -1000~1000 | upper:-2.83-2-83, mean:0, SD:1) also 
gives a P-value of 0.0023. 


Note that the calculated P-value of 0.0023 is even smaller than the estimated P-value 


of 0.0075 from our earlier simulation. 


If HO/0 is true and the player makes 80% of his free throws in the 
long run, there’s only about a 0.0023 probability that he would make 32 or 
fewer of 50 shots by chance alone. This small probability confirms our 
earlier decision to reject HO“0 and gives convincing evidence that the 
player is exaggerating. 


EXAMPLE | Part-time jobs (») 
Calculating the standardized test statistic and P- 
value 


Andersen Ross/Exactostock-1555 
/Superstock 
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PROBLEM: In the preceding example, an administrator at a local high 
school decided to perform a test at the a=0.050 = 0.05 significance 
level of 


HO: p=0.254o : p = 0.25 

Ha: p<0.25 4a :9< 0.25 
where p = the true proportion of all students at the school who have a 
part-time job. The administrator selects a random sample of 200 
students from the school and finds that 39 of them have a part-time job. 


We already confirmed that the conditions for performing a significance 
test are met. 


a. Explain why the sample result gives some evidence for the alternative 
hypothesis. 


b. Calculate the standardized test statistic and P-value. 


c. What conclusion would you make? 


SOLUTION: 
a. The sample proportion of students at this school with a part-time job 
7 ae 
is p\ = 39200 = 0.195” ~ 200 Roel which is less than the national 


proportion of p = 0.25? = 9-25 (as suggested by Ha/Za), 


 — 0.195-0.25 _ _4 99 


'0.25(0.75) 


b. z= 0.195 - 0.250.25(0.75)200=-1.80 VY ™ 


standardized test statistic=statistic — parameterstandard deviation 
of statisticz=p/—p0p0(1—p0)n 
statistic — parameter 


standardized test statistic =———____________ 
standard deviation of statistic 


P—Po 


| Po (1—P9) 


n 


1234 


Standard 


Normal curve 


-3 -2] = 0 1 2 3 


z= -1.80 
Using Table A: P(z < -1.80) = 0.0359 (2 < —1.80) = 0.0359 
Using Technology: normalcdf(lower: -1000,~ 1999, upper:-1.80, 
1.80, mean: 0, SD: 1)=0.0359= 0.0359 


c. Because the P-value of 0.0359<a = 0.05,0-0359 <a = 0.05, we reject 
Ho. We have convincing evidence that the true proportion of all 


students at this large high school who have a part-time job is less 
than 0.25. 
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Remember that there are two possible explanations for why the sample 
proportion of students who have part-time jobs (p4=39/200=0.195) 
(p = 39/200 = 0.195) in the example is less than p=0.25? = 9-25. The first 
explanation is that the true proportion of all students at the school who 
have part-time jobs is 0.25 and that we got a sample proportion this small 
due to sampling variability. The second explanation is that the true 
proportion of students at the school with part-time jobs is less than 0.25. 
We ruled out the first explanation due to the small P-value (0.0359) and 
settled on the second explanation. Of course, it is possible that we made a 
Type I error by rejecting HO/0 when it is true. 


Putting It All Together: One-Sample z Test for p 


To perform a significance test, we state hypotheses, check conditions, 
calculate a standardized test statistic and P-value, and draw a conclusion in 
the context of the problem. The four-step process is ideal for organizing 
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our work. 


SIGNIFICANCE TESTS: A FOUR-STEP PROCESS 


State: State the hypotheses you want to test and the significance 
a level, and define any parameters you use. 

Plan: Identify the appropriate inference method and check 
conditions. 


Do: If the conditions are met, perform calculations. 


e Give the sample statistic(s). 
e Calculate the standardized test statistic. 


e Find the P-value. 


Conclude: Make a conclusion about the hypotheses in the context of 
the problem. 


We have shown you how to complete each of the four steps in a test 
about a population proportion. Let’s reflect on how the pieces fit together 
in a test of HO: p=p0/0 : P = Po, When the conditions are met, the 
sampling distribution of p/? is approximately Normal with 


mean tp=p and standard deviation op=p(1-p)n 


= | p(1—p) 
mean ty =p and_ standard deviation a, = \ 


n 


For confidence intervals, we substitute p? for p in the standard deviation 
formula to obtain the standard error. When performing a significance test, 
however, we start by assuming that the null hypothesis HO: p=p0 

Ho : P = Po is true. We use this null value when calculating the standard 
deviation. 

If we standardize the statistic p\? by subtracting its mean and dividing 
by its standard deviation, we get the standardized test statistic. There are 
three conditions that must be met for this formula to be valid—one for 
each of the three components in the formula. 
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—_ 


po(1 — po) 


jo 
— p — po 


. The Random condition helps ensure that p\—p0P ~ Po is a good 


estimate for the difference between the true value of p and the null value 
of Po- 


The Large Counts condition allows us to use a Normal distribution to 
model the sampling distribution of pp. When this condition is met and 
H0/4o is true, the standardized test statistic z has approximately the 
standard Normal distribution. Then we can obtain P-values from this 
distribution using Table A or technology. 


. The 10% condition allows us to use the familiar formula for the standard 


deviation of the sampling distribution of p\? (with pOPo replacing p) 
when we are sampling without replacement from a finite population. 


Here is a summary of the details for a one-sample z test for a 


proportion. 


ONE-SAMPLE z TEST FOR A PROPORTION 


Suppose the conditions are met. To perform a test of HO: p=p0 
Hy: p= Po, compute the standardized test statistic 


” 


P~P0 


poea ue 
Z=p/—p0p0(1—p0)n \ " 


Find the P-value by calculating the probability of getting a z statistic 
this large or larger in the direction specified by the alternative 
hypothesis Ha/4e using a standard Normal distribution. 


Ha: P > Po Hg: P <P H,:P # Po 
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The AP® Statistics course outline calls this test a large-sample test for a proportion 
because it is based on a Normal approximation to the sampling distribution of p\P 


that becomes more accurate as the sample size increases. 


Here is an example of the one-sample z test for p in action. 


EXAMPLE | One potato, two potato » 


Significance test for a proportion 


imagestock/Getty Images 


PROBLEM: Recall that the potato-chip producer we met in 
Section 9.1 and its main supplier agree that each shipment of 
potatoes must meet certain quality standards. If the producer 

finds convincing evidence that more than 8% of the potatoes in the 

shipment have “blemishes,” the truck will be sent away to get another 
load of potatoes from the supplier. Otherwise, the entire truckload will 
be used to make potato chips. The potato-chip producer has just 
received a truckload of potatoes from the supplier. A supervisor selects 

a random sample of 500 potatoes from the truck. An inspection reveals 

that 47 of the potatoes have blemishes. Is there convincing evidence at 

the a=0.10e = 0.10 level that more than 8% of the potatoes in the 
shipment have blemishes? 


SOLUTION: 
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STATE: We want to test 
HO: p = 0.084 : p = 0.08 
Ha: p> 0.0840 : p > 0.08 


where p = the true proportion of potatoes in this shipment with 
blemishes, using a = 0.10. = 0.10. 


STATE: State the hypotheses you want to test and the 


significance level, and define any parameters you use. 


PLAN: One-sample z test for p. 
e Random: Random sample of 500 potatoes from the shipment. / 


o 10%: It’s reasonable to assume that 500 < 10% of all potatoes in 
the shipment. / 


e Large Counts: 500(0.08) = A0>10200 (0.08) = 40 > 10 ang 500(0.92) = 
460>10209 (0.92) = 460 > 10y 


PLAN: Identify the appropriate inference method and check conditions. 


DO: 


V 


a a 
© p= 47500 = 0.094” ~ 500 — 0-094 


0.094—0.08 r 
2 = 1.10 


————— 
/ 0.08(0.92) 


e z= 0.094 — 0.080.08(0.92)500 = 1.15 \ 500 


e P-value: 
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Standard 
Normal curve 


z=1.15 


Using Table A: P(z>1.15) = 1 — 0.8749 = 0.1251 
P(z> 1.15) = 1—0.8749 = 0.1251 


Using technology: normalcdf(lower:1.15, upper:1000, mean:0, SD:1) = 
0.1251 


DO: If the conditions are met, perform calculations: 
e Give the sample statistic(s). 
e Calculate the standardized test statistic. 


e Find the P-value. 


The sample result gives some evidence in favor of Hg because 0.094 + 
0.08. 


CONCLUDE: Because our P-value of 0.1251>a = 0.10, 
0.1251 > a = 0.10, we fail to reject Hp. There is not convincing 


evidence that the true proportion of blemished potatoes in the shipment 
is greater than 0.08. 


Conclude: Make a conclusion about the hypotheses in the context of the 


problem. 


FOR PRACTICE, TRY EXERCISE 43 


| AP® EXAM TIP | 
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When a significance test leads to a fail to reject HO/0 decision, as in 
the preceding example, be sure to interpret the results as “We don’t 
have convincing evidence for Hatta.” Saying anything that sounds like 
you believe HO/0 is (or might be) true will lead to a loss of credit. For 
instance, it would be wrong to conclude, “There is convincing evidence 
that the true proportion of blemished potatoes is 0.08.” And don’t write 
responses as text messages, like “FTR the Hoo .” 


The preceding example reminds us why significance tests are 
important. The sample proportion of blemished potatoes was 
p\=47/500=0.094? = 47/500 = 0.094 This result gave some evidence 
against HO and in favor of Hae. To see whether such an outcome is 
unlikely to occur by chance alone when H040 is true, we had to carry out 
a significance test. The P-value told us that a sample proportion this large 
or larger would occur in about 12.5% of all random samples of 500 
potatoes when H0/40 is true. So we can’t rule out sampling variability as a 
plausible explanation for getting a sample proportion of p\=0.094 
p = 0.094. Of course, we could have made a Type II error in this case by 
failing to reject HOM#0 when Ha: p>0.084 : p > 9.08 is true. 


WHAT HAPPENS WHEN THE DATA DON’T SUPPORT Ha/a? 
Suppose the supervisor had inspected a random sample of 500 potatoes 
from the shipment and found 33 with blemishes. This yields a sample 
proportion of p\=33/500=0.066? = 33/500 = 0.066. This sample doesn’t 
give any evidence to support the alternative hypothesis Ha: p>0.08 
H, : p > 0.08) Don’t continue with the significance test. The conclusion is 
clear: we should fail to reject HO: p=0.080 : p = 9.08. This truckload of 
potatoes will be used by the potato-chip producer. 

If you weren’t paying attention, you might end up carrying out the test. 
Let’s see what would happen. The corresponding standardized test statistic 
is 


z=p/—p0p0(1-—p0)n=0.066—0.080.08(0.92)500=-1.15 
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P—P9 ~—_—0.066—-0.08 
py(1—pp) _, /0-08(0.92) 
———- 500 


What’s the P-value? It’s the probability of getting a z statistic this large or 
larger in the direction specified by Ha, P(z>-1.15)4a; P(z>-1.15)_ 
Figure 9.5 shows this P-value as an area under the standard Normal curve. 
Using Table A or technology, the P-value is 1—0.1251=0.8749 

1 — 0.1251 = 0.8749, There’s about an 87.5% chance of getting a sample 
proportion as large as or larger than p\=0.066P = 9-066 if p=0,08P = 9.08, 
As aresult, we would fail to reject HO#0. Same conclusion, but with lots 
of unnecessary work! 


—1.15 


= 


Standard 
Normal curve 


P-value = 0.8749 


-3 -2 fA 0 1 


z=-1.15 
FIGURE 9.5 The P-value for the one-sided test of Hg: p = 0.08 versus H,: p > 
0.08. 


2 3 


Always check to see whether the data give evidence against HO/0 in 
the direction specified by Ha/e before you do calculations. 


CHECK YOUR UNDERSTANDING 


(\ 


y According to the National Campaign to Prevent Teen and 
Unplanned Pregnancy, 20% of teens aged 13 to 19 say that they have 
electronically sent or posted sexually suggestive images of themselves.® 
The counselor at a large high school worries that the actual figure might 
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be higher at her school. To find out, she administers an anonymous 
survey to arandom sample of 250 of the school’s 2800 students. All 250 
respond, and 63 admit to sending or posting sexual images. Carry out a 
significance test at the a=0.05a = 0.05 significance level. 


Your calculator will handle the “Do” part of the four-step process, as 
this Technology Corner illustrates. However, be sure to read the Ap® 
Exam Tip that follows. 


20. Technology Corner |PERFORMING A ONE- 
SAMPLE Z TEST FOR A PROPORTION (>) 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


The TI-83/84 can be used to test a claim about a population proportion. 
We’|l demonstrate using the preceding example. In a random sample of 
size n = 500, the supervisor found X = 47 potatoes with blemishes. To 
perform a significance test: 


e Press [STAT], then choose TESTS and 1-PropZTest. 

e On the 1-PropZTest screen, enter the values shown: p0=0.08 
po = 0.08 x=47x = 47, and n=500n = 500. Specify the alternative 
hypothesis as “prop >p0~ Po.” Note: x is the number of successes 
and n is the number of trials. Both must be whole numbers! 


e If you select “Calculate” and press [ENTER], you will see that the 
standardized test statistic is z=1.152 = 1.15 and the P-value is 
0.1243. 


e If you select the “Draw” option, you will see the screen shown on the 
right. Compare these results with those in the example on page 574. 
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NORMAL FLOAT AUTO REAL RADIAN MP HH] 


1-PropZTest 


n: 500 
ProP:#po <Pe 


Color: BLUE 


Calculate Draw 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


Prop>d@. 98 
z=1.153915828 
P=0. 1242673934 
p=0, 094 

n=580 


NORMAL FLOAT AUTO REAL RADIAN MP fi 


1-Prop2tTest 
2=1.1539 pP=6,.1243 


AP® EXAM TIP 


You can use your calculator to carry out the mechanics of a significance 
test on the AP® Statistics exam. But there’s a risk involved. If you give 


just the calculator answer with no work, and one or more of your values 
are incorrect, you will probably get no credit for the “Do” step. If you 
opt for the calculator-only method, be sure to name the procedure (one- 
sample z test for a proportion) and to report the standardized test 
statistic (z=1.15)(* = 1-15) and P-value (0.1243). 
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Two-Sided Tests 


The free-throw shooter, part-time job, and blemished-potato settings 
involved one-sided tests. The P-value in a one-sided test about a 
proportion is the area in one tail of a standard Normal distribution—the tail 
specified by Hac. In a two-sided test, the alternative hypothesis has the 
form Ha: p~poZ, a: P * Po. The P-value in such a test is the probability of 
getting a sample proportion as far as or farther from pOP9 in either 
direction than the observed value of pp. As aresult, you have to find the 
area in both tails of a standard Normal distribution to get the P-value. The 
following example shows how this process works. 


EXAMPLE |Nonsmokers (>) 
A two-sided test 


snowmeo/Getty Images 
PROBLEM: According to the Centers for Disease Control and 
Prevention (CDC) website, 68% of high school students have never 
smoked a cigarette.2 Statistics class project, Yanhong surveys a simple 
random sample of 150 students from her school. She gets responses 
from all 150 students, and 90 say that they have never smoked a 
cigarette. Is there convincing evidence that the CDC’s claim does not 
hold true at Yanhong’s school? 


SOLUTION: 


STATE: We want to test 
HO: p = 0.68/40 : p = 0.68 
Ha: p,0.684a : p, 0.68 


where p = the proportion of all students in Yanhong’s school who would 
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say they have never smoked a cigarette. We’ll use a = 0.05.0 = 0.05. 


Follow the four-step process! 


PLAN: One-sample z test for p. 


e Random: Yanhong surveyed an SRS of 150 students from her school. 


© 10%: 150 students is less than 10% of all students at a large high 
school. “ 


e Large Counts: 150(0.68) = 102>10159 (0.68) = 102 > 10 ang 1500.32) 


DO: 


~ 90 __ 
© pA =90150 = 0.60? ~ is0 ~ 9-60 


0.60-0.68 _ _ 9 49 


vA 
“a 


/ 0.68(0.32) 


© z = 0.60 — 0.680.68(0.32)150=-2.10 \ 


e P-value: 


Standard 
Normal curve 


27=-2.10 z=2.10 


Using Table A: P(z< -2.10P(2 < —2-10 oF 752.10) = 2(0.0179) = 0.0358 
z > 2.10) = 2(0.0179) = 0.0358 


Using technology: normalcdf(lower: —1000, upper: —2.10, mean:0, 
SD:1) x 2= 0.03572 = 0.0357 


The sample result gives some evidence in favor of Hg because 0.60 # 
0.68. 
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The TI-83/84’s 1-PropZTest gives z = —2.10 and P-value = 0.0357. 


CONCLUDE: Because our P-value of 0.0357<a =0.05, 

0.0357 < a = 0.05, we reject HO40. We have convincing evidence that 
the proportion of all students at Yanhong’s school who would say they 
have never smoked a cigarette differs from the CDC’s claim of 0.68. 
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AP® EXAM TIP 


When making a conclusion in a significance test, be sure that you are 
describing the parameter and not the statistic. In the preceding example, 
it’s wrong to say that we have convincing evidence that the proportion 
of students at Yanhong’s school who said they have never smoked 


differs from the CDC’s claim of 0.68. The “proportion who said they 
have never smoked” is the sample proportion, which is known to be 
0.60. The test gives convincing evidence that the proportion of all 
students at Yanhong’s school who would say they have never smoked a 
cigarette differs from 0.68. 


WHY CONFIDENCE INTERVALS GIVE MORE INFORMATION 
The result of a significance test begins with a decision to reject HO#0 or 
fail to reject Hoo. In Yanhong’s smoking study, for instance, the data led 
us to reject HO: p=0.680 : p = 0.68 because we found convincing 
evidence that the proportion of students at her school who would say they 
have never smoked cigarettes differs from the CDC’s claim. We’re left 
wondering what the actual proportion p may be. A confidence interval can 
shed some light on this issue. You learned how to calculate a confidence 
interval for a population proportion in Section 8.2. 


A 95% confidence interval for p is 


0.60+1.960.60(1-0.60)150=0.60+0.078=(0.522,0.678) 
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/0.60(1—0.60) 


0.60 + 004) = 0.60 + 0.078 = (0.522,0.678) 
150 


This interval gives the values for p that are plausible based on the 
sample data. We would not be surprised if the proportion of all students at 
Yanhong’s school who would say they have never smoked cigarettes was 
any value between 0.522 and 0.678. However, we would be surprised if 
the true proportion was 0.68 because this value is not contained in the 
confidence interval. Figure 9.6 gives computer output from Minitab 
software that includes both the results of the significance test and the 
confidence interval. 


_! Session 


Test and Cl for One Proportion 


FIGURE 9.6 Minitab output for the two-sided significance test at « = 0.05 and a 
95% confidence interval for Yanhong’s smoking study. 


There is a link between confidence intervals and two-sided tests. The 
95% confidence interval (0.522, 0.678) gives an approximate set of p0’s 
Po’S that would not be rejected by a two-sided test at the a=0.050 = 0.05 
significance level. Any pOP0 value outside the interval would be rejected 
as implausible. 


With proportions, the link isn’t perfect because the standard error used 
for the confidence interval is based on the sample proportion pp , while 
the denominator of the standardized test statistic is based on the value pO 
Po from the null hypothesis. 


Standardized test statistic: z=p/—p0p0(1—p0)n 
Standardized test statistic: z = ; iE 
| Po (1—Po) 


J 


Tm 
Confidence interval: p\+z*p\(1-p)n 


/p(1—p 
Confidence interval: p + z* \ ye? 


Tl 
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The big idea is still worth considering: a two-sided test at significance 
level a anda 100(1-a),109 (1 — @), confidence interval (a 95% confidence 
interval if a=0.05e = 0.05) give similar information about the population 
parameter. There is a connection between one-sided tests and confidence 
intervals, but it is beyond the scope of this course. 


CHECK YOUR UNDERSTANDING 


( 


y According to the National Institute for Occupational Safety and 
Health, job stress poses a major threat to the health of workers. A news 
report claims that 75% of restaurant employees feel that work stress has 
a negative impact on their personal lives.2 Managers of a large 
restaurant chain wonder whether this claim is valid for their employees. 
A random sample of 100 employees finds that 68 answer “Yes” when 
asked, “Does work stress have a negative impact on your personal life?” 


1. Do these data provide convincing evidence at the a=0.10e = 0.10 
significance level that the proportion of all employees in this chain 
who would say “Yes” differs from 0.75? 


2. The figure shows Minitab output from a significance test and 
confidence interval for the restaurant worker data. Explain how the 
confidence interval is consistent with, but gives more information 
than, the test. 

_] Session 


Test and Cl for One Proportion 


Section 9.2 Summary 
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The conditions for performing a significance test of HO: p=p0 
Hy : p= po are 


= Random: The data come from a random sample from the population 
of interest. 
» 10%: When sampling without replacement, n<0.10Nr < 0.10N, 
. Large Counts: Both np0”Po and n(1-p0)” (1— Po) are at least 10. 
The standardized test statistic for a one-sample z test for a proportion 
is 


_— -P-PO 


| Po (1—Po) 
z=p\—popo(i-p0)n Yo” 
When the Large Counts condition is met, the standardized test statistic 
has approximately a standard Normal distribution. You can use Table A 
or technology to find the P-value. 
Follow the four-step process when you perform a significance test: 


State: State the hypotheses you want to test and the significance level, 
and define any parameters you use. 


Plan: Identify the appropriate inference method and check conditions. 
Do: If the conditions are met, perform calculations: 

« Give the sample statistic(s). 

» Calculate the standardized test statistic. 

=» Find the P-value. 

Conclude: Make a conclusion about the hypotheses in the context of the 
problem. 


Confidence intervals provide additional information that significance 
tests do not—namely, a set of plausible values for the population 
proportion p. A two-sided test of HO: p=po 0* P > Po at significance 
level a usually gives the same conclusion as a 100(1-a)% 100 (1—a) % 
confidence interval. 
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9.2 Technology Corner >» 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 
20. Performing a one-sample z test for a proportion Page 576 


Section 9.2 Exercises 


35. pg 569 |») Home computers Jason reads a report that says 80% of 


36. 


37. 


U.S. high school students have a computer at home. He believes the 
proportion is smaller than 0.80 at his large rural high school. Jason 
chooses an SRS of 60 students and finds that 41 have a computer at 
home. He would like to carry out a test at the a=0.05a = 0.05 
significance level of HO: p=0.80/0 : p = 9.80 versus Ha:p<0.80 

H, : p < 0.80) where p = the true proportion of all students at Jason’s 
high school who have a computer at home. Check if the conditions 
for performing the significance test are met. 


Walking to school A recent report claimed that 13% of students 
typically walk to school.2 DeAnna thinks that the proportion is higher 
than 0.13 at her large elementary school. She surveys a random 
sample of 100 students and finds that 17 typically walk to school. 
DeAnna would like to carry out a test at the a=0.05a = 0.05 
significance level of HO: p=0.13440 : p= 0.13 versus Ha: p>0.13 

H, : p > 0.13) where p = the true proportion of all students at her 
elementary school who typically walk to school. Check if the 
conditions for performing the significance test are met. 


Fair coin? You want to determine if a coin is fair. So you toss it 10 
times and record the proportion of tosses that land “heads.” You 
would like to perform a test of HO: p=0.5/40 : p = 9-5 versus Ha: 
p~0.5Ha : p#0.5 where p = the proportion of all tosses of the coin 
that would land “heads.” Check if the conditions for performing the 
significance test are met. 
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38. Fire the coach! A college president says, “More than two-thirds of 
the alumni support my firing of Coach Boggs.” The president’s 
statement is based on 200 emails he has received from alumni in the 
past three months. The college’s athletic director wants to perform a 
test of HO: p=2/3440 : P = 2/3 versus Ha: p>2/344a : P > 2/3 where 
p = the true proportion of the college’s alumni who favor firing the 
coach. Check if the conditions for performing the significance test are 
met. 


39. pg 571 |») Home computers Refer to Exercise 35. 


a. Explain why the sample result gives some evidence for the 
alternative hypothesis. 


b. Calculate the standardized test statistic and P-value. 
c. What conclusion would you make? 
40. Walking to school Refer to Exercise 36. 


a. Explain why the sample result gives some evidence for the 
alternative hypothesis. 


b. Calculate the standardized test statistic and P-value. 


c. What conclusion would you make? 


41. Significance tests A test of HO: p=0.5/40 : p = 9.5 versus Ha: p>0.5 
H, : p > 0.5 based on a sample of size 200 yields the standardized 
test statistic z=2.19z = 2.19. Assume that the conditions for 
performing inference are met. 


a. Find and interpret the P-value. 


b. What conclusion would you make at the a=0.01a = 0.01 
significance level? Would your conclusion change if you used 
a=0.05a@ = 0.05 instead? Explain your reasoning. 


c. Determine the value of p\=? = the sample proportion of 
successes. 


42. Significance tests A test of HO: p=0.65440 : p = 0.65 against Ha: 
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44. 


45. 


p<0.6542, a: p < 0.65 based on a sample of size 400 yields the 
standardized test statistic z=-1.782 =—1.78. 


a. Find and interpret the P-value. 


b. What conclusion would you make at the a=0.10e = 0.10 
significance level? Would your conclusion change if you used 
a=0.05a@ = 0.05 instead? Explain your reasoning. 


c. Determine the value of p\=? = the sample proportion of 
successes. 


pg 574 |») Bullies in middle school A media report claims that 


more than 75% of middle school students engage in bullying 
behavior. A University of Illinois study on aggressive behavior 
surveyed a random sample of 558 middle school students. When 
asked to describe their behavior in the last 30 days, 445 students 
admitted that they had engaged in physical aggression, social ridicule, 
teasing, name-calling, and issuing threats—all of which would be 
classified as bullying.12 Do these data provide convincing evidence at 
the a=0.05a = 0.05 significance level that the media report’s claim is 
correct? 


Watching grass grow The germination rate of seeds is defined as the 
proportion of seeds that sprout and grow when properly planted and 
watered. A certain variety of grass seed usually has a germination rate 
of 0.80. A company wants to see if spraying the seeds with a 
chemical that is known to increase germination rates in other species 
will increase the germination rate of this variety of grass. The 
company researchers spray a random sample of 400 grass seeds with 
the chemical, and 339 of the seeds germinate. Do these data provide 
convincing evidence at the a=0.05a = 0.05 significance level that the 
chemical is effective for this variety of grass? 


Better parking A local high school makes a change that should 
improve student satisfaction with the parking situation. Before the 
change, 37% of the school’s students approved of the parking that 
was provided. After the change, the principal surveys an SRS of 200 
from the more than 2500 students at the school. In all, 83 students say 
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46. 


47. 


48. 


49. 


that they approve of the new parking arrangement. The principal cites 
this as evidence that the change was effective. 


a. Describe a Type I error and a Type II error in this setting, and give 
a possible consequence of each. 


b. Is there convincing evidence that the principal’s claim is true? 


Side effects A drug manufacturer claims that less than 10% of 
patients who take its new drug for treating Alzheimer’s disease will 
experience nausea. To test this claim, researchers conduct an 
experiment. They give the new drug to a random sample of 300 out of 
5000 Alzheimer’s patients whose families have given informed 
consent for the patients to participate in the study. In all, 25 of the 
subjects experience nausea. 


a. Describe a Type I error and a Type II error in this setting, and give 
a possible consequence of each. 


b. Do these data provide convincing evidence for the drug 
manufacturer’s claim? 


Cell-phone passwords A consumer organization suspects that less 
than half of parents know their child’s cell-phone password. The Pew 
Research Center asked a random sample of parents if they knew their 
child’s cell-phone password. Of the 1060 parents surveyed, 551 
reported that they knew the password. Explain why it isn’t necessary 
to carry out a significance test in this setting. 


Proposition X A political organization wants to determine if there is 
convincing evidence that a majority of registered voters in a large city 
favor Proposition X. In an SRS of 1000 registered voters, 482 favor 
the proposition. Explain why it isn’t necessary to carry out a 
significance test in this setting. 


Mendel and the peas Gregor Mendel (1822-1884), an Austrian 
monk, is considered the father of genetics. Mendel studied the 
inheritance of various traits in pea plants. One such trait is whether 
the pea is smooth or wrinkled. Mendel predicted a ratio of 3 smooth 
peas for every 1 wrinkled pea. In one experiment, he observed 423 


1254 


30. 


ol. 


a2. 


smooth and 133 wrinkled peas. Assume that the conditions for 
inference are met. 


a. State appropriate hypotheses for testing Mendel’s claim about the 
true proportion of smooth peas. 


b. Calculate the standardized test statistic and P-value. 


c. Interpret the P-value. What conclusion would you make? 


Spinning heads? When a fair coin is flipped, we all know that the 
probability the coin lands on heads is 0.50. However, what if a coin is 
spun? According to the article “Euro Coin Accused of Unfair 
Flipping” in the New Scientist, two Polish math professors and their 
students spun a Belgian euro coin 250 times. It landed heads 140 
times. One of the professors concluded that the coin was minted 
asymmetrically. A representative from the Belgian mint indicated the 
result was just chance. Assume that the conditions for inference are 
met. 


a. State appropriate hypotheses for testing these competing claims 
about the true proportion of spins that will land on heads. 


b. Calculate the standardized test statistic and P-value. 


c. Interpret the P-value. What conclusion would you make? 


pg 577 |») Teen drivers A state’s Division of Motor Vehicles 


(DMV) claims that 60% of all teens pass their driving test on the first 
attempt. An investigative reporter examines an SRS of the DMV 
records for 125 teens; 86 of them passed the test on their first try. Is 
there convincing evidence at the a=0.05a = 0.05 significance level 
that the DMV’s claim is incorrect? 


We want to be rich In a recent year, 73% of first-year college 
students responding to a national survey identified “being very well- 
off financially” as an important personal goal. A state university finds 
that 132 of an SRS of 200 of its first-year students say that this goal is 
important. Is there convincing evidence at the a=0.05a = 0.05 
significance level that the proportion of all first-year students at this 
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university who think being very well-off is important differs from the 
national value of 73%? 


Teen drivers Refer to Exercise 51. 


a. Construct and interpret a 95% confidence interval for the true 


proportion p of all teens in the state who passed their driving test 
on the first attempt. Assume that the conditions for inference are 
met. 


b. Explain why the interval in part (a) provides more information than 


the test in Exercise 51. 


We want to be rich Refer to Exercise 52. 


a. Construct and interpret a 95% confidence interval for the true 


proportion p of all first-year students at the university who would 
identify being very well-off as an important personal goal. Assume 
that the conditions for inference are met. 


b. Explain why the interval in part (a) provides more information than 


the test in Exercise 52. 


Do you Tweet? The Pew Internet and American Life Project asked a 
random sample of U.S. adults, “Do you ever ... use Twitter or 
another service to share updates about yourself or to see updates 
about others?” According to Pew, the resulting 95% confidence 
interval is (0.123, 0.177).+4 Based on the confidence interval, is there 
convincing evidence that the true proportion of U.S. adults who 
would say they use Twitter or another service to share updates differs 
from 0.17? Explain your reasoning. 


Losing weight A Gallup poll found that 59% of the people in its 
sample said “Yes” when asked, “Would you like to lose weight?” 
Gallup announced: “For results based on the total sample of national 
adults, one can say with 95% confidence that the margin of 
(sampling) error is +3+3 percentage points.”!2 Based on the 
confidence interval, is there convincing evidence that the true 
proportion of U.S. adults who would say they want to lose weight 
differs from 0.55? Explain your reasoning. 
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57. Reporting cheating What proportion of students are willing to report 
cheating by other students? A student project put this question to an 
SRS of 172 undergraduates at a large university: “You witness two 
students cheating on a quiz. Do you go to the professor?” The 
Minitab output shows the results of a significance test and a 95% 
confidence interval based on the survey data.12 


@ Session @O)(x) 


Test and Cl for One Proportion 


Z-Value P-Value 
-1.45 0.146 


a. Define the parameter of interest. 


b. Check that the conditions for performing the significance test are 
met in this case. 


c. Interpret the P-value. 


d. Do these data give convincing evidence that the population 
proportion differs from 0.15? Justify your answer with appropriate 
evidence. 


58. Teens and sex The Gallup Youth Survey asked a random sample of 
U.S. teens aged 13 to 17 whether they thought that young people 
should wait until marriage to have sex./4 The Minitab output shows 
the results of a significance test and a 95% confidence interval based 
on the survey data. 


& Session 


Test and CI for One Proportion 


a. Define the parameter of interest. 


b. Check that the conditions for performing the significance test are 
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met in this case. 
c. Interpret the P-value. 


d. Do these data give convincing evidence that the actual population 
proportion differs from 0.5? Justify your answer with appropriate 
evidence. 


Multiple Choice Select the best answer for Exercises 59-62. 


59. 


60. 


After once again losing a football game to the archrival, a college’s 
alumni association conducted a survey to see if alumni were in favor 
of firing the coach. An SRS of 100 alumni from the population of all 
living alumni was taken, and 64 of the alumni in the sample were in 
favor of firing the coach. Suppose you wish to see if a majority of all 
living alumni is in favor of firing the coach. The appropriate 
standardized test statistic is 


0.64—0.5 
a. 2=0.64-0.50.64(0.36)100 VY 100 — 
0.64—0.5 
b. t=0.64-0.50.64(0.36)100 VY 100 — 
0.64—0.5 
c. 2=0.64-0.50.5(0.5)100 V 100 — 
0.64—0.5 


/0.64(0.36) 
d. 2=0.64-0.50.64(0.36)64 V 64 
0.5-0.64 


e. 2=0.5-0.640.5(0.5)100 VY 100 


, A 
~ 


_—_— 
ao 


N 


vA 
~ 


Which of choices (a) through (d) is not a condition for performing a 
significance test about a population proportion p? 
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a. The data should come from a random sample from the population 
of interest. 


b. Both np0O”Po and n(1—p0)” (1 — Po) should be at least 10. 


c. If you are sampling without replacement from a finite population, 
then you should sample less than 10% of the population. 


d. The population distribution should be approximately Normal, 
unless the sample size is large. 


e. All of the above are conditions for performing a significance test 
about a population proportion. 


61. The standardized test statistic for a test of HO: p=0.4/40 : p = 0.4 
versus Ha: p~0.4Ha : Pp # 0.4 ig 7=2 432 = 2.43. This test is 


a. not significant at either a=0.05a@ = 0.05 or a=0.01a = 0.01, 
b. significant at a=0.050 = 0.05, but not at a=0.01e = 0.01, 
c. significant at a=0.010 = 0.01, but not at a=0.05a = 0.05, 
d. significant at both a=0.05@ = 0.05 and a=0.01a0 = 0.01, 


e. inconclusive because we don’t know the value of p/P. 


62. Which of the following 95% confidence intervals would lead us to 
reject HO: p=0.30/0 : p = 9.30 in favor of Ha: p#0.304a : p 0.30 
at the 5% significance level? 


a. (0.19, 0.27) 
b. (0.24, 0.30) 
c. (0.27, 0.31) 
d. (0.29, 0.38) 
e. None of these 


Recycle and Review 


63. Packaging DVDs (6.2, 5.3) A manufacturer of digital video discs 
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(DVDs) wants to be sure that the DVDs will fit inside the plastic 
cases used as packaging. Both the cases and the DVDs are circular. 
According to the supplier, the diameters of the plastic cases vary 
Normally with mean p=5.3/ = 5-3 inches and standard deviation 
o=0.01¢ = 0.01 inch. The DVD manufacturer produces DVDs with 
mean diameter p=5.264 — 5.26 inches. Their diameters follow a 
Normal distribution with o=0.020 = 0.02 inch. 


a. Let X = the diameter of a randomly selected case and Y = the 
diameter of a randomly selected DVD. Describe the shape, center, 
and variability of the distribution of the random variable X-Y 
X — Y_ What is the importance of this random variable to the DVD 
manufacturer? 


b. Calculate the probability that a randomly selected DVD will fit 


inside a randomly selected case. 


c. The production process runs in batches of 100 DVDs. If each of 
these DVDs is paired with a randomly chosen plastic case, find the 
probability that all the DVDs fit in their cases. 


Cash to find work? (4.2) Will cash bonuses speed the return to work 
of unemployed people? The Illinois Department of Employment 
Security designed an experiment to find out. The subjects were 
10,065 people aged 20 to 54 who were filing claims for 
unemployment insurance. Some were offered $500 if they found a job 
within 11 weeks and held it for at least 4 months. Others could tell 
potential employers that the state would pay the employer $500 for 


hiring them. A control group got neither kind of bonus.22 


a. Describe a completely randomized design for this experiment. 


b. Explain how you would use a random number generator to assign 
the treatments. 


c. What is the purpose of the control group in this setting? 
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SECTION 9.3 Tests About a Population Mean 


LEARNING TARGETS By the end of the section, you should be able to: 


e State and check the Random, 10%, and Normal/Large Sample 
conditions for performing a significance test about a population 
mean. 


Calculate the standardized test statistic and P-value for a test about 
a population mean. 


Perform a significance test about a population mean. 


Use a confidence interval to make a conclusion for a two-sided test 
about a population parameter. 


Interpret the power of a significance test and describe what factors 
affect the power of a test. 


You learned how to construct a confidence interval for a population mean 
in Section 8.3. Now we’Il examine the details of testing a claim about a 
population mean pF. 


Carrying Out a Significance Test for p 


In an earlier example, a company claimed to have developed a deluxe 
AAA battery that lasts longer than its regular AAA batteries. Based on 
years of experience, the company knows that its regular AAA batteries last 
for 30 hours of continuous use, on average. To test the company’s claim, 
we want to perform a test at the a=0.05a@ = 0.05 significance level of 


HO: u=3040: p= 30 
Ha: >30Ha: p> 30 
where uF is the true mean lifetime (in hours) of the deluxe AAA batteries. 


Our next step is to check that the conditions for performing this 
significance test are met. 
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CHECKING CONDITIONS In Chapter 8, we introduced conditions that 
should be met before we construct a confidence interval for a population 
mean. We called them Random, 10%, and Normal/Large Sample. These 
same conditions must be verified before performing a significance test 
about a population mean. 


CONDITIONS FOR PERFORMING A SIGNIFICANCE 
TEST ABOUT A MEAN 


Random: The data come from a random sample from the population 
of interest. 


o 10%: When sampling without replacement, n<0.10Nn < 0.10N, 


Normal/Large Sample: The population has a Normal distribution or 
the sample size is large (n>30)(" = 30) If the population distribution 
has unknown shape and n<30n < 30, use a graph of the sample data 
to assess the Normality of the population. Do not use t procedures if 


the graph shows strong skewness or outliers. 


AP® EXAM TIP 


It is not enough just to make a graph of the data on your calculator 
when assessing Normality. You must sketch the graph on your paper 
and make an appropriate comment about it to receive credit. You don’t 
have to draw more than one graph—a single appropriate graph will do. 


If the sample size is large (n>30)(” = 30) the Normal/Large Sample 


condition is met. This condition is more difficult to check if the sample 
size is small (n<30)(” < 30) In that case, we have to examine a graph of 
the sample data to see if it is reasonable to believe that the population 


distribution is Normal. 


EXAMPLE | Better batteries (>) 
Checking conditions 
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zentilia/Getty Images 


PROBLEM: Here are the lifetimes (in hours) of the 15 deluxe AAA 
batteries from the company’s simple random sample: 


17 OZ 22 45 30 36 o1 27 
oy) 47 35 33 44 ae 31 


Check if the conditions for performing the significance test are met. 
SOLUTION: 


e Random: SRS of 15 deluxe AAA batteries. “ 


o 10%: Assume that 15 is less than 10% of all the company’s deluxe 
AAA batteries. / 


e Normal/Large Sample: The dotplot does not show strong skewness or 
outliers. / 


e 
% e © e600 s¢¢ eee ue 
15 20 25 30 35 40 45 50 55 
Battery life (h) 


Because the graph shows that there are no outliers or strong skewness in 


the sample, it is plausible that the population distribution of deluxe AAA 
battery lifetimes is Normal. 


FOR PRACTICE, TRY EXERCISE 65 


We used a dotplot to check the Normal/Large Sample condition in the 
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example because it is an easy graph to make by hand. You can also make a 
stemplot, histogram, boxplot, or Normal probability plot to check this 
condition. Figure 9.7 shows all four of these graphs for the battery lifetime 
data. The stemplot, histogram, and boxplot are roughly symmetric and 
have no outliers. The Normal probability plot is fairly linear, as we would 
expect if the data came from a Normally distributed population of battery 


lifetimes. 


Key: 1 | 7 = battery life 
of 17 hours. 


Ob BWWND 
wn 
on 
~N 


pS 
2 


1S 20 25 30 35 40 45 50 
(c) Battery life (h) 


Frequency 


(b) 


“score 


Expected z: 


4-4 


30 35 40) 45 50 55 
Battery life (h) 


T T T T T T 


30 35 40) 45 50 55 


Battery life (h) 


FIGURE 9.7 (a) A stemplot, (b) a histogram, (c) a boxplot, and (d) a Normal 
probability plot of the lifetimes of a simple random sample of 15 deluxe AAA 


batteries. None of the graphs shows any strong skewness or outliers, so it is 


plausible that the population distribution of deluxe AAA battery lifetimes is 


Normal. 


CALCULATIONS: STANDARDIZED TEST STATISTIC AND P- 
VALUE In the “Better batteries” example, the sample mean lifetime for 
the SRS of 15 deluxe batteries is x =33.93£ = 33.93 hours and the 
standard deviation is sx=9.825: = 9-82 hours. Because the sample mean of 
33.93 hours is greater than 30 hours, we have some evidence against HO: 
u=30/40 ‘ # = 30 and in favor of Ha: u>304a :# > 30 But do we have 
convincing evidence that the true mean lifetime of the company’s new 
AAA batteries is greater than 30 hours? To answer this question, we have 
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to know how likely it is to get a sample mean of 33.93 hours or more by 
chance alone when the null hypothesis is true. As with proportions, we will 
calculate a standardized test statistic and a P-value to find out. 


When performing a significance test, we do calculations assuming that 
the null hypothesis HO/40 is true. The standardized test statistic measures 
how far the sample result diverges from the null parameter value, in 
standardized units. As before, 


standardized test statistic=statistic — 


parameterstandard deviation of statistic 

: ma statistic — parameter 
standardized test statistic = — ———_——______ 
standard deviation of statistic 

For a test of HO: u=po 0: # =~ 0, our statistic is the sample mean x =. 
The standard deviation of the sampling distribution of x @ is 


o 
0; =— 


= in 
ox =on a“ 


In an ideal world, our standardized test statistic would be 


z=x —y0on we 


When the Normal/Large Sample condition is met, the standardized test 
Statistic z can be modeled by the standard Normal distribution. We could 
then find the P-value from a standard Normal distribution if the 
Normal/Large Sample condition is met. 


There are only a few real-world situations in which we might know the population 


standard deviation o. If we do, then we can calculate P-values using Table A or 
technology. The TI-83/84’s Z-Test option in the TESTS menu is designed for this 


special situation. 


Because the population standard deviation o is almost always 
unknown, we use the sample standard deviation sx*« in its place. The 
resulting standardized test statistic has the standard error of x = in the 
denominator and is denoted by t (you will see why shortly). So the formula 
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becomes 


t=x -pOsxn = V” 


The standardized test statistic for the “Better batteries” example is 
therefore 


33.93—30 
= = 1.55 
9.82 si 

t=33.93-309.8215=1.55 V15 


When the Normal/Large Sample condition is met, the standardized test 
Statistic 


t=x —yOsxn men 


can be modeled by a t distribution. As you learned in Section 8.3, we 
specify a particular t distribution by giving its degrees of freedom (df). 
When we perform inference about a population mean J using a t 
distribution, the appropriate degrees of freedom are found by subtracting 1 
from the sample size n, making df=n-14f = n — 1, 


DEFINITION t distribution 


A t distribution is described by a symmetric, single-peaked, bell- 
shaped density curve. Any t distribution is completely specified by its 
degrees of freedom (df). When performing inference about a 
population mean based on a random sample of size n when the 
population standard deviation o7 is unknown, use a t distribution 
with df=n-1¢f = — 1, 


Figure 9.8 compares the density curves of the standard Normal 
distribution and the ¢ distributions with 2 and 9 degrees of freedom. The 
figure illustrates these facts about the t distributions: 
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t distributions have 
more area in the tails 
than the standard 
Normal distribution. J 


-- t,2 degrees 

of freedom 

~~ t,9 degrees 

of freedom 
— Standard 

Normal 


FIGURE 9.8 Density curves for the t distributions with 2 and 9 degrees of 
freedom and the standard Normal distribution. All are symmetric with center 0. 
The t distributions have more variability and a slightly different shape than the 
standard Normal distribution. 


e The t distributions are similar in shape to the standard Normal 
distribution. They are symmetric about 0, single-peaked, and bell- 
shaped. 


e The ¢ distributions have more variability than the standard Normal 
distribution. It is more likely to get an extremely large value of t (say, 
greater than 3) than an extremely large value of z because the t 
distributions have more area in the tails of the distribution. 


e As the degrees of freedom increase, the t distributions approach the 
standard Normal distribution. 


We can use Table B to find a P-value from the appropriate t 
distribution when performing a test about a population mean. In the 
“Better batteries” example, we are testing 


HO: u=3040: pp = 30 
Ha: p>30Ha: pe > 30 
where p= ~ the true mean lifetime (in hours) of the company’s deluxe 


AAA batteries. An SRS of n=15n = 15 batteries yielded an average 
lifetime of x =33.93% = 33.93 hours and a standard deviation of sx=9.82 
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8x = 9.82 hours. The P-value is the probability of getting a result as large 
as or larger than x =33.93% = 33.93 just by chance when HO: p=30 

Ho : 4 = 30 ig true. In symbols, P-value=P(x >33.93|1=30) 

P—value = P(% > 33.93| = 30), Earlier, we calculated the standardized 
test statistic to be t=1.55¢ = 1.55, So we estimate the P-value by finding 
P(t>1.55)P (¢ = 1-55) in a ¢ distribution with df=15-1=14df = 15 -1= 14 
. The shaded area in Figure 9.9 shows this probability. 


t distribution 
with df = 14 


P-value = 
P(t = 1.55) 


4 -3 -2 -!l 0) l 2 3 4 


FIGURE 9.9 The shaded area shows the P-value for the “Better batteries” example 
as the area to the right of t=1.55 


. 


in at distribution with 14 degrees of freedom. 


We can find this P-value using Table B. Go to the df=14df = 14 row. 
The t statistic falls between the values 1.345 and 1.761. If you look at the 
top of the corresponding columns in Table B, you’ll find that the “Upper- 
tail probability p” is between 0.10 and 0.05. (See the excerpt from Table 
B.) Because we are looking for P(t21.55)” (t> 1.55), this is the 
probability we seek. That is, the P-value for this test is between 0.05 and 
0.10. 
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Upper-tail probability p 


df 0.10 0.05 0.025 
13 1.350 17 71 2.160 
14 1.345 1.761 2.145 
15 1.341 1.753 2.131 

80% 90% 95% 


Confidence level C 


As you can see, Table B gives an interval of possible P-values for a 
significance test. We can still draw a conclusion from the test in much the 
same way as if we had a single probability. Let’s illustrate using the 
“Better batteries” example. Because the P-value of between 0.05 and 0.10 
0.10 is greater than a=0.05e = 0.05, we fail to reject HOM#0. We don’t 
have convincing evidence that the true mean lifetime of the company’s 
deluxe AAA batteries is greater than 30 hours. 


Table B has two other limitations for finding P-values. 


e The table shows probabilities for only positive values of t. To find a P- 
value for a negative value of t, we use the symmetry of the t 
distributions. For example, P(t=1.55)=P(t<—1.55) 

P(t > 1.55) = P(t < —1.55) when using at distribution with a 
particular df. 


e The table includes probabilities only for t distributions with degrees of 
freedom from 1 to 30 and then skips to df=40, 50, 60, 80, 100 
df = 40, 50, 60, 80, 100 and 1000. (The bottom row gives probabilities 
for df=oodf = co, which corresponds to the standard Normal 
distribution. ) oOo If the df you need isn’t provided in Table B, use the 
next lower df that is available. It’s not fair “rounding up” to a larger 
df, which is like pretending that your sample size is larger than it really 
is. Doing so would give you a smaller P-value than is true and would 
make you more likely to incorrectly reject Hoo when it’s true (i.e., 
make a Type I error). Of course, “rounding down” to a smaller df will 
give you a larger P-value than is true, which makes you more likely to 
commit a Type II error! 


The next example shows how to deal with both of these issues. 


TTT 
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EXAMPLE | Two-sided tests (>) 


Calculating the standardized test statistic and P- 
value 


PROBLEM: Suppose you want to perform a test of HO: p=54o i 
versus Ha: p#5/40 ‘ # ¥ 5 at the a=0.01a = 0.01 significance level. A 
random sample of size n=37n = 37 from the population of interest 
yields x =4.81% = 4.81 and sx=0.3655: = 9-365, Assume that the 
conditions for carrying out the test are met. 


a. Explain why the sample result gives some evidence for the alternative 
hypothesis. 


b. Calculate the standardized test statistic and P-value. 
SOLUTION: 


a. The sample mean is x =4.81£ = 4.81, which is not equal to 5 (as 
suggested by Hala), 


b. t= 4.81 - 50.36537 = - 3.17 Wa 


standardized test statistic=statistic — parameterstandard deviation of 
statistict=x — Osxn 

statistic — parameter 

standardized test statistic = SE citi sana 

standard deviation of statistic 


Zp 
_ 0 


Sx 
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t distribution 
with df = 36 


4 (23 -2 -1 0 1 Z 4 
t=-3.17 t=3.17 


Upper-tail probability p 


df 0.005 0.0025 0.001 
29 2.756 3.038 3.396 
30 2.750 3.030 3.385 
40 2.704 2.971 3.307 

99% 99.5% 99.8% 


Confidence level C 
Using Table B: Because df = 37 - 1 = 364¢f = 37 — 1 = 36 js not 
available on the table, use df = 30¢f = 30, 
pt23.17)P (t = 3.17) is between 0.001 and 0.0025. 


P(t<-3.17 or 23.17) (t S—3.17ort > 3.17) ig between 2(0.001) = 0.002 
2(0.001) = 0.002 and 20,0025) = 0.0052 (0.0025) = 0.005, 


FOR PRACTICE, TRY EXERCISE 69 


Given the limitations of Table B, our advice is to use technology to 
find P-values when carrying out a significance test about a population 
mean. 


21. Technology Corner |COMPUTING P-VALUES 
FROM t DISTRIBUTIONS (>) 


TI-Nspire and other technology instructions are on the book’s website at 
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highschool.bfwpub.com/tps6e. 


You can use the tcdf command on the TI-83/84 to calculate areas under 
a t distribution curve. The syntax is tcdf(lower bound, upper bound, df). 


Let’s use the tcdf command to compute the P-values from the last two 
examples. 


Better batteries: To find P(t > 1.55)? (¢ = 1-55), 


e Press (DISTR) and choose tcdf(.OS 2.55 or later: In 
the dialog box, enter these values: lower: 1.55, upper: 1000, df:14, 


choose Paste, and then press [ENTER|. Older OS: Complete the 
command tcdf(1.55,1000,14) and press [ENTER|, 


NORMAL FLOAT AUTO REAL RADIAN MP nl 


tcdf(1.55,1000.14) 
@.9717235647 


Two-sided test: To find the P-value for the two-sided test with df=36 
df = 36 and t=-3.17¢ = —3-17, use the command tcdf (lower: —1000 
~1000 | upper: -3.17~3-17, df:36) and multiply the result by 2. 


NORMAL FLOAT AUTO REAL RADIAN MP 


tcdf(-1000, -3.17,36)%*2 
@. 9031888065 


CHECK YOUR UNDERSTANDING 


1272 


fy 


», The makers of Aspro brand aspirin want to be sure that their tablets 
contain the right amount of active ingredient (acetylsalicylic acid). So 
they inspect a random sample of 30 tablets from a batch in production. 
When the production process is working properly, Aspro tablets contain 
an average of p=3204 — 320 milligrams (mg) of active ingredient. The 
amount of active ingredient in the 30 selected tablets has mean 319 mg 
and standard deviation 3 mg. 

1. State appropriate hypotheses for a significance test in this setting. 

2. Check that the conditions are met for carrying out the test. 

3. Calculate the standardized test statistic and P-value. 


4. What conclusion would you make? 


Putting It All Together: One-Sample t Test for p 


We have shown you how to complete each of the four steps in a test about 
a population mean. Let’s reflect on how the pieces fit together in a test of 
HO: u=po 0+ = #0, We start by assuming that the null hypothesis is 
true. Because we usually don’t know the population standard deviation o, 
we use the sample standard deviation sx** to estimate it. If we standardize 
the statistic x % by subtracting its mean and dividing by its standard error, 
we get the standardized test statistic. There are three conditions that must 
be met for this formula to be valid—one for each of the three components 


in the formula. 


. The Random condition helps ensure that x —y07 ~ 0 is a good 
estimate for the difference between the true value of p/ and the null 


value LOMO. 
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’. The Normal/Large Sample condition allows us to model the distribution 
of the standardized test statistic t using a ¢ distribution with n-1” ~ 1 
degrees of freedom. Then we can obtain P-values from this distribution 
using Table B or technology. 


3. The 10% condition allows us to use the familiar formula for the standard 
deviation of the sampling distribution of x £ (with sxSz replacing o7) 
when we are sampling without replacement from a finite population. 


Here is a summary of the details for a one-sample t test for a mean. 


ONE-SAMPLE t TEST FOR A MEAN 


Suppose the conditions are met. To test the hypothesis HO: p=0 
Hy: b= #0, compute the standardized test statistic 


t—p 

t= al 
sz 

t=x —pOsxn vn 

Find the P-value by calculating the probability of getting a t statistic 

this large or larger in the direction specified by the alternative 

hypothesis Ha/e in a t distribution with df=n-14f = n — 1, 


H,: > Uo Hy: b< Uo H,: UU Ug 


Now we are ready to test a claim about a population mean. Once again, 
we follow the four-step process. 


EXAMPLE | Healthy streams (>) 


Performing a significance test about p 
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blackestockphoto/Getty Images 


PROBLEM: The level of dissolved oxygen (DO) in a stream or 
river is an important indicator of the water’s ability to support 
aquatic life. A researcher measures the DO level at 15 randomly 

chosen locations along a stream. Here are the results in milligrams per 

liter (mg/l): 


4.53 5.04 o.29 5.23 4.13 5.90 4.83 4.40 
9.42 6.38 4.01 4.66 2.87 o/3 3.00 


An average dissolved oxygen level below 5 mg/l puts aquatic life at 
risk. 


a. Do the data provide convincing evidence at the a=0.05a@ = 0.05 
significance level that aquatic life in this stream is at risk? 


b. Given your conclusion in part (a), which kind of mistake—a Type I 
error or a Type II error—could you have made? Explain what this 
mistake would mean in context. 


SOLUTION: 


a. STATE: We want to test 

HO: p=5/Lo : b=5 

Ha: p<5Ha ae | ete 
where 5 //° the true mean dissolved oxygen (DO) level in the stream, 
using o =5 0.05.0 = 50.05. 


Follow the four-step process! 
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PLAN: One-sample ¢ test for p./ 


There are an infinite number of possible locations along the stream, so it 


isn’t necessary to check the 10% condition. 


e Random: The researcher measured the DO level at 15 randomly 
chosen locations. “ 


e Normal/Large Sample: The histogram looks roughly symmetric and 
shows no outliers. / 


DO level (mg/l) 


Because the histogram shows no strong skewness or outliers in the 
sample, it is plausible that the population distribution of dissolved 
oxygen levels in the stream is Normal. 


Enter the data into your calculator to make a graph and to calculate 1- 
Var Stats. 


DO: 
e x 47710 sx=0.93962 = 4.771, Ss, = 0.9396 
ip 4.771-5 —_9.94 


0.9396 


e t=4.771 — 50.939615 =-0.94 V15 
e P-value:df=15-1=14P — value: df= 15-1=14 


The sample result gives some evidence in favor of Hg because x = 


4.771 <5. 
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Using Table B: P-value is between 0.15 and 0.20. 

Using technology: tcdf(lower: -1000~1000, upper: -0.94—-9-94, df: 
14) = 0.1816 

CONCLUDE: Because the P-value of 0.1816>a = 0.05, 

0.1816 > a = 0.05, we fail to reject HOM0. We don’t have convincing 
evidence that the true mean DO level in the stream is less than 5 mg/l. 


b. Because we failed to reject HO in part (a), we could have made a 
Type II error (failing to reject Hoo when Hao is true). If we did, 
then the true mean dissolved oxygen level pi in the stream is less 
than 5 mg/l, but we didn’t find convincing evidence with our 
significance test. That would imply aquatic life in this stream is at 
risk, but we weren’t able to detect that fact. 


t distribution 
with df = 14 


t =-0.94 


FOR PRACTICE, TRY EXERCISE 73 


Because the t procedures are so common, all statistical software 
packages will do the calculations for you. Figure 9.10 shows the output 
from Minitab for the one-sample t test in the preceding example. Note that 
the results match! 


G{ Session 


One-Sample T: DO (mg/L) 


Test of mu = 5 vs < 5 


Variable 
DO (mg/L) 


L277 


FIGURE 9.10 Minitab output for the one-sample t test from the dissolved oxygen 
example. 


You can also use your calculator to carry out a one-sample t test. But 
be sure to read the AP® Exam Tip at the end of the Technology Corner 
below. 


22. Technology Corner |PERFORMING A ONE- 
SAMPLE t TEST FOR A MEAN (>) 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


You can perform a one-sample t test using either raw data or summary 
Statistics on the TI-83/84. Let’s use the calculator to carry out the test of 
HO: p=5/40 : # = 5 versus Ha: p<5/40 ‘ # < 5 from the dissolved 
oxygen example. Start by entering the sample data in L1. Then, to do 
the test: 


e Press [STAT], choose TESTS and T-Test. 


e Adjust your settings as shown. 

If you select “Calculate,” the screen below left appears. 

The standardized test statistic is t=-0.94¢ =—9-94 and the P-value is 
0.1809. 


NORMAL FLOAT AUTO REAL RADIAN MP nl 


Inet: Bem) Stats 
He:5S 


Frea:1 

Li: #Ue He 
Color: i=l 
Calculate Draw 


If you specify “Draw,” you see a t distribution curve (df=14) 
(df = 14) with the lower tail shaded, along with the standardized test 
statistic and P-value. 
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u<S 
t=-@. 9425562016 
P=0. 1809448972 
x=4. 771333333 
Sx=@. 9395961645 
n=15 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


pP=0.1809 


Note: If you are given summary Statistics instead of the original data, 
you would select the option “Stats” instead of “Data” in the first screen 
and enter the summary statistics. 


AP® EXAM TIP 


Remember: If you give just calculator results with no work, and one or 
more values are wrong, you probably won’t get any credit for the “Do” 


step. If you opt for the calculator- only method, name the procedure 
(one-sample t test for 1) and report the standardized test statistic (t= 
0,94), (¢ =—0.94), degrees of freedom (df=14),(4f = 14), and P-value 
(0.1809). 


CHECK YOUR UNDERSTANDING 


A teacher suspects that students at his school are getting less than 
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the recommended 8 hours of sleep a night, on average. To test his belief, 
the teacher asks a random sample of 28 students, “How much sleep did 
you get last night?” Here are the data (in hours): 


6 8 oo © ©) 6.5 6 7 2 4 3 4 
on) 11 Ge 2. or 6 10 7 8 4.5 eed 


Do these data provide convincing evidence at the a=0.05a = 0.05 
significance level in support of the teacher’s suspicion? 


Two-Sided Tests and Confidence Intervals 


You learned in Section 9.2 that a confidence interval gives more 

information than a significance test does—it provides the entire set of 

plausible values for the parameter based on the data. The connection 

between two-sided tests and confidence intervals is even stronger for 

means than it was for proportions. That’s because both inference methods 

for means use the standard error of x = in the calculations: 

standardized test statistic:t=x —Osxn confidence interval:x +t*sxn 

Lt — pt 


: = - 2 
confidence interval a fm =e —- 


standardized test statistic : t = 


The link between two-sided tests and confidence intervals for a 
population mean allows us to make a conclusion directly from a 
confidence interval. 


e If a 95% confidence interval for p“ does not capture the null value p10 
#0, we can reject HO: u=po/4 0: = F0 in a two-sided test at the a=0.05 
a = 0.05 significance level. 


e If a 95% confidence interval for p“ captures the null value pO, then 
we should fail to reject HO: u=po 0+ = #0 in a two-sided test at the 
a=0.05a = 0.05 significance level. 


The same logic applies for other confidence levels, but only for a two- 
sided test. 
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EXAMPLE | Juicy pineapples >» 


Confidence intervals and two-sided tests 


Maks Narodenko 
/Shutterstock.com 


PROBLEM: At the Hawaii Pineapple Company, the mean weight of 
the pineapples harvested from one large field was 31 ounces last year. A 
different irrigation system was installed in this field after the growing 
season. Managers wonder if this change will affect the mean weight of 
future pineapples grown in the field. To find out, they select and weigh 
a random sample of 50 pineapples from this year’s crop. 


a. State an appropriate pair of hypotheses for a significance test in this 
setting. Be sure to define the parameter of interest. 


b. Check conditions for performing the test in part (a). 


c. A 95% confidence interval for the mean weight of all pineapples 
grown in the field this year is (31.255, 32.616). Based on this 
interval, what conclusion would you make for a test of the hypotheses 
in part (a) at the a=0.05e = 0.05 significance level? 


d. Can we conclude that the different irrigation system caused a change 
in the mean weight of pineapples produced? Explain your answer. 


SOLUTION: 


a. We want to test 
HO: p = 3140: 4 = 31 
Ha: p#31Ha : wx 31 
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where pt = the true mean weight (in ounces) of all pineapples grown 
in the field this year. 


o Random: Random sample of 50 pineapples from this year’s crop. 
"A 
o 10%: It is safe to assume that 50<10,29 < 10, of all pineapples 
in a large field. 


© Normal/Large Sample:n = 50230." = 90 = 30./ 


c. The 95% confidence interval does not include 31 as a plausible 
value, so we would reject H0O//0. We have convincing evidence that 
the true mean weight of all pineapples grown this year is not 31 
ounces. 


d. No; this was not a randomized comparative experiment, so we 
cannot infer causation. It is possible that other things besides the 
irrigation system changed from last year’s growing season. Maybe 
the weather was different this year, and that’s why the pineapples 
have a different mean weight than last year. 


Recall from Chapter 4 that only well-designed experiments allow us to 


establish cause-and-effect conclusions. 


FOR PRACTICE, TRY EXERCISE 81 


Minitab output for a significance test and confidence interval based on 
the pineapple data is shown here. Notice that the weights of the 50 
randomly selected pineapples from this year’s crop had a mean of 
x =31.935¢ = 31.935 ounces and a standard deviation of sx=2.394 
8x = 2.394 ounces. The standardized test statistic is 
THQ 31.935—31 


pe = = 2:76 
‘Sz 2.394 


t=x —pOsxn=31.935-312.39450=2.76 he /50 


The corresponding P-value from a ¢t distribution with 49 degrees of 
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freedom is 0.008. Because the P-value of 0.008<a=0.050.008 < a = 0.05, 
we would reject HO. This is consistent with our decision based on the 
95% confidence interval in the example. 


& Session 


One-Sample T: Weight (oz) 
Test of mu = 31 vs not = 31 


Would a 99% confidence interval for p/ include 31 ounces as a 
plausible value for the parameter? Only if a two-sided test would fail to 
reject HO: u=3140 7oB=31 at a1% significance level. Because the P- 
value of 0.008<a=0.010.008 < a = 0.01, we would reject HOM#0. We once 
again have convincing evidence that the mean weight of all pineapples 
produced this year is not 31 ounces. So the 99% confidence interval would 
not contain 31. You can check that the interval is (31.028, 32.842). 


Think About It 


IS THERE A CONNECTION BETWEEN ONE-SIDED TESTS 
AND CONFIDENCE INTERVALS FOR A POPULATION 
MEAN? As you might expect, the answer is yes. But the link is more 
complicated. Consider a one-sided test of HO: u=10H0 : # = 10 versus 
Ha: p>10/40 : # > 10 based on an SRS of 30 observations. With 
df=30-1=29df = 30 — 1 = 29 | Table B says that the test will reject HO 
Ao at a=0.05a = 0.05 if the standardized test statistic t is greater than 
1.699. For this to happen, the sample mean x Z would have to exceed 
10=10H0 = 10 by more than 1.699 standardized units. 

Table B also shows that t*=1.699¢" = 1.699 is the critical value for 
a 90% confidence interval. That is, a 90% confidence interval will 
extend 1.699 standardized units on either side of the sample mean x 7. 
If x Z exceeds 10 by more than 1.699 standardized units, the resulting 
interval will not include 10. And the one-sided test will reject HO: p=10 
Ho : #= 10. There’s the link: our one-sided test at a=0.05a = 0.05 
gives the same conclusion about HO/0 as a 90% confidence interval for 
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| pe. 


CHECK YOUR UNDERSTANDING 


( 


y According to the National Center for Health Statistics, the mean 
systolic blood pressure for males 35 to 44 years of age is 128. The 
health director of a large company wonders if this national average 
holds for the company’s middle-aged male employees. So the director 
examines the medical records of a random sample of 72 male employees 
in this age group and records each of their systolic blood pressure 
readings. 


1. State an appropriate pair of hypotheses for a significance test in this 
setting. Be sure to define the parameter of interest. 


2. Check conditions for performing the test in Question 1. 


3. A 95% confidence interval for the mean systolic blood pressure of all 
35- to 44-year-old male employees at this company is (126.43, 
133.43). Based on this interval, what conclusion would you make for 
a test of the hypotheses in Question 1 at the a=0.05a = 0.05 
significance level? 


The Power of a Test 


Researchers often perform a significance test in hopes of finding 
convincing evidence for the alternative hypothesis. Why? Because Hatta 
States the claim about the population parameter that they believe, suppose, 
or suspect is true. For instance, a drug manufacturer claims that less than 
10% of patients who take its new drug for treating Alzheimer’s disease 
will experience nausea. To test this claim, researchers want to carry out a 


test of 
HO: p=0.10/40 : p = 0.10 
Ha: p<0.104a : p < 0.10 


1284 


where p = the true proportion of patients like the ones in the study who 
would experience nausea when taking the new Alzheimer’s drug. They 
plan to give the new drug to arandom sample of Alzheimer’s patients 
whose families have given informed consent for the patients to participate 
in the study. 


Suppose that the true proportion of subjects like the ones in the 
Alzheimer’s study who would experience nausea after taking the new drug 
is p=0.08P? = 9.08, This means that the alternative hypothesis Ha: p<0.10 
H, : p < 0.10 is true. Researchers would make a Type II error if they 
failed to find convincing evidence for Ha/4« based on the sample data. 
How likely is the significance test to avoid a Type IJ error in this case? We 
refer to this probability as the power of the test. 


DEFINITION Power 


The power of a test is the probability that the test will find 
convincing evidence for H, when a specific alternative value of the 


parameter is true. 


The power of a test is a conditional probability: power = P(reject HO 
Ho | parameter = some specific alternative value). In other words, power is 
the probability that we find convincing evidence the alternative hypothesis 
is true, given that the alternative hypothesis really is true. To interpret the 
power of a test in a given setting, just interpret the relevant conditional 
probability. 

Let’s return to the Alzheimer’s study. Suppose the researchers decide 
to perform a test of HO: p=0.1040 : p = 9.10 versus Ha: p<0.10 
H, : p < 0.10 at the a=0.05a = 0.05 significance level based on data from 
a random sample of 300 Alzheimer’s patients. Advanced calculations 
reveal that the power of this test to detect p=0.08P = 9-98 js 0.29. 
Interpretation: If the true proportion of Alzheimer’s patients like these 
who would experience side effects when taking the new drug is p=0.08 
p = 0.08 | there is a 0.29 probability that the researchers will find 
convincing evidence for Ha: p<0.104a :p < 0.10, 
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EXAMPLE | Can we tell if the new batteries last 
longer? |») 


Interpreting the power of a test 


=e | aa 
ey 
eo 


tuahlensa/Shutterstock.com 
PROBLEM: Let’s return one last time to the company that developed a 
new deluxe AAA battery that is supposed to last longer than its regular 
AAA battery. Based on years of experience, the company’s regular 
AAA batteries last for 30 hours of continuous use, on average. The 
company plans to select an SRS of 50 deluxe AAA batteries and use 
them continuously until they are completely drained. Then it will 
perform a test at the a=0.05a = 0.05 significance level of 


HO: p=30/40 : # = 30 
Ha: p>30Ha:# > 30 


where p= ~ the true mean lifetime (in hours) of the deluxe AAA 
batteries. The new batteries are more expensive to produce, so the 
company would like to be convinced that they really do last longer. The 
power of the test to detect that p=31/ — 31 hours is 0.762. Interpret this 
value. 


SOLUTION: 


If the true mean lifetime of the company’s deluxe AAA batteries is 1=31 
= 31 hours, there is a 0.762 probability that the company will find 
convincing evidence for Ha:p+30.Ha : # + 30. 


FOR PRACTICE, TRY EXERCISE 85 
The company in the example has a good chance (power=0.762) 
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(power = 0.762) of rejecting HOo if the true mean lifetime of its deluxe 
AAA batteries is 31 hours. That is, P(reject HO| p=31)=0.762 

P (rejectHo| » = 31) = 0.762 what’s the probability that the company 
makes a Type II error in this case? It’s P(fail to reject 

HO] p=31)=1-0.762=0.238/40| # = 31) = 1 — 0.762 = 0.238 We can 
generalize this relationship between the power of a significance test and 
the probability of a Type II error. 


RELATING POWER AND TYPE IT ERROR 


The power of a test to detect a specific alternative parameter value is 


related to the probability of a Type II error for that alternative: 


Power=|—P(Type Ilerror) and P(Type I error)=l—Power 
Power =1— P(Typellerror) and P(TypelIlerror) = 1 — Power 


The power of the test in the Alzheimer’s study to detect p=0.08? = 9-08 js 
only 0.29. In other words, researchers have a 1-0.29=0.711 — 0.29 = 0.71 
probability of making a Type IJ error by failing to find convincing 
evidence for Ha: p<0.104« : p < 9.10 when p=0.08P = 9-98. What can 
researchers do to decrease the probability of making a Type II error and 
increase the power of the test? 


WHAT AFFECTS THE POWER OF A TEST? Here is an activity that 


will help you answer this question. 


ACTIVITY Powerful batteries 


In this activity, we will use an applet to investigate the factors that affect 
the power of a test. Let’s return to the battery study from the preceding 
example. Recall that the company wants to perform a test of 


HO: p=30/40 : # = 30 
Ha: u>30Ha : pe > 30 


where p= ~ the true mean lifetime (in hours) of the deluxe AAA 
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batteries. The company estimates that the standard deviation of the 
lifetimes for its new batteries is o=30 = 3 hours based on a pilot study. 


1. Go to highschool.bfwpub.com/tps6e and launch the Statistical Power 
applet. Enter the values HO: u=30/40 >= 30 Ha: u>304a 7p > 30. 
o=30 = 3, n=50, a=0.05" = 90,a = 0.05 and alternate H=31h — 31 
Confirm that the power of the test is 0.762. 

Ho: u= 30 
H;. ® u>30 
J u< 30 
> uz30 
o=§3 
n= 50 
Alpha (a): 0.05 
— a 28.303 28.852 29.401 29.951 30.500 31.049 31.599 32.148 32.697 


alt. u= 31 Power = 0.762 


RESET 


28.303 28.852 29.401 29.951 30.500 31.049 31.599 32.148 32.697 


e The top curve shows the sampling distribution of the sample mean 
x £ for random samples of size n=50" = 50 when HO: p=30 
Ho : 4 = 30 is true. We refer to this as the null distribution. A 
value of x = that falls along the horizontal axis within the yellow 
region would lead us to reject HO: u=3040 p= 30. 


e The bottom curve shows the sampling distribution of the sample 
mean x £ for random samples of size n=50n = 50 when p=31 
= 31. We refer to this as the alternative distribution. The red 
region represents the power of the test. A value of x = that falls 
along the horizontal axis within the red region would lead to a 
correct rejection of HO: p=30/0 : # = 30. 


2. Sample size: Change the sample size from n=50n = 50 to n=100 
n = 100, What happens in the top and bottom panels of the applet? 
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Does the power increase or decrease? Explain why this makes sense. 
3. Significance level: Reset the sample size to n=507 = 50, 


a. Using the slider, change the significance level to a=0.010 = 0.01, 
What happens in the bottom panel of the applet? Does the power 
increase or decrease? How about the probability of a Type II error? 


b. Make a guess about what will happen if you change the 
significance level to a=0.10a = 0.10. Use the applet to test your 
conjecture. 


c. Explain what the results in parts (a) and (b) tell you about the 
relationship between Type I error probability, Type II error 
probability, and power. 


4. Difference between null and alternative parameter value: Reset the 
sample size to n=507 = 50 and the significance level to a=0.05 
a = 0.05, Will the company be more likely to detect that the true 
mean lifetime of its new batteries is greater than p=30 — 30 if HW=31 
# = 31 hours or H=32H = 32 hours? Use the applet to test your 
conjecture. 


As Step 2 of the activity confirms, we get better information about the 
true average lifetime of the company’s new AAA batteries from a random 
sample of 100 batteries than from a random sample of 50 batteries. The 
power of the test to detect that p=31/ — 31 increases from 0.762 to 0.954 
when the sample size increases from n=50" = 50 to n=100n = 100, 


Will it be easier to reject HOM0 if a=0.05e = 0.05 or a=0.10? 

a = 0.10? When ac is larger, it is easier to reject HO0 because the P- 
value doesn’t need to be as small. Step 3 of the activity shows that the 
power of the test to detect that p=31/ — 31 increases from 0.762 to 0.859 
when the significance level increases from 0.05 to 0.10. 

Figure 9.11 reveals an important link between Type I and Type II error 
probabilities. Because P(Type I error)=o"” (Typelerror) = a increasing 
the significance level increases the chance of making a Type I error. As the 
applet shows, this change also increases the power of the test. Because 
P(Type II error)=1—-Power!’ (Type Ilerror) = 1 — Power, higher power 
means a smaller chance of making a Type II error. So increasing the Type 
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I error probability a@ decreases the Type IJ error probability. By the same 
logic, decreasing the chance of a Type I error results in a higher chance of 
a Type IJ error. 

u= 30 

@® »w>30 


@ u<30 
> we30 


o=3 P(Type I error) = 0.10 


n= 50 
Alpha (a): 0.1 


S 
alt. u= 31 Power = 0.859 


28.303 28.852 29.401 29.951 30.500 31.049 31.599 32.148 32.697 


P| A TE 
P(Type I error) = 1 — 0.859 
=0.141 


28.303 28.852 29.401 29.951 30.500 31.049 31.599 32.148 32.697 


Figure 9.11 
Starnes & Tabor, The Practice of Statistics, 6e, © 2018 Bedford, Freeman & Worth High School Publishers 


FIGURE 9.11 The connection between Type I error, Type II error, and power. 
Because P(Type I error)=a, 


P (Type l error) = a, 


as the significance level a 
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increases: P (Type I error) increases, P (Type II error) decreases, and power 
increases. 


Step 4 of the activity shows that it is easier to detect large differences 
between the null and alternative parameter values than small differences. 
When n=50n = 50 and a=0.05a = 0.05, the power of the test to detect 
1=32H = 32 is 0.999, while the power of the test to detect p=31/' = 31 is 
only 0.762. 

The difference between the null parameter value and the specific 
alternative parameter value of interest is often referred to as the effect size. 
For the batteries’ setting with a null value of p=304 — 30 hours, it is much 
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easier to detect if the true mean lifetime of the company’s new AAA 
batteries is p=324 — 32 hours (an effect size of 2 hours) than if the true 
mean lifetime is p=31/ — 31 hours (an effect size of 1 hour). 


INCREASING THE POWER OF A SIGNIFICANCE TEST 


The power of a significance test to detect an alternative value of the 
parameter when HO/40 is false and Ha/« is true, based on a random 
sample of size n and significance level aa, will be larger when: 


e The sample size n is larger. 
e The significance level aa is larger. 


e The null and alternative parameter values are farther apart. 


In addition to these three factors, we can also gain power by making 
wise choices when collecting data. For example, using blocking in an 
experiment or stratified random sampling can greatly increase the power of 
a test in some circumstances. In an experiment, power will increase if you 
have fewer sources of variability by keeping other variables constant. 


EXAMPLE | Can we tell if the new drug reduces 
nausea? |») 
What affects the power of a test 


Tom Grill/Getty Images 


PROBLEM: The researchers in the Alzheimer’s experiment want to 
test the drug manufacturer’s claim that fewer than 10% of patients who 
take its new drug for treating Alzheimer’s disease will experience 
nausea. That is, they want to carry out a test of 
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HO: p=0.10/40 ‘po = 0.10 
Ha: p<0.104a : p < 0.10 


where p=? ~ the true proportion of patients like the ones in the study 
who would experience nausea when taking the new Alzheimer’s drug. 
Earlier, we mentioned that the power of the test to detect p=0.08 

p = 0.08 ysing a random sample of 300 patients and a significance level 
of a=0.05a = 0.05 js 0.29. 


Determine whether each of the following changes would increase or 
decrease the power of the test. Explain your answers. 


a. Use a=0.01a = 0.01 instead of a=0.05a = 0.05, 
b. If the true proportion is p=0.06? = 9-96 instead of p=0.08P = 9-08 
c. Use n=200n = 200 instead of n=300n = 300, 


SOLUTION: 


a. Decrease; using a smaller significance level makes it harder to reject 
Hoo when Ha/« is true. 


b. Increase; it is easier to detect a bigger difference between the null 
and alternative parameter value. 


c. Decrease; a smaller sample size gives less information about the true 
proportion p. 


FOR PRACTICE, TRY EXERCISE 87 


In Chapter 8 , you learned how to calculate the sample size needed for 
a desired margin of error in a confidence interval. When researchers plan 
to use a significance test to analyze their results, they will often calculate 
the sample size needed for a desired power. For a specific study design, 
the sample size required depends on three factors: 


. Significance level. How much risk of a Type I error—rejecting the null 
hypothesis when HO/0 is actually true—are we willing to accept? If a 
Type I error has serious consequences, we might opt for a=0.01 
a = 0.01, Otherwise, we should choose a=0.05a@ = 0.05 or a=0.10 
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a = 0.10, Recall that using a higher significance level would decrease 
the Type IJ error probability and increase the power. 


’. Effect size. How large a difference between the null parameter value and 
the actual parameter value is important for us to detect? 


3. Power. What probability do we want our study to have to detect a 
difference of the size we think is important? Most researchers insist on a 
power of at least 0.80 for their significance tests. 


Calculating power by hand is possible but unpleasant. It’s better to let 
technology do the work for you. There are many applets available to help 
determine sample size and to calculate the power of a test. 


Sometimes budget constraints get in the way of achieving high power. It can be 


expensive to collect data from a large enough sample of individuals to give a 


significance test the power that researchers desire. 


CHECK YOUR UNDERSTANDING 


A 


, 


y Can a six-month exercise program increase the total body bone 
mineral content (TBBMC) of young women? A team of researchers is 
planning a study to examine this question. The researchers would like to 
perform a test of 


HO: u=0/40 7p=0 

Ha: p>0Ha : H > 0 
where i is the true mean percent change in TBBMC during the exercise 
program. 


1. The power of the test to detect a mean increase in TBBMC of 1% 
using a=0.05e = 0.05 and n=25n = 25 subjects is 0.80. Interpret this 
value. 


2. Find the probability of a Type I error and the probability of a Type II 
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error for the test in Question 1. 


3. Describe two ways that researchers could increase the power of the 
test in Question 1. 


Using Tests Wisely 


Significance tests are widely used in reporting the results of research in 
many fields. New drugs require significant evidence of effectiveness and 
safety. Courts ask about statistical significance in hearing discrimination 
cases. Marketers want to know whether a new ad campaign significantly 
outperforms the old one, and medical researchers want to know whether a 
new therapy performs significantly better. In all these uses, statistical 
significance is valued because it points to an effect that is unlikely to occur 
simply by chance. 

Carrying out a significance test is often quite simple, especially if you 
use technology. Using tests wisely is not so simple. Here are some points 
to keep in mind when using or interpreting significance tests. 


STATISTICAL SIGNIFICANCE AND PRACTICAL 
IMPORTANCE When a null hypothesis of no effect or no difference can 
be rejected at the usual significance levels (a=0.10(4 = 0.10 oF @=0.05 
a = 0.05 or a=0.01)% = 0.01) | there is convincing evidence of a 
difference. But that difference may be very small. When large samples are 
used, even tiny deviations from the null hypothesis will be significant. 
Suppose we’re testing a new antibacterial cream, “Formulation NS,” 
on a small cut made on the inner forearm. We know from previous 
research that with no medication, the mean healing time (defined as the 
time for the scab to fall off) is 7.6 days. The claim we want to test here is 
that Formulation NS speeds healing. So our hypotheses are 


HO: u=7.6Ho : p= 7.6 
Ha: p<7.64a : H < 7.6 


where p= ~ the true mean healing time (in days) in the population of 
college students whose cuts are treated with Formulation NS. We will use 
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a 5% significance level. 


A random sample of 250 college students give informed consent to 
participate in a study and apply Formulation NS to their wounds. The 
mean healing time for these subjects is x =7.5% = 7.5 days and the 
standard deviation is sx=0.98+ = 9-9 day. Note that the conditions for 
performing a one-sample t test for p/ are met. We carry out the test and 
find that t=-1.76¢ = —1.76 and P-value =0.04= 0.04 with df=249df = 249 
. Because 0.04 is less than a=0.05a = 0.05, we reject HOM0. We have 
convincing evidence that Formulation NS reduces the average healing 
time. However, this result is not practically important. Having your scab 
fall off one-tenth of a day sooner is no big deal! 


Some people say “not clinically significant” when they mean not practically 


important. 


Remember the wise saying: oOo Statistical significance is not the 


same thing as practical importance. The remedy for attaching too much 
importance to statistical significance is to pay attention to the data as well 
as to the P-value. The foolish user of statistics who feeds the data to a 
calculator or computer without exploratory analysis will often be 
embarrassed. Plot your data, and examine them carefully. Is the 
difference you are seeking visible in your graphs? If not, ask yourself 
whether the difference is large enough to be practically important. Are 
there outliers or other departures from a consistent pattern? A few outlying 
observations can produce highly significant results if you blindly apply 
common significance tests. Outliers can also destroy the significance of 
otherwise convincing data. 

To help evaluate practical importance, give a confidence interval for 
the parameter in which you are interested. A confidence interval provides a 
set of plausible values for the parameter, rather than simply asking if the 
observed result is too surprising to occur by chance alone when Hoo is 
true. Confidence intervals are not used as often as they should be, whereas 
significance tests are perhaps overused. 


BEWARE OF MULTIPLE ANALYSES Statistical significance ought to 
mean that you have found a difference that you were looking for. The 
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reasoning behind statistical significance works well if you decide what 
difference you are seeking, design a study to search for it, and use a 
significance test to weigh the evidence you gather. In other settings, 
significance may have little meaning. Here’s one such example. 


Might the radiation from cell phones be harmful to users? Many 
studies have found little or no connection between using cell phones and 
various illnesses. Here is part of a news account of one study: 


A hospital study that compared brain cancer patients and a 
similar group without brain cancer found no statistically 
significant difference in cell-phone use for the two groups. 
But when 20 distinct types of brain cancer were considered 
separately, a significant difference in cell-phone use was 
found for one rare type. Puzzlingly, however, this risk 
appeared to decrease rather than increase with greater 
mobile phone use.2& 

Think for a moment. Suppose that the 20 null hypotheses for these 20 
significance tests are all true. Then each test has a 5% chance of being 
significant at the 5% level. That’s what a=0.05a = 0.05 means: results this 
extreme occur only 5% of the time just by chance when the null hypothesis 
is true. We expect about 1 of 20 tests to give a significant result just by 
chance. Running one test and reaching the a=0.05a = 0.05 level is 
reasonably good evidence that you have found something; running 20 tests 
and reaching that level only once is not. 

Searching data for patterns is certainly legitimate. Performing every 
conceivable significance test on a data set with many variables until you 
obtain a statistically significant result is not. This unfortunate practice is 
known by many names, including data dredging and P-hacking. 


For more on the pitfalls of multiple analyses, do an Internet search for the XKCD 


comic about jelly beans causing acne. 


Section 9.3 Summary 


1297 


The conditions for performing a significance test of HO: p=y0 
Hy : = Mo are: 


= Random: The data come from a random sample from the population 
of interest. 


: 10%: When sampling without replacement, n<0.10Nn < 0.10N., 


» Normal/Large Sample: The population has a Normal distribution or 
the sample size is large (n>30)(” > 30) Tf the population distribution 

has unknown shape and n<30n < 30, use a graph of the sample data to 

assess the Normality of the population. Do not use t procedures if the 


graph shows strong skewness or outliers. 


The standardized test statistic for a one-sample t test for a mean is 


t=x —yOsxn ve 


When the Normal/Large Sample condition is met, the distribution of this 
standardized test statistic can be modeled by a t distribution with n-1 

n — I degrees of freedom. You can use Table B or technology to find 
the P-value. 


Confidence intervals provide additional information that significance 
tests do not—namely, a set of plausible values for the parameter p/. A 
95% confidence interval for yp“ gives consistent results with a two-sided 
test of HO: p=p00 : # = Ho at the a=0.05a = 0.05 significance level. 


The power of a test is the probability that the test will find convincing 
evidence for Ha/4« when a specific alternative value of the parameter is 
true. In other words, the power of a test is the probability of avoiding a 


Type II error. For a specific alternative, Power=1—P(Type II error) 
Power = 1 — P (Type Il error) 


. 


We can increase the power of a significance test by increasing the 
sample size, increasing the significance level, or increasing the 
difference that is important to detect between the null and alternative 
parameter values (known as the effect size). Wise choices when 
collecting data, such as controlling for other variables and blocking in 
experiments or stratified random sampling, can also increase power. 
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e Fora specific study design, there are three factors that influence the 
sample size required for a statistical test: significance level, effect size, 
and the desired power of the test. 

e Very small differences can be highly significant (small P-value) when a 
test is based on a large sample. A statistically significant result may not 
be practically important. 

e Many tests run at once will probably produce some significant results by 
chance alone, even if all the null hypotheses are true. Beware of P- 
hacking. 


9.3 Technology Corners (» 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 
21. Computing P-values from t distributions Page 590 


22. Performing a one-sample t test for a mean Page 594 


Section 9.3 Exercises 


65. 586 ») Attitudes The Survey of Study Habits and Attitudes 
bs No 


(SSHA) is a psychological test with scores that range from 0 to 200. 
The mean score for U.S. college students is 115. A teacher suspects 
that older students have better attitudes toward school. She gives the 
SSHA to an SRS of 45 students from the more than 1000 students at 
her college who are at least 30 years of age. The teacher wants to 
perform a test at the a=0.05a@ = 0.05 significance level of 


HO: w=115Ho: pe=115 
Ha: p>115Ha: w > 115 


where p= — the mean SSHA score in the population of students at 
her college who are at least 30 years old. Check if the conditions for 
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66. 


67. 


68. 


performing the test are met. 


Candy! A machine is supposed to fill bags with an average of 19.2 

ounces of candy. The manager of the candy factory wants to be sure 
that the machine does not consistently underfill or overfill the bags. 
So the manager plans to conduct a significance test at the a=0.10 

a = 0.10 significance level of 


HO: p=19.240: # = 19.2 
Ha: p#19.2Ha: w # 19.2 


where p= ~ the true mean amount of candy (in ounces) that the 
machine put in all bags filled that day. The manager takes a random 
sample of 75 bags of candy produced that day and weighs each bag. 
Check if the conditions for performing the test are met. 


Battery life A tablet computer manufacturer claims that its batteries 
last an average of 10.5 hours when playing videos. The quality- 
control department randomly selects 20 tablets from each day’s 
production and tests the fully charged batteries by playing a video 
repeatedly until the battery dies. The quality-control department will 
discard the batteries from that day’s production run if they find 
convincing evidence that the mean battery life is less than 10.5 hours. 
Here are a dotplot and summary statistics of the data from one day: 


aan ieee T ‘ne ne T 
10 105 1 5 12 125 13 «135 «14 
Battery life (h) 
n Mean SD Min Qi Med Q3 Max 
20 11.07 1.097 10 10.3 10.6 11.85 13.9 


a. State appropriate hypotheses for the quality-control department to 
test. Be sure to define your parameter. 


b. Check if the conditions for performing the test in part (a) are met. 


Paying high prices? A retailer entered into an exclusive agreement 
with a supplier who guaranteed to provide all products at competitive 
prices. To be sure the supplier honored the terms of the agreement, 
the retailer had an audit performed on a random sample of 25 
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invoices. The percent of purchases on each invoice for which an 
alternative supplier offered a lower price than the original supplier 
was recorded.“ For example, a data value of 38 means that the price 
would be lower with a different supplier for 38% of the items on the 
invoice. A histogram and some numerical summaries of the data are 
shown here. The retailer would like to determine if there is 
convincing evidence that the mean percent of purchases for which an 
alternative supplier offered lower prices is greater than 50% in the 
population of this company’s invoices. 


Frequency 


0) 20 40 60) 80 100 120 


Percent lower 


n Mean SD Min Qi Med Q3 Max 
25 7779 32.6768 0 68 100 100 100 


a. State appropriate hypotheses for the retailer’s test. Be sure to 
define your parameter. 


b. Check if the conditions for performing the test in part (a) are met. 


69. pg 589 |») Two-sided test Suppose you want to perform a test of 


HO: u=6440 : H = 64 versus Ha: 1464 : HF 64 at the a=0.05 

a = 0.05 significance level. A random sample of size n=25n = 25 
from the population of interest yields x =62.8% = 62.8 and sx=5.36 
Sx = 9.36, Assume that the conditions for carrying out the test are 
met. 


a. Explain why the sample result gives some evidence for the 
alternative hypothesis. 


b. Calculate the standardized test statistic and P-value. 
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70. One-sided test Suppose you want to perform a test of HO: p=5 
Ho > = 5 versus Ha: p<544e ? # <5 at the a=0.05a = 0.05 
significance level. A random sample of size n=20n = 20 from the 
population of interest yields x =4.7% = 4.7 and sx=0.7482 = 9.74, 
Assume that the conditions for carrying out the test are met. 


a. Explain why the sample result gives some evidence for the 
alternative hypothesis. 


b. Calculate the standardized test statistic and P-value. 


71. Attitudes In the study of older students’ attitudes from Exercise 65, 
the sample mean SSHA score was 125.7 and the sample standard 
deviation was 29.8. 


a. Calculate the standardized test statistic. 
b. Find and interpret the P-value. 


c. What conclusion would you make? 


72. Candy! In the study of the candy machine from Exercise 66, the 
sample mean weight for the bags of candy was 19.28 ounces and the 
sample standard deviation was 0.81 ounce. 


a. Calculate the standardized test statistic. 
b. Find and interpret the P-value. 
c. What conclusion would you make? 


73. pg 592 |) Construction zones Every road has one at some point 


—construction zones that have much lower speed limits. To see if 
drivers obey these lower speed limits, a police officer uses a radar 
gun to measure the speed (in miles per hour, or mph) of a random 
sample of 10 drivers in a 25 mph construction zone. Here are the data: 


a7 33 32 21 30 30 20) 25 27 34 
a. Is there convincing evidence at the a=0.010 = 0.01 significance 
level that the average speed of drivers in this construction zone is 


greater than the posted speed limit? 
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74. 


70. 


76. 


b. Given your conclusion in part (a), which kind of mistake—a Type 
I error or a Type II error—could you have made? Explain what this 
mistake would mean in context. 


Ending insomnia A study was carried out with a random sample of 
10 patients who suffer from insomnia to investigate the effectiveness 
of a drug designed to increase sleep time. The following data show 
the number of additional hours of sleep per night gained by each 
subject after taking the drug.t® A negative value indicates that the 
subject got less sleep after taking the drug. 


1.9 0.8 1.1 0.1 -0.1 4.4 3.0 1.6 4.6 3.4 


a. Is there convincing evidence at the a=0.010 = 0.01 significance 
level that the average sleep increase is positive for insomnia 
patients when taking this drug? 


b. Given your conclusion in part (a), which kind of mistake—a Type 
I error or a Type II error—could you have made? Explain what this 
mistake would mean in context. 


Reading level A school librarian purchases a novel for her library. 
The publisher claims that the book is written at a fifth-grade reading 
level, but the librarian suspects that the reading level is lower than 
that. The librarian selects a random sample of 40 pages and uses a 
standard readability test to assess the reading level of each page. The 
mean reading level of these pages is 4.8 with a standard deviation of 
0.8. Do these data give convincing evidence at the a=0.05a@ = 0.05 
significance level that the average reading level of this novel is less 
than 5? 


How much juice? One company’s bottles of grapefruit juice are 
filled by a machine that is set to dispense an average of 180 milliliters 
(ml) of liquid. The company has been getting negative feedback from 
customers about underfilled bottles. To investigate, a quality-control 
inspector takes a random sample of 40 bottles and measures the 
volume of liquid in each bottle. The mean amount of liquid in the 
bottles is 179.6 ml and the standard deviation is 1.3 ml. Do these data 
provide convincing evidence at the a=0.05a@ = 0.05 significance level 
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‘yp 


78. 


73. 


that the machine is underfilling the bottles? 


Pressing pills A drug manufacturer forms tablets by compressing a 
granular material that contains the active ingredient and various 
fillers. The hardness of a sample from each batch of tablets produced 
is measured to control the compression process. The target value for 
the hardness is p=11.54 = 11.5. The hardness data for a random 
sample of 20 tablets from one large batch are 


11.627 11.613 11.493 11.602 11.360 
11.374 11.592 11.458 11.552 11.463 
11.383 11.715 11.485 11.509 11.429 
11.477 11.570 11.623 11.472 11.531 


Is there convincing evidence at the 5% level that the mean hardness 
of the tablets in this batch differs from the target value? 


Jump around Student researchers Haley, Jeff, and Nathan saw an 
article on the Internet claiming that the average vertical jump for 
teens was 15 inches. They wondered if the average vertical jump of 
students at their school differed from 15 inches, so they obtained a list 
of student names and selected a random sample of 20 students. After 
contacting these students several times, they finally convinced them 
to allow their vertical jumps to be measured. Here are the data (in 
inches): 


11.0 115 125 265 150 125 220 150 13.5 12.0 
23.00 19.00 155 21.0 125 23.0 20.0 8.5 25.5 20.5 


Do these data provide convincing evidence at the a=0.10a = 0.10 
level that the average vertical jump of students at this school differs 
from 15 inches? 


Pressing pills Refer to Exercise 77. 


a. Construct and interpret a 95% confidence interval for the true 
hardness pt of the tablets in this batch. Assume that the conditions 
for inference are met. 


b. Explain why the interval in part (a) is consistent with the result of 
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the test in Exercise 77. 


80. Jump around Refer to Exercise 78. 


a. Construct and interpret a 90% confidence interval for the true 
mean vertical jump pi (in inches) of the students at Haley, Jeff, 
and Nathan’s school. Assume that the conditions for inference are 
met. 


b. Explain why the interval in part (a) is consistent with the result of 
the test in Exercise 78. 


81. pg 595 |) Fast connection? How long does it take for a chunk of 


information to travel from one server to another and back on the 
Internet? According to the site internettrafficreport.com, the average 
response time is 200 milliseconds (about one-fifth of a second). 
Researchers wonder if this claim is true, so they collect data on 
response times (in milliseconds) for a random sample of 14 servers in 
Europe. A graph of the data reveals no strong skewness or outliers. 


a. State an appropriate pair of hypotheses for a significance test in 
this setting. Be sure to define the parameter of interest. 


b. Check conditions for performing the test in part (a). 


c. The 95% confidence interval for the mean response time is 158.22 
to 189.64 milliseconds. Based on this interval, what conclusion 
would you make for a test of the hypotheses in part (a) at the 5% 
significance level? 


d. Do we have convincing evidence that the mean response time of 
servers in the United States is different from 200 milliseconds? 
Justify your answer. 


82. Water! A blogger claims that U.S. adults drink an average of five 8- 
ounce glasses (that’s 40 ounces) of water per day. Researchers 
wonder if this claim is true, so they ask a random sample of 24 U.S. 
adults about their daily water intake. A graph of the data shows a 
roughly symmetric shape with no outliers. 


a. State an appropriate pair of hypotheses for a significance test in 
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83. 


84. 


85. 


this setting. Be sure to define the parameter of interest. 


b. Check conditions for performing the test in part (a). 


c. The 90% confidence interval for the mean daily water intake is 
30.35 to 36.92 ounces. Based on this interval, what conclusion 
would you make for a test of the hypotheses in part (a) at the 10% 
significance level? 


d. Do we have convincing evidence that the amount of water U.S. 
children drink per day differs from 40 ounces? Justify your answer. 


Tests and confidence intervals The P-value for a two-sided test of 
the null hypothesis HO: p=10 is 0.0640 : # = 10 is 0.06. 


a. Does the 95% confidence interval for p/ include 10? Why or why 
not? 


b. Does the 90% confidence interval for p/ include 10? Why or why 


not? 


Tests and confidence intervals The P-value for a two-sided test of 
the null hypothesis HO: p=15 is 0.0340 : # = 15 is 0.03, 


a. Does the 99% confidence interval for p/ include 15? Why or why 
not? 


b. Does the 95% confidence interval for p/ include 15? Why or why 


not? 


pg 598 |») Potato chips A company that makes potato chips 


requires each shipment of potatoes to meet certain quality standards. 
If the company finds convincing evidence that more than 8% of the 
potatoes in the shipment have “blemishes,” the truck will be sent back 
to the supplier to get another load of potatoes. Otherwise, the entire 
truckload will be used to make potato chips. To make the decision, a 
supervisor will inspect a random sample of potatoes from the 
shipment. He will then perform a test of HO: p=0.08/40 : p = 0.08 
versus Ha: p>0.08 a! p> 0.08 where p is the true proportion of 
potatoes with blemishes in a given truckload. The power of the test to 
detect that p=0.11? = 9-11, based on a random sample of 500 
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86. 


87. 


88. 


89. 


potatoes and significance level a=0.05a = 0.05, is 0.764. Interpret 
this value. 


Upscale restaurant You are thinking about opening a restaurant and 
are searching for a good location. From the research you have done, 
you know that the mean income of those living near the restaurant 
must be over $85,000 to support the type of upscale restaurant you 
wish to open. You decide to take a simple random sample of 50 
people living near one potential site. Based on the mean income of 
this sample, you will perform a test at the a=0.05a = 0.05 
significance level of HO: p=$85,000/0 : # =$85, 000 
u>$85,000He : # >$85, 000 where uu is the true mean income in the 
population of people who live near the restaurant. The power of the 
test to detect that 1=$86,000" =$86, 000 is 0.64. Interpret this value. 


versus Ha: 


pg 602 |») Powerful potatoes Refer to Exercise 85. Determine if 


each of the following changes would increase or decrease the power 
of the test. Explain your answers. 


a. Change the significance level to a=0.10a = 0.10, 
b. Take a random sample of 250 potatoes instead of 500 potatoes. 


c. The true proportion is p=0.10? = 9-10 instead of p=0.11? = 9-11, 


Restaurant power Refer to Exercise 86. Determine if each of the 
following changes would increase or decrease the power of the test. 
Explain your answers. 


a. Use arandom sample of 30 people instead of 50 people. 


b. Try to detect that 1=$85,5004 =$85, 500 instead of =$86,000 
y =$86, 000 


c. Change the significance level to a=0.10a = 0.10, 


Potato power problems Refer to Exercises 85 and 87. 


a. Explain one disadvantage of using a=0.10a = 0.10 instead of 
a=0.05a = 0.05 when performing the test. 
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b. Explain one disadvantage of taking a random sample of 500 
potatoes instead of 250 potatoes from the shipment. 


90. Restaurant power problems Refer to Exercises 86 and 88. 


91. 


92. 


a. Explain one disadvantage of using a=0.10a = 0.10 instead of 
a=0.05a = 0.05 when performing the test. 


b. Explain one disadvantage of taking a random sample of 50 people 
instead of 30 people. 


Strong chairs? A company that manufactures classroom chairs for 
high school students claims that the mean breaking strength of the 
chairs is 300 pounds. One of the chairs collapsed beneath a 220- 
pound student last week. You suspect that the manufacturer is 
exaggerating the breaking strength of the chairs, so you would like to 
perform a test of 


HO: p=30040: # = 300 
Ha: <300Ha: pe < 300 


where uF is the true mean breaking strength of this company’s 
classroom chairs. 


a. The power of the test to detect that p=2944 — 294 based ona 
random sample of 30 chairs and a significance level of a=0.05 
a = 0.05 is 0.71. Interpret this value. 


b. Find the probability of a Type I error and the probability of a Type 


II error for the test in part (a). 


c. Describe two ways to increase the power of the test in part (a). 


Better parking A local high school makes a change that should 
improve student satisfaction with the parking situation. Before the 
change, 37% of the school’s students approved of the parking that 
was provided. After the change, the principal surveys an SRS of 
students at the school. She would like to perform a test of 

HO: p=0.374o: p = 0.37 

Ha: p>0.37 Ha: p > 0.37 
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94. 


95. 


96. 


where p is the true proportion of students at school who are satisfied 
with the parking situation after the change. 


a. The power of the test to detect that p=0.45? = 9-49 based ona 
random sample of 200 students and a significance level of a=0.05 
a = 0.05 is 0.75. Interpret this value. 


b. Find the probability of a Type I error and the probability of a Type 
II error for the test in part (a). 


c. Describe two ways to increase the power of the test in part (a). 


Error probabilities and power You read that a significance test at 
the a=0.01@ = 0.01 significance level has probability 0.14 of making 
a Type II error when a specific alternative is true. 


a. What is the power of the test against this alternative? 


b. What’s the probability of making a Type I error? 


Power and error A scientist calculates that a test at the a=0.05 
a = 0.05 significance level has probability 0.23 of making a Type II 
error when a specific alternative is true. 


a. What is the power of the test against this alternative? 


b. What’s the probability of making a Type I error? 


Do you have ESP? A researcher looking for evidence of extrasensory 
perception (ESP) tests 500 subjects. Four of these subjects do 


significantly better (P<0.01)(P < 0.01) than random guessing. 


a. Is it proper to conclude that these four people have ESP? Explain 
your answer. 


b. What should the researcher now do to test whether any of these 
four subjects has ESP? 


Preventing colds A medical experiment investigated whether taking 
the herb echinacea could help prevent colds. The study measured 50 
different response variables usually associated with colds, such as 
low-grade fever, congestion, frequency of coughing, and so on. At the 
end of the study, those taking echinacea displayed significantly better 
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a7. 


98. 


99. 


responses at the a=0.05a = 0.05 level than those taking a placebo for 
3 of the 50 response variables studied. Should we be convinced that 
echinacea helps prevent colds? Why or why not? 


Improving SAT scores A national chain of SAT-preparation schools 
wants to know if using a smartphone app in addition to its regular 
program will help increase student scores more than using just the 
regular program. On average, the students in the regular program 
increase their scores by 128 points during the 3-month class. To 
investigate using the smartphone app, the prep schools have 5000 
students use the app along with the regular program and measure their 
improvement. Then the schools will test the following hypotheses: 
HO: u=128H0 > = 128 versus Ha: u>128Ha > 128 where pF is 
the true mean improvement in the SAT score for students who attend 
these prep schools. After 3 months, the average improvement was 

x =130% = 130 with a standard deviation of sx=655: = 65, The 
standardized test statistic is t=2.18¢ = 2.18 with a P-value of 0.0148. 
Explain why this result is statistically significant, but not practically 
important. 


Music and mazes A researcher wishes to determine if people are able 
to complete a certain pencil and paper maze more quickly while 
listening to classical music. Suppose previous research has 
established that the mean time needed for people to complete a 
certain maze (without music) is 40 seconds. The researcher decides to 
test the hypotheses HO: p=40 versus Ha: p<40 

Ho : w= 40-versus Hy : p< 40 where J = the time in seconds to 
complete the maze while listening to classical music. To do so, the 
researcher has 10,000 people complete the maze with classical music 
playing. The mean time for these people is x =39.92% = 39.92 
seconds, and the P-value of his significance test is 0.0002. Explain 
why this result is statistically significant, but not practically 
important. 


Sampling shoppers A marketing consultant observes 50 consecutive 
shoppers at a supermarket, recording how much each shopper spends 
in the store. Explain why it would not be wise to use these data to 
carry out a significance test about the mean amount spent by all 
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shoppers at this supermarket. 


00. Ages of presidents Joe is writing a report on the backgrounds of 
American presidents. He looks up the ages of all the presidents when 
they entered office. Because Joe took a statistics course, he uses these 
numbers to perform a significance test about the mean age of all U.S. 
presidents. Explain why this makes no sense. 

01. Power calculation: potatoes Refer to Exercise 85. 


a. 


Suppose that HO: p=0.08/40 : p = 0.08 is true. Describe the shape, 
center, and variability of the sampling distribution of p/? in random 
samples of size 500. 


Use the sampling distribution from part (a) to find the value of p? 
with an area of 0.05 to the right of it. If the supervisor obtains a 
random sample of 500 potatoes with a sample proportion of 
defective potatoes greater than this value of p’?, he will reject HO: 
p=0.08/0 : p = 0.08 at the a=0.05a = 0.05 significance level. 


Now suppose that p=0.11? = 9-11. Describe the shape, center, and 
variability of the sampling distribution of p\? in random samples of 
size 500. 


Use the sampling distribution from part (c) to find the probability of 
getting a sample proportion greater than the value you found in part 
(b). This result is the power of the test to detect p=0.11? = 9-11, 


Multiple Choice Select the best answer for Exercises 102—108. 


02. The reason we use t procedures instead of z procedures when carrying 
out a test about a population mean is that 


a. 
b. 
C. 
d. 


e. 


z requires that the sample size be large. 

zZ requires that you know the population standard deviation o. 
z requires that the data come from a random sample. 

z requires that the population distribution be Normal. 


z can only be used for proportions. 


03. You are testing HO: u=1040 > # = 10 against Ha: u<104a >< 10 
based on an SRS of 20 observations from a Normal population. The t 
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statistic is t=-2.25¢ =—2-25, The P-value 
a. falls between 0.01 and 0.02. 
b. falls between 0.02 and 0.04. 
c. falls between 0.04 and 0.05. 
d. falls between 0.05 and 0.25. 


e. is greater than 0.25. 

04. You are testing HO: u=1040 > = 10 against Ha: u#104a ‘#10 
based on an SRS of 15 observations from a Normal population. What 
values of the t statistic are statistically significant at the a=0.005 
a = 0.005 level? 


a. t>3.326t > 3.326 
b. t>3.286t > 3.286 
Cc. t>2.977t > 2.977 
d. t<-3.326t <—3.326 or t>3.326t > 3.326 


e, t<—3.286¢ <—3.286 or t>3.286t > 3.286 

05. After checking that conditions are met, you perform a significance test 
of HO: u=14o > =1 versus Ha: #1 Ho ‘#1. You obtain a P-value 
of 0.022. Which of the following must be true? 


a. A 95% confidence interval for p/ will include the value 1. 
b. A 95% confidence interval for p/ will include the value 0.022. 
c. A 99% confidence interval for p/ will include the value 1. 
d. A 99% confidence interval for u/ will include the value 0.022. 


e. None of these is necessarily true. 
06. The most important condition for making an inference about a 
population mean from a significance test is that 


a. the data come from a random sample. 

b. the population distribution is exactly Normal. 

c. the data contain no outliers. 

d. the sample size is less than 10% of the population size. 


e. the sample size is at least 30. 
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07. Vigorous exercise helps people live several years longer (on average). 
Whether mild activities like slow walking extend life is not clear. 
Suppose that the added life expectancy from regular slow walking is 
just 2 months. A significance test is more likely to find a significant 
increase in mean life expectancy with regular slow walking if 


a. it is based on a very large random sample and a 5% significance 
level is used. 

b. it is based on a very large random sample and a 1% significance 
level is used. 


c. it is based on a very small random sample and a 5% significance 
level is used. 


d. it is based on a very small random sample and a 1% significance 
level is used. 


e. the size of the sample doesn’t have any effect on the significance of 
the test. 

08. A researcher plans to conduct a significance test at the a=0.010 = 0.01 
significance level. She designs her study to have a power of 0.90 at a 
particular alternative value of the parameter of interest. The probability 
that the researcher will commit a Type II error for the particular 
alternative value of the parameter she used is 


a. 0.01. 
b. 0.10. 
c. 0.89. 
d. 0.90. 
e. 0.99. 


Recycle and Review 


109. Is your food safe? (8.1) “Do you feel confident or not confident that 
the food available at most grocery stores is safe to eat?” When a 
Gallup poll asked this question, 87% of the sample said they were 
confident.42 Gallup announced the poll’s margin of error for 95% 
confidence as +3+3 percentage points. Which of the following 
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sources of error are included in this margin of error? Explain your 
answer. 


a. Gallup dialed landline telephone numbers at random and so missed 
all people without landline phones, including people whose only 
phone is a cell phone. 


b. Some people whose numbers were chosen never answered the 
phone in several calls or answered but refused to participate in the 
poll. 


c. There is chance variation in the random selection of telephone 
numbers. 


110. Spinning for apples (5.3 or 6.3) In the “Ask Marilyn” column of 
Parade magazine, a reader posed this question: “Say that a slot 
machine has five wheels, and each wheel has five symbols: an apple, 
a grape, a peach, a pear, and a plum. I pull the lever five times. What 
are the chances that I’ ll get at least one apple?” Suppose that the 
wheels spin independently and that the five symbols are equally likely 
to appear on each wheel in a given spin. 


a. Find the probability that the slot player gets at least one apple in 
one pull of the lever. 


b. Now answer the reader’s question. 
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MM — FRAPPY! FREE RESPONSE 


AP® PROBLEM, 
YAY! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, concise 
response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you 
use, because you will be scored on the correctness of your methods 
as well as on the accuracy and completeness of your results and 
explanations. 


Anne reads that the average price of regular gas in her state is $4.06 
per gallon. To see if the average price of gas is different in her city, 
she selects 10 gas stations at random and records the price per gallon 
for regular gas at each station. The data, along with the sample mean 
and standard deviation, are listed in the table. 


Station 1 2 3 4 5 6 7 8 9 10 Mean SD 
Price 4.13 4.01 4.09 4.05 3.97 3.99 4.05 3.98 4.09 4.02 4.038 0.0533 


Do the data provide convincing evidence that the average price of 
gas in Anne’s city is different from $4.06 per gallon? 


After you finish, you can view two example solutions on the book’s 
website (highschool.bfwpub.com/tps6e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 
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“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide you with a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own 


score. 
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Chapter 9 Review 


Section 9.1: Significance Tests: The Basics 


In this section, you learned the basic ideas of significance testing. Start by 
stating the hypotheses that you want to test. The null hypothesis (Ho) 0) 
is typically a statement of “no difference” and the alternative hypothesis 
(Ha)(Ha) describes what we suspect is true. Remember that hypotheses 


are always about population parameters, not sample statistics. 


When sample data provide evidence for the alternative hypothesis, 
there are two possible explanations: (1) The null hypothesis is true, and 
data supporting the alternative hypothesis occurred just by chance, or (2) 
the alternative hypothesis is true, and the data are consistent with an 
alternative value of the parameter. In a significance test, always start with 
the belief that the null hypothesis is true. If you can rule out chance as a 
plausible explanation for the observed data, there is convincing evidence 
that the alternative hypothesis is true. 


The P-value in a significance test measures how likely it is to get 
evidence for Ha/e as strong or stronger than the observed evidence, 
assuming the null hypothesis is true. To determine if the P-value is small 
enough to reject H0O40 , compare it to a predetermined significance level 
such as a=0.05a = 0.05, If P-value<avalue < a, reject HO40—there is 
convincing evidence that the alternative hypothesis is true. However, if P- 
value>avalue > a fail to reject HO/0—there is not convincing evidence 
that the alternative hypothesis is true. 


Because conclusions are based on sample data, there is a possibility 
that the conclusion will be incorrect. You can make two types of errors in 
a significance test: a Type I error occurs if you find convincing evidence 
for the alternative hypothesis when, in reality, the null hypothesis is true. 
A Type II error occurs when you don’t find convincing evidence that the 
alternative hypothesis is true when, in reality, the alternative hypothesis is 
true. The probability of making a Type I error is equal to the significance 


level (a)(®) of the test. 
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Section 9.2: Tests About a Population Proportion 

In this section, you learned the details of performing a significance test 
about a population proportion p. Whenever you are asked if there is 
convincing evidence for a claim about a population proportion, you are 
expected to respond using the familiar four-step process. 


STATE: Give the hypotheses you are testing in terms of p, define the 
parameter p, and state the significance level. 


PLAN: Name the procedure you plan to use (one-sample z test for a 

population proportion) and check the appropriate conditions (Random, 

10%, Large Counts) to see if the procedure is appropriate. 

e Random: The data come from a random sample from the population of 
interest. 


© 10%: The sample size is less than 10% of the population when 
sampling without replacement. 


e Large Counts: Both np0O”Po and n(1—p0)” (1— Po) are at least 10, 
where pOP9 is the value of p in the null hypothesis. 

DO: Calculate the standardized test statistic and P-value. The standardized 
test statistic z measures how far away the sample statistic is from the 
hypothesized parameter value in standardized units: 

P—PO 

| Po (1—Po) 
z=p\-pOp0(1-pojn Y ™ 


To calculate the P-value, use ‘Table A or technology. 


CONCLUDE: Use the P-value to make an appropriate conclusion about 
the hypotheses, in context. 

Perform a two-sided test when looking for convincing evidence that 
the true value of the parameter is different from the hypothesized value, in 
either direction. The P-value for a two-sided test is calculated by finding 
the probability of getting a sample statistic at least as extreme as the 
observed statistic, in either direction, assuming the null hypothesis is true. 
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You can also use a confidence interval to make a conclusion for a two- 
sided test. If the null parameter value is one of the plausible values in the 
interval, there isn’t convincing evidence that the alternative hypothesis is 
true. However, if the null parameter value is not one of the plausible 
values in the interval, there is convincing evidence that the alternative 
hypothesis is true. Besides helping you draw a conclusion, the interval tells 
you which alternative parameter values are plausible. 


Section 9.3: Tests About a Population Mean 


In this section, you learned the details of performing a significance test 
about a population mean. Although some of the details are different, the 
reasoning and structure of the tests in this section are the same as in 
Section 9.2. In fact, the “State” and “Conclude” steps are exactly the same, 
other than the switch from proportions to means. 


PLAN: Name the procedure you are using (one-sample t test for a 
population mean), and check the conditions (Random, 10%, and 
Normal/Large Sample). The Random and 10% conditions are the same as 
in Section 9.2. The Normal/Large Sample condition states that the 
population distribution is Normal or the sample size is large (n>30) 

(n > 30) Tf the sample size is small and the population shape is unknown, 
graph the sample data to make sure there is no strong skewness or outliers 


that would suggest a non-Normal population. 


DO: Calculate the standardized test statistic and P-value. The standardized 
test statistic ¢ measures how far away the sample statistic is from the 
hypothesized parameter value in standardized units: 
t—p 
t= 0 
oH 


t=x —yOsxn vn 


To calculate the P-value using a t distribution, determine the degrees of 
freedom (df=n—1)\4£ = " — 1) and use Table B or technology. 


The probability that you avoid making a Type II error when an 
alternative value of the parameter is true is called the power of the test. 
Power is good—if the alternative hypothesis is true, we want to maximize 
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the probability of finding convincing evidence that it is true. We can 
increase the power of a significance test by increasing the sample size or 
by increasing the significance level. The power of a test will also be 
greater when the alternative value of the parameter is farther away from 
the null hypothesis value. 


Remember to use significance tests wisely. When planning a study, use 
a large enough sample size so the test will have adequate power. Also, 
remember that statistically significant results aren’t always practically 
important. Finally, be aware that the probability of making at least one 
Type I error goes up dramatically when conducting multiple tests. 


Significance test for p Significance test for p/“ 


Name (TI- One-sample z test for One-sample t test for p/ (TTest) 
83/84) p (1-PropZTest) 
Formmeta z=p/—p0p0(1—-p0)n $i T—Ug 
ee $a 
| po (1—p9) =x “pOsxn Vv" 
\ n P-value from t distribution with df=n-1 


df=n-1 
P-value from standard 
Normal distribution 


Conditions , Random: The data e Random: The data come from a 
come from a random sample from the population of 
random sample interest. 
from the population = «_ 49%: When sampling without 
of interest. replacement, n<0.10Nn < 0.10N, 

o 10%: When 


e Normal/Large Sample: The 


ae pane population has a Normal distribution 
cr ' or the sample size is large (n=30). 
replacement, 

ager (n > 30). If the population 
n<0.10N. distribution has unknown shape and 


" e ; n<30,” < 30, use a graph of the 
° et aie ; sample data to assess the Normality of 
oth np an the population. Do not use t procedures 


n(1—poy”? (1 — Po) if the graph shows strong skewness or 
are at least 10. That —_ outliers. 
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is, the expected 
number of successes 
and the expected 
number of failures 
in the sample are 
both at least 10. 


What Did You Learn? 


Related Relevant 
Example Chapter 


on Review 
Learning Target Section Page(s) Exercise(s) 
State appropriate hypotheses for a 9.1 5595~—sC R91, R9.5 
significance test about a population 
parameter. 
Interpret a P-value in context. 9.1 907 BS.7 
Make an appropriate conclusion for a 9.1 Soo BO3, Be7 
significance test. 
Interpret a Type I and a Type II error in 9.1 361 R9.5 


context. Give a consequence of each error 

in a given setting. 

State and check the Random, 10%, and 9.2 569 R9.2 
Large Counts conditions for performing a 

significance test about a population 

proportion. 


Calculate the standardized test statistic and 9.2 B/ 1 R9.3 
P-value for a test about a population 

proportion. 

Perform a significance test about a 92 674,577 R9.6 
population proportion. 

State and check the Random, 10%, and 9.3 586 R9.2 
Normal/Large Sample conditions for 

performing a significance test about a 

population mean. 

Calculate the standardized test statistic and 9.3 589 Bed 
P-value for a test about a population mean. 
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Perform a significance test about a 9.3 
population mean. 


592 


995 


Use a confidence interval to make a 9.3 , 


conclusion for a two-sided test about a 
population parameter. 


Interpret the power of a significance test 9.3 


and describe what factors affect the power 


998, 602 


of a test. 
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Chapter 9 Review Exercises 


These exercises are designed to help you review the important ideas and 
methods of the chapter. 


R9.1 |») Stating hypotheses State the appropriate null and alternative 


hypotheses in each of the following settings. Explain why the 
sample data give some evidence for Ha/« in each case. 


a. The average height of 18-year-old American women is 64.2 
inches. You wonder whether the mean height of this year’s 
female graduates from a large local high school differs from the 
national average. You measure an SRS of 48 female graduates 


and find that x =63.5 inches# = 63.5 inches and sx=3.7 inches 
$, = 3.7 inches. 


b. Rob once read that one-quarter of all people have played/danced 
in the rain at some point in their lives. His friend Justin thinks 
that the proportion is higher than 0.25 for their high school. To 
settle their dispute, they ask a random sample of 80 students in 
their school and find out that 28 have played/danced in the rain. 


R9.2 |») Checking conditions Refer to Exercise R9.1. Identify the 
appropriate test to perform in each setting and show that the 
conditions for carrying out the test are met. 

R9.3 |) Calculations and conclusions Refer to Exercise R9.1. Find the 
standardized test statistic and P-value in each setting, and make an 
appropriate conclusion. 

R9.4 |») Fonts and reading ease Does the use of fancy type fonts slow 


down the reading of text on a computer screen? Adults can read four 
paragraphs of a certain text in the common Times New Roman font 
in an average time of 22 seconds. Researchers asked a random 
sample of 24 adults to read this text in the ornate font named Gigi. 
Here are their times (in seconds): 
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23.2 212 28.9. 27.7 291 27.3 16.1 22.6: 25.6 34.2 23.9 .26.8 
20.5 343 214 32.6 26.2 341 31.5 246 23.0 286 244 28.1 


Do these data provide convincing evidence that it takes adults longer 
than 22 seconds, on average, to read these four paragraphs in Gigi 
font? 

R9.5 |p) Flu vaccine A drug company has developed a new vaccine for 


preventing the flu. The company claims that fewer than 5% of adults 
who use its vaccine will get the flu. To test the claim, researchers 
give the vaccine to a random sample of 1000 adults. 


a. State appropriate hypotheses for testing the company’s claim. Be 
sure to define your parameter. 


b. Describe a Type I error and a Type IJ error in this setting, and 
give the consequences of each. 


c. Would you recommend a significance level of 0.01, 0.05, or 0.10 
for this test? Justify your choice. 


d. The power of the test to detect the fact that only 3% of adults 
who use this vaccine would develop flu using a=0.05a = 0.05 is 
0.9437. Interpret this value. 


e. Explain two ways that you could increase the power of the test 
from part (d). 
R9.6 |) Flu vaccine Refer to Exercise R9.5. Of the 1000 adults who 


were given the vaccine, 43 got the flu. Do these data provide 
convincing evidence to support the company’s claim? 


R9.7 |») Roulette An American roulette wheel has 18 red slots among its 


38 slots. To test if a particular roulette wheel is fair, you spin the 
wheel 50 times and the ball lands in a red slot 31 times. The 
resulting P-value is 0.0384. 


a. Interpret the P-value. 
b. What conclusion would you make at the a=0.05a@ = 0.05 level? 


c. The casino manager uses your data to produce a 99% confidence 
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interval for p and gets (0.44, 0.80). He says that this interval 
provides convincing evidence that the wheel is fair. How do you 
respond? 
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Chapter 9 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T9.1 An opinion poll asks a random sample of adults whether they favor 
banning ownership of handguns by private citizens. A 
commentator believes that more than half of all adults favor such a 
ban. The null and alternative hypotheses you would use to test this 


claim are 

a. HO: pA=0.5; Ha: p>0.5o :p=0.5; H,:p>0.5. 
b. HO: p=0.5; Ha: p>0.50 : p = 0.5; Ha : p > 0.5, 

c. HO: p=0.5; Ha: p<0.540 >p=0.5;H, :p<0.5. 

d. HO: p=0.5; Ha: p#0.5Ho : p = 0.5; Ha : p # 0.5, 

e. HO: p>0.5; Ha: p=0.54o : p > 0.5; Ha : p= 0.5, 


T9.2 You are thinking of conducting a one-sample t test about a 
population mean py using a 0.05 significance level. Which of the 
following statements is correct? 


a. 


You should not carry out the test if the sample does not have a 
Normal distribution. 


You can safely carry out the test if there are no outliers, 
regardless of the sample size. 


You can carry out the test if a graph of the data shows no strong 
skewness, regardless of the sample size. 


You can carry out the test only if the population standard 
deviation is known. 


You can safely carry out the test if your sample size is at least 
30. 


T9.3 To determine the reliability of experts who interpret lie detector 
tests in criminal investigations, a random sample of 280 such cases 
was studied. The results were 
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Suspect’s True Status 


Examiner’s Decision Innocent Guilty 
“Innocent” 131 15 
“Guilty” q 125 


If the hypotheses are HO/0 : Suspect is innocent versus Ha/Za: 
Suspect is guilty, which of the following is the best estimate of the 
probability that an expert commits a Type II error? 


a. 15/280 
b. 9/280 
c. 15/140 
d. 9/140 
e. 15/146 


T9.4 A significance test allows you to reject a null hypothesis Hoo in 
favor of an alternative hypothesis Ha/4e at the 5% significance 
level. What can you say about significance at the 1% level? 


a. HO#o can be rejected at the 1% significance level. 


b. There is insufficient evidence to reject HO at the 1% 
significance level. 


c. There is sufficient evidence to accept HO at the 1% 
significance level. 


d. Hala can be rejected at the 1% significance level. 
e. The answer can’t be determined from the information given. 


T9.5 A random sample of 100 likely voters in a small city produced 59 
voters in favor of Candidate A. The observed value of the 
standardized test statistic for performing a test of HO: p=0.5 
Ho : p = 0.5 versus Ha: p>0.54a : p > 0.5 is which of the 
following? 

0.59—0.5 


/0.59(0.41) 


a. Z=0.59-0.50.59(0.41)100 ¥ 100 ~ 


y 
~ 
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/ 0.5(0.5) 
b. z=0.59-0.50.5(0.5)100 = V1 
0.5—0.59 
/ 0.59(0.41) 
c. 2=0.5-0.590.59(0.41)100 =) ™ 
0.5—0.59 
d. z=0.5-0.590.5(0.5)100 = Ym 
, — 0.59-0.5 


e. z=0.59-0.5100 v'100 


y A 
a” 


» — 


T9.6 A researcher claims to have found a drug that causes people to 
grow taller. The coach of the basketball team at Brandon 
University has expressed interest but demands evidence. Over 
1000 Brandon students volunteer to participate in an experiment to 
test this new drug. Fifty of the volunteers are randomly selected, 
their heights are measured, and they are given the drug. Two 
weeks later, their heights are measured again. The power of the 
test to detect an average increase in height of 1 inch could be 
increased by 


a. using only volunteers from the basketball team in the 
experiment. 


b. using a=0.01a = 0.01 instead of a=0.05a = 0.05, 

c. using a=0.05a@ = 0.05 instead of a=0.01a = 0.01, 

d. giving the drug to 25 randomly selected students instead of 50. 
e. using a two-sided test instead of a one-sided test. 


T9.7 A 95% confidence interval for the proportion of viewers of a 
certain reality television show who are over 30 years old is (0.26, 
0.35). Suppose the show’s producers want to test the hypothesis 
HO: p=0.2540 : p = 9.25 against Ha: p#0.25a :p #0.25 Which 
of the following is an appropriate conclusion for them to draw at 
the a=0.05@ = 0.05 significance level? 


a. Fail to reject HO;/40; there is convincing evidence that the true 
proportion of viewers of this reality TV show who are over 30 
years old equals 0.25. 
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Fail to reject HO;0; there is not convincing evidence that the 
true proportion of viewers of this reality TV show who are over 
30 years old differs from 0.25. 


Reject HO;/0; there is not convincing evidence that the true 
proportion of viewers of this reality TV show who are over 30 
years old differs from 0.25. 


Reject HO;0; there is convincing evidence that the true 
proportion of viewers of this reality TV show who are over 30 
years old is greater than 0.25. 


Reject H0O;/40; there is convincing evidence that the true 
proportion of viewers of this reality TV show who are over 30 
years old differs from 0.25. 


T9.8 In a test of HO : p=0.440 : p = 0.4 against Ha : p~0.4Ha ipF04 
a random sample of size 100 yields a standardized test statistic of 
Z=1.282 = 1.28. Which of the following is closest to the P-value 
for this test? 


a. 


0.90 


b. 0.40 
c. 0.05 
d. 0.20 
e. 0.10 


T9.9 Bags of a certain brand of tortilla chips claim to have a net weight 


of 14 ounces. Net weights vary slightly from bag to bag and are 
Normally distributed with mean p. A representative of a 
consumer advocacy group wishes to see if there is convincing 
evidence that the mean net weight is less than advertised and so 
intends to test the hypotheses 


HO: p=14Ho: w = 14 
Ha: p<14Ha: w < 14 


A Type | error in this situation would mean concluding that the 
bags 
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a 


a 


are being underfilled when they aren’t. 

are being underfilled when they are. 

are not being underfilled when they are. 
are not being underfilled when they aren’t. 


are being overfilled when they are underfilled. 


T9.10 Which of the following has the greatest probability? 


a. 
b. 
C. 
d. 


e. 


P(t>2)P (¢ > 2) if thas 5 degrees of freedom. 
P(t>2)P (t > 2) if thas 2 degrees of freedom. 
P(z>2)P (2 > 2) if is a standard Normal random variable. 
P(t<2)P (t <2) if thas 5 degrees of freedom. 


P(z<2)P (2 < 2) if is a standard Normal random variable. 


Section II: Free Response Show all your work. Indicate clearly the 
methods you use, because you will be graded on the correctness of your 
methods as well as on the accuracy and completeness of your results and 
explanations. 


T9.11 A software company is trying to decide whether to produce an 
upgrade of one of its programs. Customers would have to pay $100 
for the upgrade. For the upgrade to be profitable, the company 
must sell it to more than 20% of their customers. You contact a 
random sample of 60 customers and find that 16 would be willing 
to pay $100 for the upgrade. 


a. 


b. 


Do the sample data give convincing evidence that more than 
20% of the company’s customers are willing to purchase the 
upgrade? Carry out an appropriate test at the a=0.05a = 0.05 
significance level. 


Which would be a more serious mistake in this setting—a Type 
I error or a Type II error? Justify your answer. 


Suppose that 30% of the company’s customers would be willing 
to pay $100 for the upgrade. The power of the test to detect this 
fact is 0.60. Interpret this value. 
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T9.12 According to the Bureau of Labor Statistics, the average age of 
American workers is 41.9 years. The manager of a large 
technology company believes that the company’s employees tend 
to be younger, on average. So she takes a random sample of 12 
employees and records their ages. Here are the data: 


27 38 32 24 30 47 42 38 27 43 37 33 


a. State appropriate hypotheses for testing the manager’s belief. 
Be sure to define the parameter of interest. 


b. State the conditions for performing a test of the hypotheses in 
(a), and determine whether each condition is met. 


c. The P-value of the test is 0.003. Interpret this value. What 
conclusion would you make? 


T9.13 A government report says that the average amount of money spent 
per U.S. household per week on food is about $158. A random 
sample of 50 households in a small city is selected, and their 
weekly spending on food is recorded. The sample data have a mean 
of $165 and a standard deviation of $20. Is there convincing 
evidence that the mean weekly spending on food in this city differs 
from the national figure of $158? 
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Chapter 10 Comparing Two 


Populations or Treatments 


Bill Gozansky/Alamy 


Introduction 


Section 10.1 Comparing Two Proportions 
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INTRODUCTION 


Which of two often-prescribed drugs—Lipitor or Pravachol—helps lower 
“bad cholesterol” more? Researchers designed an experiment, called the 
PROVE-IT Study, to find out. They used about 4000 people with heart 
disease as subjects. These individuals were randomly assigned to one of 
two treatment groups: Lipitor or Pravachol. At the end of the study, 
researchers compared the proportion of subjects in each group who died, 
had a heart attack, or suffered other serious consequences within two 
years. For those using Pravachol, the proportion was 0.263; for those using 
Lipitor, it was 0.224.4 Could such a difference have occurred purely by the 
chance involved in the random assignment? This is a question about 
comparing two proportions. 

Who studies more in college—men or women? Researchers asked 
separate random samples of 30 males and 30 females at a large university 
how many minutes they studied on a typical weeknight. The females 
reported studying an average of x F=165.17@F = 165.17 minutes; the 
male average was x M=117.17%™ = 117.17 minutes. How large is the 
difference in the corresponding population means? This is a question about 
comparing two means. 


What if we want to compare means in a setting that involves measuring 
the same quantitative variable twice for each individual or once for each of 
two individuals that are much alike? For instance, a researcher studied a 
random sample of identical twins who had been separated and adopted at 
birth. In each case, one twin was adopted by a high-income family and the 
other by a low-income family. Both twins were given an IQ test as adults. 
Did the twins raised in high-income families have significantly higher 
average IQ scores than the twins raised in low-income families? This is a 
question about comparing two means for paired data. 


Comparing two proportions or means based on random sampling or a 
randomized experiment is one of the most common situations encountered 
in statistical practice. In the PROVE-IT experiment, the goal of inference 
is to determine whether the treatments (Lipitor and Pravachol) caused the 
observed difference in the proportion of subjects who experienced serious 
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consequences in the two groups. For the college study-time survey, the 
goal of inference is to draw a conclusion about the difference in actual 
mean study times for all women and all men at the university. In the 
observational study of identical twins raised separately, the goal is to make 
an inference about whether twins raised in high-income families have 
higher mean adult IQ scores than twins raised in low-income families. 


The following activity gives you a taste of what lies ahead in this 
chapter. 


ACTIVITY who likes tattoos? 


For their response bias project (page 292), Sarah and Miranda 
investigated whether the characteristics of the interviewer can affect the 
response to a survey question. At the Tucson Mall, they asked 60 
shoppers the question “Do you like tattoos?” When interviewing 30 of 
the shoppers, Sarah and Miranda wore long-sleeved tops that covered 
their tattoos. For the remaining 30 shoppers, the interviewers wore tank 
tops that revealed their tattoos. The choice of long-sleeve or tank top 
was determined at random. Sarah and Miranda suspected that more 
people would answer “Yes” when their tattoos were visible. 


Rawpixel.com/Shutterstock 


What happened in the experiment? The two-way table summarizes 
the results:2 


Clothing worn 


Tank top Long sleeves Total 
Yes ay 
Like tattoos? 
No 28 


Total | 30 | 30 | 60 


The difference (Tank top — Long sleeves) in the proportions of 
people who said they like tattoos is 18/30-—14/30=0.600-0.467=0.133 
18/30 — 14/30 = 0.600 — 0.467 = 0.133. Does this difference provide 
convincing evidence that the appearance of the interviewer has an effect 
on the response, or could the difference be due to chance variation in the 
random assignment? 

In this activity, your class will investigate whether the results of the 
experiment are statistically significant. Let’s see what would happen just 
by chance if we randomly reassign the 60 people in this experiment to 
the two treatments (tank top and long sleeves) many times, assuming the 
treatment received doesn’t affect whether or not a person says they like 
tattoos. 


1. Using 60 index cards or equally sized pieces of paper, write “Yes” on 
32 and “No” on 28. Shuffle the cards and divide them at random into 
two piles of 30—one for the tank top treatment group and one for the 
long sleeves treatment group. Be sure to determine which pile will 
represent each group before you deal. 


2. Calculate the difference (Tank top — Long sleeves) in the proportions 
of “Yes” responses for the two groups. 


3. Your teacher will draw and label axes for a class dotplot. Plot your 
result from Step 2 on the graph. 


4. Repeat Steps 2 and 3 if needed to get a total of at least 40 trials of the 
simulation for your class. 


5. How often did a difference in proportions of 0.133 or greater occur 
due only to the chance involved in the random assignment? What 
conclusion would you make about the effect of the interviewer’s 
appearance? 


We used software to perform 100 trials of the simulation described in 
the activity. In each trial, the computer randomly reassigned the 60 
subjects in the experiment to the two treatments, assuming that the 
treatment received would not affect whether each person says that he or 
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she likes tattoos. Each dot in the graph represents the difference (Tank top 
— Long sleeves) in the proportion of respondents in the two groups who 
say they like tattoos for a particular trial. In this simulation, 19 of the 100 
trials (in red) produced a difference in proportions of at least 0.133, so the 
approximate P-value is 0.19. A difference this large could easily occur just 
due to the chance variation in random assignment! Sarah and Miranda’s 
data do not provide convincing evidence that the appearance of the 
interviewer caused a higher proportion of people to say they like tattoos 
when the interviewer’s tattoos were visible. 
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SECTION 10.1 Comparing Two Proportions 


LEARNING TARGETS By the end of the section, you should be able to: 


e Describe the shape, center, and variability of the sampling 
distribution of p\1—pA2P1 ~ P2, 


Determine whether the conditions are met for doing inference about 
a difference between two proportions. 


Construct and interpret a confidence interval for a difference 
between two proportions. 


Calculate the standardized test statistic and P-value for a test about 
a difference between two proportions. 


Perform a significance test about a difference between two 
proportions. 


Suppose we want to compare the proportions of individuals with a certain 
characteristic in Population 1 and Population 2. Let’s call these parameters 
of interest p1?1 and p2P2. The preferred strategy is to take a separate 
random sample from each population and to compare the sample 
proportions p1?1 and p\2?2 with that characteristic. 


What if we want to compare the effectiveness of Treatment 1 and 
Treatment 2 in a completely randomized experiment? This time, the 
parameters p1?1 and p2?P2 that we want to compare are the true 
proportions of successful outcomes for each treatment. We use the 
proportions of successes in the two treatment groups, pA1P 1 and pA2P2 , to 
make the comparison. Here’s a table that summarizes these two situations: 


Population or treatment Parameter Statistic Sample size 
1 pl pAiP1 ni” 


We compare the populations or treatments by doing inference about 
the difference p1—p2P1 ~ P2 between the parameters. The statistic that 
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estimates this difference is the difference between the two sample 
proportions, p\1—p/2?1 ~ P2. To use p\1—p/2P1 ~ P2 for inference, we 
must know its sampling distribution. 


The Sampling Distribution of a Difference Between Two 
Proportions 


To explore the sampling distribution of p\1—ph2P1 ~ P2, let’s start with 
two populations having a known proportion of successes. Suppose there 
are two large high schools—each with more than 2000 students—in a 
certain town. At School 1, 70% of students did their homework last night 
(p1=0.70) (P1 = 0.70). Only 50% of the students at School 2 did their 
homework last night (p2=0.50)(P2 = 0.50) The counselor at School 1 
selects an SRS of 100 students and records the proportion pr1P1 that did 
the homework. School 2’s counselor selects an SRS of 200 students and 
records the proportion pr2P2 that did the homework. What can we say 
about the difference pA1—pA2P1 ~ P2 in the sample proportions? 


MachineHeadz/Getty Images 


In Chapter 7, we saw that the sampling distribution of a sample 
proportion p/P has the following properties: 


Shape: Approximately Normal if np>10"P = 10 and n(1—p)=10 
n(1—p)>10 
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Center: pA=pCereer ‘My =P 
Variability:op/=p(1—p)n if n<0.10N 


[p(1—p) 


Variability : 0; = \/ ifn < 0.10N 


For the sampling distributions of pA1P1 and pr2P2 in this case: 


Sampling distribution of p/ 1P1 
Shape Approximately Normal; 


mpi: = 100(0.70) = 70 > 10 and 


n1p1=100(0.70)=70>10 andn1(1—p1)=100(0.30)=30 >10 m1 -m) = 1001030) ~ 20 >10 


Center =p, = 0.70 


up1=p1=0.70""» | 

Variability 7 | pip) - [o7o3) 
"1 Vm ~V~ 100 

op\1=p1(1-p1)n1=0.7(0.3)100=0.0458 = 9-0458 

because 100 < 10% of all students at School 1. 


What about the sampling distribution of p\1—pA2P1 ~ P2? We used 
software to randomly select 100 students from School 1 and 200 students 
from School 2. Our first set of samples gave pA1=0.72P 1 = 0.72 and 
p\2=0.47P2 = 0.47 resulting in a difference of p\1—p42=0.72-0.47=0.25 
P, ~ Pz = 0.72 — 0.47 = 0.25 | A red dot for this value appears in Figure 
10.1. The dotplot shows the results of repeating this process 500 times. 
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FIGURE 10.1 Simulated sampling distribution of the difference in sample 
proportions p41—p/2 
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in 500 SRSs of size n1=100 


1 = 100 


from a population with p1=0.70 


1 = 0.70 


and 500 SRSs of size n2=200 


2 = 200 


from a population with p2=0.50 
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9 = 0.50 


The figure suggests that the sampling distribution of pA1-pA2P1 — Po 
has an approximately Normal shape. This makes sense from what you 
learned in Section 6.2 because we are subtracting two independent random 
variables, pA1P1 and pA2P 2, that have approximately Normal 
distributions. 

The mean of the sampling distribution is 0.20. The true proportion of 
students who did last night’s homework at School 1 is p1=0.70P1 = 9.70 
and at School 2 is p2=0.50P2 = 9-50. We expect the difference p\1—p/2 
P: ~ P2 to center on the actual difference in the population proportions, 
p1—p2=0.70-0.50=0.20P1 ~ p2 = 9.70 — 0.50 = 0.20. The standard 
deviation of the sampling distribution is 0.058. It can be found using the 
formula 


p1(1- p 1)n1+p2(1- p 2)n2=0.7(0.3)100+0.5(0.5)200=0.058 


'py(1—-p,) po(1—-p 0.7(0.3) 0.5(0.5 
‘ee 1) , Pal a) =] (0:3) | 


= 0.058 
ny ng 100 200 ° 


That is, the difference (School 1-School 2)(Schooll ~ School2) in the 
sample proportions of students at the two schools who did their homework 
last night typically varies by about 0.058 from the true difference in 
proportions of 0.20. 


THE SAMPLING DISTRIBUTION OF pA1-p/2?P1 — Po 


Choose an SRS of size n1™1! from Population 1 with proportion of 
successes p1P1 and an independent SRS of size n2™2 from Population 2 
with proportion of successes p2P2. Then: 


e The sampling distribution of p\1-p2P1 — Pe is approximately 
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Normal if the Large Counts condition is met for both samples: 
np112>10, ni(l-p1)=10, n2p2>10, 

NyPi > 10, Thy (1 = pi) = 10, 3 P2 Be 10, and n2(1—-p2)>10 

9 (1 — p2) = 10 


e The mean of the sampling distribution of p\1—pA2P1 P2 is 
pp\1-p\1=p1-p2ei-p: ~ PLP 
e The standard deviation of the sampling distribution of p41—p/2 
Pi — Po is 
op41—-p42=p1(1-p1)n1+p2(1-p2)n2 
[py (1—p1)  P2(1-Pe) 
Oo: " = aaa + ——————— 
Pi~P2 \ n4 no 


as long as the 10% condition is met for both samples: n1<0.10N1 
ny <0.10N) and n2<0.10N2"2 < 0.10N2. 


Note that the formula for the standard deviation of the sampling 
distribution is only correct for independent random samples from the 
corresponding populations. We will adjust the Random condition for 
inference about a difference between two proportions to reflect this added 
requirement. The standard deviation of the sampling distribution tells us 
how much the difference in sample proportions will typically vary from 
the difference in the population proportions if we repeat the random 
sampling process many times. 


Think About It 


WHERE DO THE FORMULAS FOR THE MEAN AND 
STANDARD DEVIATION OF THE SAMPLING DISTRIBUTION 
OF p41 - pA2?1 ~ P2 COME FROM? Both p‘1P: and p2?2 are 
random variables. That is, their values would vary in repeated 
independent SRSs of size n1™ and n2"2. Independent random samples 
yield independent random variables pA1P1 and pr2P 2. The statistic 
p\1-pA2P 1 ~ P2 is the difference of these two independent random 
variables. 
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In Chapter 6, we learned that for any two random variables X and Y, 
uxX—-Y="X-pVYHx-y = Mx — Hy 
For the random variables pA1P1 and pA2P 2, we have 
yp\1—p\2=yp\1—pp2=p1—p2"Pi-P2 ~ Mp, ~ Mp, ~ Pi Pe 
We also learned in Chapter 6 that for independent random variables 
X and Y, 
oX-Y2=0X2+0V27X-¥ = Ox + Py 
For the random variables pA1P 1 and pA2P 2, we have 


op\1—-p22=o0p12+0p\22=(p1(1—-p1)n1)2+(p2(1-p2)n2)2 


| i - | feces a Se a 
ms ot “ot -(\™ a) (4 a) 
Pi-P2 Pi 2 \ ny \ ng 


_Pi(t=pi) | Pa(t=P2) 
=p1(1-p1)n1+p2(1—-p2)n2 ny ng 


/rmi(l—pi) | pe(l—pe) 
a f yt ae SS — 


So op\1-p/2=p1(1-p1)n1+p2(1—p2)n2°P1-P2 ~ Vm ng 


When the conditions are met, we can use the Normal density curve 
shown in Figure 10.2 to model the sampling distribution of p41—p/2 
P: ~ P2, Note that this would allow us to calculate probabilities involving 
pA1-p2P1 ~ P2 with a Normal distribution. 


Standard deviation: 


jee -Pi) po(1-pp) 
+ 


ny ny 


Values of p; — Pp» 


FIGURE 10.2 Select independent SRSs from two populations having proportions 
of successes p1 
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and p2 
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. The proportions of successes in the two samples are p41 
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and pA2 
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. When the conditions are met, the sampling distribution of p41 


1348 


Po» 


is approximately Normal with mean p1—p2 


Pi — 


and standard deviation p1(1—p1)n1+p2(1-p2)n2 
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Ey Ss eee eee 
I py (1—p,) Po (1—ps) 
: 


Thy m9 


EXAMPLE | Yummy goldfish! (») 
Describing the sampling distribution of p41-p/2 
Pi — Po 


PROBLEM: Your teacher brings two bags of colored goldfish crackers 
to class. Bag 1 has 25% red crackers and Bag 2 has 35% red crackers. 
Each bag contains more than 1000 crackers. Using a paper cup, your 
teacher takes an SRS of 50 crackers from Bag 1 and a separate SRS of 
AO crackers from Bag 2. Let pA1—p2P1 ~ P2 be the difference in the 
sample proportions of red crackers. 


a. What is the shape of the sampling distribution of p\1-p2P1 — P2? 
Why? 
b. Find the mean of the sampling distribution. 


c. Calculate and interpret the standard deviation of the sampling 


distribution. 
a @a 
aa 
be et 
VaMVe 
a q X e 
POY 
SOLUTION: 
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a. Approximately Normal, because n1p1 = 50(0.25) = 12.5 
mpi = 50 (0.25) = 12.5. a4” - pt) = 50(0.75) = 37.5 
ny (1 = P1) = 50 (0.75) = 37.5 n2p2 = 40(0.35) =14 
nap2 = 40(0.35) = 14) and n2(1 - p2) = 40(0.65) = 26 
nz (1 — pz) = 40(0.65) = 26 are al] >10.> 10. 


Note that these values are the expected numbers of successes and 


failures in the two samples. 


ee ee ee ae erns.—5. — 9-20 — 0.50 = — 0.10 


up\1—pA2=p1-p2"r: Po = Pi — p2 


c. Because 50 < 10% of all crackers in Bag 1 and 40 < 10% of all 
crackers in Bag 2, 
op/1-pA2 = 0.25(0.75)50+0.35(0.65)40 = 0.0971 
[0.25(0.75)  0.35(0.65) 
Sg \ so. t —1 = 0.0971 

The difference (Bag 1 — Bag 2) in the sample proportions of red 

goldfish crackers typically varies by about 0.097 from the true 

difference in proportions of -0.10.~9-10. 


op\1—-p42=p1(1-p1)n1+p2(1-p2)n2 


| py (1-P}) 2 (1—p2) 
ng 


FOR PRACTICE, TRY EXERCISE 1 


Confidence Intervals for p1 — p2?: — P2 


1352 


When data come from two independent random samples or two groups in a 
randomized experiment (the Random condition), the statistic p\1—p/2 

P: ~ P2 is our best guess for the value of p1—p2?1 ~ P2, The method we 
use to calculate a confidence interval for p1—p2?! ~ P2 requires that the 
sampling distribution of pA1-pA2P1 ~ P2 be approximately Normal. 
Earlier, we noted that this will be true whenever n1p1"1P1, n1(1-p1) 

m (1— pr) n2p2"2P2, and n2(1—p2)!"2 (1 — p2) are all at least 10. Because 
we don’t know the value of p1?! or p2?2 when we are estimating their 
difference, we use pA1P1 and pA2P 2 when checking the Large Counts 
condition. 


CONDITIONS FOR CONSTRUCTING A CONFIDENCE 
INTERVAL ABOUT A DIFFERENCE IN PROPORTIONS 


e Random: The data come from two independent random samples or 
from two groups in a randomized experiment. 


© 10%: When sampling without replacement, n1<0.10N1 
n; <0.10N) and n2<0.10N2%2 < 0-10.N2. 
e Large Counts: The counts of “successes” and “failures” in each 
sample or group—n1p\1™P1, n1(1—-p1)™ (1— pi), n2pr2"2P2 , 
n2(1—pA2)"2 (1 — P2)_are all at least 10. 


Recall from Chapter 4 that the Random condition is important for 
determining the scope of inference. Random sampling allows us to 
generalize our results to the populations of interest; random assignment in 
an experiment permits us to draw cause-and-effect conclusions. 


EXAMPLE |Do you prefer brand names? » 
Checking conditions 
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Ariel Skelley/Getty Images 


PROBLEM: A Harris Interactive survey asked independent random 
samples of adults from the United States and Germany about the 
importance of brand names when buying clothes. Of the 2309 U.S. 
adults surveyed, 26% said brand names were important, compared with 
22% of the 1058 German adults surveyed. Let pU=PU ~ the true 
proportion of all U.S. adults who think brand names are important when 
buying clothes and pG=P¢ ~ the true proportion of all German adults 
who think brand names are important when buying clothes. Check if the 
conditions for calculating a confidence interval for pU-pGPU ~ PG are 
met. 


SOLUTION: 


e Random: Independent random samples of 2309 U.S. adults and 1058 
German adults. / 


o 10%: 2309<10%2309 < 10% of all U.S. adults and 1058 << 10% 
of all German adults “ 


Be sure to mention independent random samples from the populations 


of interest when checking the Random condition. 


e Large Counts?nUp/U = 2309(0.26) = 600.34 — 600 
ny Py = 2309 (0.26) = 600.34 — 600 nU(1 — pa) = 2309(0.74) = 
1708.66 1709" (1 — By) = 2309 (0.74) = 1708.66 + 1709 4G 
pAG = 1058(0.22) = 232.76 — 233 
ng Pg = 1058 (0.22) = 232.76 — 233 Gc — pAG) = 1058(0.78) = 
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825.24 825"G (1 = Pa) = 1058 (0.78) = 825.24 — 825 are all >10 
> 1057 


We round these values to the nearest whole number because they 


represent the actual numbers of successes and failures in the two 


samples. 


FOR PRACTICE, TRY EXERCISE 5 


If the conditions are met, we can use our familiar formula to calculate a 
confidence interval for p1—p2:P1 ~ P2 : 


Statistict+(critical value) ‘(standard deviation of statistic) 
statistic + (critical value) - (standard deviation of statistic) 


=(p1- pr2)ize- (standard deviation of statistic) 
= (p, — po) + 2” - (standard deviation of statistic) 


As mentioned earlier, the standard deviation of the sampling distribution 
of pA1—-pA2P1 ~ P2 is 


ot we gf EEE), Baa) 
7 _ _ _ oD, P2 \ ny n9 
op41-p42=p1(1-p1)n1+p2(1-p2)n2 


Because we don’t know the values of the parameters p1?1 and p2P2, we 
replace them in the standard deviation formula with the sample 
proportions. The result is the standard error of p\1—p42P1 ~ P2: 


SEp41—p42=p41(1-p41)n1+p42(1-p2)n2 


sm 24 = {210-?1) | F2-P3) 
1 ~P2 \ ny ng 


This value estimates how much the difference in sample proportions will 
typically vary from the difference in the true proportions if we repeat the 
random sampling or random assignment many times. 

When the Large Counts condition is met, we find the critical value z* 
2° for the given confidence level using ‘Table A or technology. Our 
confidence interval for p1—p2?!1 ~ P2 is therefore 
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statistic+(critical value) ‘(standard deviation of statistic) 
statistic + (critical value) - (standard deviation of statistic) 


=(p\1-p2)+z*p1(1-p%1)n1+p2(1—p/2)n2 


'p1(1-B1) ba (1-B2) 
=f ip jae | PL 1) , Pa(1~P9) 
\ ny ng 


This is often called a two-sample z interval for a difference between two 
proportions. 


TWO-SAMPLE z INTERVAL FOR A DIFFERENCE 
BETWEEN TWO PROPORTIONS 


When the conditions are met, a C% confidence interval for p1—p2 
Pi — P2 is 


(p\1—p2)+z*pM1(1-pA1)n1+pA2(1—pA2)n2 
[P1(1-P1) | P2 (1-2) 
ny nL 


(Pp, —po) +2" 


where z*2° is the critical value for the standard Normal curve with C% 
C% of the area between —z*~2" and z*2*. 


Let’s return to the brand names example. Recall that Harris Interactive 
took independent random samples of 2309 U.S. adults and 1058 German 
adults, and found that pAU=0.26Pu = 0.26 and pAG=0.22P 6 = 0.22 We 
already confirmed that the conditions are met. A 95% confidence interval 
for pU-pGPU ~ PG is 

(0.26—-0.22)+1.960.26(1—-0.26)2309+0.22(1-0.22)1058 
O26-—ormy erg | a, Oe) 
\ ~~ 2309 1058 
=0.04+0.03 = 0.04 + 0.03 


=(0.01, 0.07) = (0.01, 0.07) 


Interpretation: We are 95% confident that the interval from 0.01 to 0.07 
captures pU-pGPU ~ PG= the difference in the true proportions of all U.S. 
adults and all German adults who think brand names are important when 
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buying clothes. 


Note that the confidence interval does not include 0 as a plausible 
value for pU-pGPU ~ PG, so we have convincing evidence of a difference 
between the population proportions. In fact, it is believable that pU-pG 
PU ~ PG has any value between 0.01 and 0.07. We can restate this in 
context as follows: The interval suggests that the importance of brand 
names when buying clothes for U.S. adults is between 1 and 7 percentage 
points higher than for German adults. 


It would not be correct to say that the importance of brand names when 
buying clothes for U.S. adults is between 1 and 7 percent higher than for 
German adults. To see why, suppose that pU=0.25Pu = 9.25 and pG=0.20 
Pe = 0.20. The difference pU-pG=0.25-0.20=0.05 
pu — pe = 0.25 — 0.20 = 0.05 or 5 percentage points. But the proportion 
of U.S. adults who think brand names are important when buying clothes 
is 0.05/0.20=0.250-05/0.20 = 0.25 or 25% higher than the corresponding 
proportion of German adults. 


The researchers in the preceding example selected independent random 
samples from the two populations they wanted to compare. In practice, it’s 
common to take one random sample that includes individuals from both 
populations of interest and then to separate the chosen individuals into two 
groups. For instance, a polling company may randomly select 1000 U.S. 
adults, then separate the Republicans from the Democrats to estimate the 
difference in the proportion of all people in each party who favor the death 
penalty. The two-sample z procedures for comparing proportions are still 
valid in such situations, provided that the two groups can be viewed as 
independent samples from their respective populations of interest. 


INFERENCE FOR EXPERIMENTS So far, we have focused on doing 
inference using data that were produced by random sampling. However, 
many important statistical results come from randomized comparative 
experiments. Defining the parameters in experimental settings is more 
challenging. 

The “Who likes tattoos?” activity on page 619 describes an experiment 
that used 60 mall shoppers as subjects. Sarah and Miranda randomly 
assigned half the subjects to be asked, “Do you like tattoos?” by an 
interviewer wearing a tank top with tattoos showing, and the other half to 
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be asked the same question by the same interviewer wearing long sleeves 
that hid the tattoos. Then Sarah and Miranda compared the proportions of 
people in the two groups who said “Yes.” The parameters in this setting 
are 


p1=P1 ~ the true proportion of people like these who would say 
they like tattoos when asked by the interviewer with tattoos 
showing 


p2=P2 ~ the true proportion of people like these who would say 
they like tattoos when asked by the interviewer with tattoos 
hidden 


Most experiments on people use recruited volunteers as subjects. When 
subjects are not randomly selected, researchers cannot generalize the 
results of an experiment to some larger populations of interest. But 
researchers can draw cause-and-effect conclusions that apply to people 
like those who took part in the experiment. This same logic applies to 
experiments on animals or things. Also, unless the experimental units are 
randomly selected without replacement, we don’t have to check the 10% 
condition when performing inference about an experiment. 


The following example shows how to construct and interpret a 
confidence interval for a difference in proportions. As usual with inference 
problems, we follow the four-step process. 


EXAMPLE | Treating lower back pain (>) 


Confidence interval for p; — p> 
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Syda Productions/Shutterstock 


PROBLEM: Patients with lower back pain are often given nonsteroidal 
anti-inflammatory drugs (NSAIDs) like naproxen to help ease their pain. 
Researchers wondered if taking Valium along with the naproxen would 
affect pain relief. To find out, they recruited 112 patients with severe 
lower back pain and randomly assigned them to one of two treatments: 
naproxen and Valium or naproxen and placebo. After 1 week, 39 of the 
57 subjects who took naproxen and Valium reported reduced lower back 
pain, compared with 43 of the 55 subjects in the naproxen and placebo 
group.2 


a. Construct and interpret a 99% confidence interval for the difference 
in the proportion of patients like these who would report reduced 
lower back pain after taking naproxen and Valium versus after taking 
naproxen and placebo for a week. 


b. Based on the confidence interval in part (a), what conclusion would 
you make about whether taking Valium along with naproxen affects 
pain relief? Justify your answer. 


SOLUTION: 


a. STATE: 99% CI for p1 — p2?1 ~ P2, where p1 =?! ~ true proportion 
of patients like these who would report reduced lower back pain after 
taking naproxen and Valium for a week and p2 =P2 ~ true 
proportion of patients like these who would report reduced lower 
back pain after taking naproxen and placebo for a week. 
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Be sure to indicate the order of subtraction when defining the parameter. 


Then you can just mimic the wording in your conclusion. 


PLAN: Two-sample z interval for p1 — p2P1 ~ P2 


e Random: Randomly assigned patients to take naproxen and Valium or 
naproxen and placebo. / 


e Large Counts: 39, 57 — 39 =1897 — 39 = 18 43, and 55 - 43 = 12 
55 — 43 = 12 are all>10all > 10. ¥ 


We don’t have to check the 10% condition because researchers did not 


sample patients without replacement from a larger population. 


Se an ca WG 
DO: p41 = 3957 = 0.63471 = 57 0.684 


, pA2 = 4355 = 0.782 
(0.684 — 0.782) + 2.5760.684(0.316)57+0.782(0.218)55 


: es : , | 0.684(0.316) 0.782(0.218) 
(0.684 — 0.782) + 2.576,/-—————> + ———— 


= —0.098+0.214 = — 0.098 + 0.214 
= ( -0.312, 0.116)= ( — 9-312, 0.116) 


(p41—p’2)+z*p1(1—-—p41)n1+pA2(1-p’2)n2 
[Pi (1-61) _ P2(1-P2) 
| ng 


Refer to the Technology Corner that follows the example. The 
calculator’s 2-PropZInt gives (—0.3114, 0.1162). —0.3114, 0.1162). 


CONCLUDE: We are 99% confident that the interval from —0.312 to 
0.116 captures pl — p2 =P1 ~ P2 ~ the difference in the true proportions 
of patients like these who would report reduced pain after taking 
naproxen and Valium versus after taking naproxen and a placebo for a 
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week. 


b. Because the interval includes 0 as a plausible value for p1 — p2 
P1 ~ P2, we don’t have convincing evidence that taking Valium 
along with naproxen affects pain relief for patients like these. 


The interval suggests that the true proportion of patients like these who 
would report reduced pain after taking naproxen and Valium is between 


31.2 percentage points lower and 11.6 percentage points higher than after 
taking naproxen and placebo. 


FOR PRACTICE, TRY EXERCISE 9 


We could have subtracted the proportions in the opposite order in part 


(a) of the example. The resulting 99% confidence interval for p2—p1 
P2 — P1 is 


(0.782—0.684)+2.5760.782(0.218)55+0.684(0.316)57 

(0.782 — 0.684) + 2.576 /9.782(0.218) | 0.684(0.316) aciedlachiea 
\ 55 57 

=0,098+0.214 = 0.098 + 0.214 


=(-0.116, 0.312)— (—0.116, 0.312) 


Notice that the endpoints of the interval have the same values but opposite 
signs to the ones in the example. This interval suggests that the true 
proportion of patients like these who would report reduced pain after 
taking naproxen and placebo is between 11.6 percentage points lower and 
31.2 percentage points higher than after taking naproxen and Valium. 
That’s equivalent to our interpretation of the confidence interval for p1—p2 
Pi ~ P2 in part (a) of the example. 

The fact that 0 is included in a confidence interval for p1—p2P1 ~ P2 
means that we don’t have convincing evidence of a difference between the 
true proportions. Keep in mind that 0 is just one of many plausible values 
for p1—p2?1 ~ P2 based on the sample data. o@ Never suggest that you 
believe the difference between the true proportions is 0 just because 0 
is in the interval! 
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You can use technology to perform the calculations in the “Do” step. 
Remember that this comes with potential benefits and risks on the AP® 
Statistics exam. 


23. Technology Corner |CONSTRUCTING A 
CONFIDENCE INTERVAL FOR A DIFFERENCE IN 
PROPORTIONS (>) 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


The TI-83/84 can be used to construct a confidence interval for p1—p2 
P1 ~ P2, We’|l demonstrate using the preceding example. Of n1=57 

m4 = 957 subjects who took naproxen and Valium, X1=39%1 = 39 
reported reduced lower back pain after a week. Of n2=55"2 = 99 
subjects who took naproxen and placebo, X2=43%X2 = 43 reported 
reduced lower back pain after a week. To construct a confidence 
interval: 


e Press [STAT], then choose TESTS and 2-PropZInt. 


e When the 2-PropZInt screen appears, enterr the values shown. Note 
that the values of x;, 7, X7, and ny must be integers. 


e Highlight “Calculate” and press [ENTER |. 
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NORMAL FLOAT AUTO REAL RADIAN MP fl 


2-PropZIint 


n2:55 
C-Level:8.99 
Calculate 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


2-PropZiInt 
(-@.3114.0.11623) 
6’1=0. 6842105263 
b2=0. 7818181818 
n1i=57 
n2=55 


AP® EXAM TIP 


The formula for the two-sample z interval for pl—p2?1 ~ P2 often leads 
to calculation errors by students. As a result, your teacher may 
recommend using the calculator’s 2-PropZInt feature to compute the 
confidence interval on the AP® Statistics exam. Be sure to name the 
procedure (two-sample z interval for p1—p2?1 ~ P2) in the “Plan” step 
and give the interval (—0.311, 0.116)(—9-311, 0.116) in the “Do” step. 


CHECK YOUR UNDERSTANDING 


A Pew Research Center poll asked independent random samples of 
working women and men how much they value job security. Of the 806 
women, 709 said job security was very or extremely important, 
compared with 802 of the 944 men surveyed. Construct and interpret a 
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95% confidence interval for the difference in the proportion of all 
working women and men who consider job security very or extremely 
important. 


Significance Tests for p1 — p2?: — P2 


An observed difference between two sample proportions can reflect an 
actual difference in the parameters, or it may just be due to chance 
variation in random sampling or random assignment. Significance tests 
help us decide which explanation makes more sense. 


STATING HYPOTHESES AND CHECKING CONDITIONS In a test 
for comparing two proportions, the null hypothesis has the general form 


HO: p1—p2=hypothesized value/0: Pi — p2 = hypothesized value 


We’re often interested in situations in which the hypothesized difference is 
0. Then the null hypothesis says that there is no difference between the two 
parameters: 


HO: p1—p2=04o: pi — p2 = 0 


(You will sometimes see the null hypothesis written in the equivalent form 
HO: p1=p2/0: Pi = P2.) The alternative hypothesis says what kind of 
difference we expect. 

The conditions for performing a significance test about p1—p2P!1 ~ P2 
are the same as for constructing a confidence interval. 


CONDITIONS FOR PERFORMING A SIGNIFICANCE 
TEST ABOUT A DIFFERENCE BETWEEN TWO 
PROPORTIONS 


e Random: The data come from two independent random samples or 
from two groups in a randomized experiment. 


© 10%: When sampling without replacement, n1<0.10N1 
n, <0.10N) and n2<0.10N2%2 < 0-10N2. 


e Large Counts: The counts of “successes” and “failures” in each 
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sample or group—n1pA1"1P1, n1(1—-p1)™ (1—p,) n2pA2"2P2 , 
n2(1—pA2)"2 (1 — P2)__are all at least 10. 


Here’s an example that illustrates how to state hypotheses and check 
conditions. 


EXAMPLE | Hungry children » 
Stating hypotheses and checking conditions 


Alamy 


PROBLEM: Researchers designed a study to compare the proportion 
of children who come to school without eating breakfast in two low- 
income elementary schools. An SRS of 80 students from School 1 found 
that 19 missed breakfast today. At School 2, an SRS of 150 students 
included 26 who missed breakfast today. More than 1500 students 
attend each school. Do these data give convincing evidence of a 
difference in the population proportions at the a=0.05@ = 0.05 
significance level? 


a. State appropriate hypotheses for performing a significance test. Be 
sure to define the parameters of interest. 


b. Check if the conditions for performing the test are met. 
SOLUTION: 


a. HO: p1 — p2 = 0/40 : Pi — p2 =0 
Ha: p1 - p2He : pi — p2 207% 9 
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where p1 =?! ~ the true proportion of all students at School 1 who 
missed breakfast today and p2 =?2 ~ the true proportion of all 
students at School 2 who missed breakfast today. 


You can also state the hypotheses as 


HO: p1=p2 40 : pi = p2 


Ha: p1#p2 Ha : pi # P2 


e Random? Independent random samples of 80 students from 
School 1 and 150 students from School 2. / 


o 10%: 80<1080 < 10% of students at School 1; 150<10150 < 10 
% of students at School 2. “ 


e Large Counts? 19, 80 - 19 = 6180 — 19 = 61, 26, and 150 — 26 = 
124150 — 26 = 124 are all 102 10. ¥ 


The numbers of successes (missed breakfast!) and failures (ate breakfast) 


in both samples are at least 10. 


FOR PRACTICE, TRY EXERCISE 15 


CALCULATIONS: STANDARDIZED TEST STATISTIC AND P- 
VALUE If the conditions are met, we can proceed with calculations. To 
do a test of HO: p1-p2=0/0: pi — p2 = 9, standardize p\1—-pA2P1 ~ P2 to 
get a Z Statistic: 
standardized test statistic=statistic-parameterstandard deviation of statistic 
; Bw statistic—parameter 
standardized test statistic = 
standard deviation of statistic 
(pj —P2)—0 
piri), Pal=Pa) 
z=(p1-p/2)—-0p1(1-p1)n1+p2(1-p2)n2 yoo n2 
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With a little factoring, we can rewrite the denominator of the standardized test 


Ly 2 
VPC Bl a ne ee 
Statistic as p(1—p)1n1+1n2 “ . You’Il find this expression on 
the AP® Statistics exam formula sheet under the heading “Standard Deviation of 


Statistic” for a difference of sample proportions in the special case when p1=p2 
Pl = P2, 


You may be tempted to replace p1?! and p2P2 in the denominator with 
the corresponding sample proportions pA1P1 and pA2P 2. Don’t do it! 
Here’s why. 

A significance test begins by assuming that HO: p1—p2=0 
Ho: pi — P2 = 9 is true. In that case, p1=p2?1 = P2. We call the common 
value of these two parameters p?. Substituting this value in place of both 
p1P1 and p2P2 in the denominator of the standardized test statistic yields 


___ (1 -P2)—0 
|PQ—-p) | pip) 
2=(pM-p\2)-Op(1-pynt+p(1-pn2 «V4 "2 


Unfortunately, we don’t know the common value of p. To estimate p, 
we combine (or “pool”) the data from the two samples as if they came 
from one larger sample. This combined sample proportion is 


p4=number of successes in both samples combinednumber of individuals in 
both samples combined=X1+X2n1+n2 
. _ number ofsuccessesin bothsamplescombined _ X4+Xo 
= number of individuals in both samples combined — ny+ng 
In other words, pp gives the overall proportion of successes in the 
combined samples. Use p/? in place of both p1?1 and p2P2 in the 
denominator of the standardized test statistic: 


Some people prefer to use p‘P to check the Large Counts condition because this is 


consistent with assuming that H0: p1-p2=04 0: Pi — Pp = 0 is true. If the expected 
counts nlpA%P, n1(1-p)™ (1— P) n2pAN2P , and n2(1-p)'?2 (2) neal 


at least 10, the sampling distribution of p\1-p\2?1 ~ P2 is approximately Normal. 
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_ (p1 —p2)—0 
[p(1—p) p(1—p) 

+ 4 
2=(p\1-p’2)—OpA(1—p\)n1+p\(1-p)n2 Vom "2 


When the Large Counts condition is met, this z statistic will have 
approximately the standard Normal distribution. We can find the 
appropriate P-value using Table A or technology. 


EXAMPLE | Calculating the standardized test statistic 
and P-value |») 
Who eats breakfast? 


Yoshiyoshi Hirokawa/Shutterstock 


PROBLEM: Refer to the preceding example. The two-way table 
summarizes the data from the independent random samples of children 
at School 1 and School 2. We already confirmed that the conditions for 
performing a significance test are met. 


School 
1 2 Total 
No 45 
Breakfast? 
Yes 185 
Total 80 150 230 


a. Explain why the sample results give some evidence for the alternative 
hypothesis. 
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b. Calculate the standardized test statistic and P-value. 


c. What conclusion would you make using a= 0.050 = 0.05? 
SOLUTION: 


a. The observed difference in the sample proportions is p41 — pA2 = 
1980 — 26150 = 0.2375 — 0.1733 = 0.0642 


a “ 19 26 ? a, “f 
Pi P22 =F i = 0.2375 itso = 0.0642) Relneh gives 


some evidence in favor of Ha: p1 — p270/a : Pi — P2 #0 because 
0.0642#0.9-0642 # 0. 


The rightmost column in the table above makes it easy to see that the 
overall (combined) proportion of successes in the two samples is 


pr=45/230P = 45/230, 


p= 12 _ 4 _ 9.196 
b. p= 19+2680+150 = 45230 =0.196 ~ s0+i50 ~ 230 ~ © 


z = (0.2375 — 0.1733) — 00.196(0.804)80+0.196(0.804)150 = 
0.06420.055 = 1.17 


- (0.2375 — 0.1733) — 0 _ 0.0642 _ | 
” (0.196(0.804) 0.196(0.804) —*9.055 . 
80 +—T50 


Using Table A:P(z< -1.17 or z21.17)P (2 S — 1.170rz > 1.17) — 
2(0.1210) = 2(9-1210)—9 24990= 0.2420 


Z—(p'l sp) = 0p. =p nip — p*)n2 
(p1 —B2) —0 


—— 


|p (1—B) | (1-5) 
\ ny no 


Using technology: normalcdf(lower: 1.17, upper: 1000, mean: 0, SD: 
1) x 2=0.24202 = 0.2420 
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. Because the P-value of 0.2420>a = 0.050.2420 > a = 0.05, we fail 
to reject HO0. There is not convincing evidence of a difference in 
the true proportions of students at School 1 and School 2 who 


skipped breakfast today. 


Standard 
Normal curve 


Z=~1.9F Z=1.1F 


FOR PRACTICE, TRY EXERCISE 19 


The two-sample z test and two-sample z interval for the difference between two 


proportions don’t always give consistent results. That’s because the “standard 
deviation of the statistic” used in calculating the standardized test statistic is 
or a er 
/p(l—p) _ p(1—p) 
ae 
no 


} 
pA(L-pa)ntsprl-payn2) 


but for the confidence interval, it’s 


j——— 
pA1(1-p‘41)n1+p42(1-p/2)n2 \ | no 


What does the P-value in the example tell us? If there is no difference 
in the population proportions of students who missed breakfast today at the 
two schools and we repeated the random sampling process many times, 
we'd get a difference in sample proportions as large as or larger than 
0.0642 in either direction about 24% of the time. With such a high 
probability of getting a result like this just by chance when the null 
hypothesis is true, we don’t have enough evidence to reject HO. 

We can get additional information about the difference between the 
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population proportions who missed breakfast today at School 1 and School 
2 with a confidence interval. Technology gives the 95% confidence 
interval for p1—p2P1 — P2 as -0.047—9-047 to 0.175. That is, we are 95% 
confident that the difference in the true proportions of students who missed 
breakfast at the two schools is between 4.7 percentage points lower at 
School 1 and 17.5 percentage points higher at School 1. This is consistent 
with our “fail to reject HOM” conclusion because 0 is included in the 
interval of plausible values for p1—p2P1 ~ P2., 


24. Technology Corner |PERFORMING A 
SIGNIFICANCE TEST FOR A DIFFERENCE IN 
PROPORTIONS (>) 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


The TI-83/84 can be used to perform significance tests for comparing 
two proportions. Here, we use the data from the hungry children 
example. To perform a test of HO: p1—p2=0/0 : pi — P2 = 9 versus 
Ha: p1—p220:4a : pi — po #0: 

e Press|STAT|, then choose TESTS and 2-PropZTest. 


e When the 2-PropZTest screen appears, enter the values shown. 
Specify the alternative hypothesis p1#p2P1 7 P2., 


e If you select “Calculate” and press [ENTER], you will see that the 
standardized test statistic is z=1.1682 = 1.168 and the P-value is 
0.2427. Do you see the combined proportion of students who didn’t 
eat breakfast? It’s the value labeled p/P, 0.1957. 


e If you select the “Draw” option, you will see the screen shown here. 
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NORMAL FLOAT AUTO REAL RADIAN MP fl 


2-PropZTest 


x1:19 

ni:8@ 

x2:26 

n2:158 

P1:EEW <p2 >p2 
Color: aU 


Calculate Draw 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


2-PropZTest 
P1i*P2 
z=1.168347138 
P=0. 2426668816 
pi1=0.2375 
6 2=@. 1733333333 
b=0. 1956521739 
n1=80 
n2=150 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


AP® EXAM TIP 


The formula for the two-sample z statistic for a test about p1—p2P1 ~ P2 
often leads to calculation errors by students. As a result, your teacher 
may recommend using the calculator’s 2-PropZTest feature to perform 
calculations on the AP® Statistics exam. Be sure to name the procedure 
(two-sample z test for p1—p2P1 ~ P2) in the “Plan” step and report the 
standardized test statistic (z=1.17) (2 = 1.17) and P-value (0.2427) in 
the “Do” step. 
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Putting It All Together: Two-Sample z Test for p1 — p2 
Pi —P2 


Here is a summary of the details for the two-sample z test for the 
difference between two proportions. 


TWO-SAMPLE z TEST FOR THE DIFFERENCE 
BETWEEN TWO PROPORTIONS 


Suppose the conditions are met. To test the hypothesis HO: p1—p2=0 
Ho : pi ~ P2 = ©) first find the pooled proportion p/? of successes in 
both samples combined. Then compute the standardized test statistic 


(bj —b2)—0 


‘ a aes a Pe 
\ oom Z p(1—p) 

2=(p1—p'2)-Op*(1-p)n1+p\(1-pA)n2 ml ma 
Find the P-value by calculating the probability of getting a z statistic 


this large or larger in the direction specified by the alternative 
hypothesis Ha/« in a standard Normal distribution. 


As with any test, be sure to follow the four-step process. 


EXAMPLE | Cholesterol and heart attacks » 


Performing a significance test about p; — po 
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Science Photo Library/Getty Images 


PROBLEM: High levels of cholesterol in the blood are associated with 
a higher risk of heart attacks. Will using a drug to lower blood 
cholesterol reduce heart attacks? The Helsinki Heart Study recruited 
middle-aged men with high cholesterol but no history of other serious 
medical problems to investigate this question. The volunteer subjects 
were assigned at random to one of two treatments: 2051 men took the 
drug gemfibrozil to reduce their cholesterol levels, and a control group 
of 2030 men took a placebo. During the next five years, 56 men in the 
gemfibrozil group and 84 men in the placebo group had heart attacks. 


a. Do the results of this study give convincing evidence at the a=0.01 
a = 0.01 significance level that gemfibrozil is effective in preventing 
heart attacks? 


b. Interpret the P-value you got in part (a) in the context of this 
experiment. 


SOLUTION: 


a. STATE: We want to test 
HO: pG - pc = 0/0 > Po —- po = 0 
Ha: pG — pC<0Ha : pe — pc < 0 


where pG =PG ~ the true heart attack rate for middle-aged men like 
these who take gemfibrozil and pC =Pc ~ the true heart attack rate for 
middle-aged men like these who take a placebo using a=0.01e = 0.01. 
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PLAN: Two-sample z test for pG — pCP¢ ~ Pc 


e Random: Volunteer subjects were randomly assigned to gemfibrozil 
or placebo. / 


e Large Counts: 56, 2051 — 56 = 1995, 84, and 2030 — 84 = 1946 are 
all> 10.4% 


Note that we do not have to check the 10% condition because the 


subjects in the experiment were not sampled without replacement from 


some larger population. 


The numbers of successes (heart attacks!) and failures (no heart attacks) 
in both samples are at least 10. 


DO: 
* PAG = 562051 = 0. 02736 = 051 = 9-0273 nc = 842030 = 


0.0414PC = aoa5 = 90-0414 


The sample result gives some evidence in favor of Hg: py — p2 < 0 


because 0.0273 — 0.0414 = -0.0141 < 0. 


pA = 56+842051+2030 = 1404081 = 0.0343 


_ 56+84 =, PG 2 
p = 2051+2030 4081 = 0.0343 


= (0.0273 — 0.0414) — 00.0343(0.9657)2051+0.0343(0.9657)2030 


(0.0273 — 0.0414) — 0 


& 


/ 0.0343(0.9657) ; 0.0343(0.9657) 
2061 2000 = —0.01410.0057 = -2.47 
= 0 = -247 


0.0057 


e P-value 
Using Table A:P(z< -2.47) = 0.00687 (2 = — 2.47) = 0.0068 


Using technology: normalcdf(lower: -1000 ~ 1900, upper: -2.47~—2-47 
, mean:0, SD:1)=0.0068= 0.0068 

CONCLUDE: Because our P-value of 0.0068<a = 0.01 

0.0068 < a = 0.01, we reject Hoo. There is convincing evidence of a 
lower heart attack rate for middle-aged men like these who take 
gemfibrozil than for those who take only a placebo. 

(b) Assuming HO: p1— p2=0o : pi — P2 = 9 is true, there is a 
0.0068 probability of getting a difference (Gemfibrozil — Placebo) in 
heart attack rate for the two groups of —0.0141~—9-9141 or less just by 
the chance involved in the random assignment. 


Standard 
Normal curve 


FOR PRACTICE, TRY EXERCISE 21 


We chose a=0.01a = 0.01 in the example to reduce the chance of 
making a Type I error—finding convincing evidence that gemfibrozil 
reduces heart attack risk when it really doesn’t. This error could have 
serious consequences if an ineffective drug was given to lots of middle- 
aged men with high cholesterol! 

The random assignment in the Helsinki Heart Study allowed 
researchers to draw a cause-and-effect conclusion. They could say that 
gemfibrozil reduces the rate of heart attacks for middle-aged men like 
those who took part in the experiment. Because the subjects were not 
randomly selected from a larger population, researchers could not 
generalize the findings of this study any further. No conclusions could be 
drawn about the effectiveness of gemfibrozil at preventing heart attacks for 
all middle-aged men, for men of other ages, or for women. 
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Think About It 


WHY DO THE INFERENCE METHODS FOR RANDOM 
SAMPLING WORK FOR RANDOMIZED EXPERIMENTS? 
Confidence intervals and tests for p1—p2?1 ~ P2 are based on the 
sampling distribution of p\1-p2P1 ~ P2, But in most experiments, 
researchers don’t select subjects at random from any larger populations. 
They do randomly assign subjects to treatments. We can think about 
what would happen if the random assignment were repeated many times 
under the assumption that HO: p1—p2=00 : Pi — P2 = 9 is true. That is, 
we assume that the specific treatment received doesn’t affect an 
individual subject’s response. 

Let’s see what would happen just by chance if we randomly 
reassign the 4081 subjects in the Helsinki Heart Study to the two groups 
many times, assuming the drug received doesn’t affect whether or not 
each individual has a heart attack. We used software to redo the random 
assignment 500 times. Figure 10.3 shows the value of p\G—pAC 
Pa ~ Pc in the 500 simulated trials. This distribution (sometimes 
referred to as the randomization distribution of pAG-pACPa Pc ) has 
an approximately Normal shape with mean 0 and standard deviation 
0.0058. This matches well with the distribution we used to perform 
calculations in the example. Because the Large Counts condition was 
met and we assumed that HO: pG—pC=040: pe — pc = 9 is true, we 
used for the standard deviation of the statistic. 


In 500 random 
reassignments, there 
were only 5 times when 
the difference in sample 


proportions was as 

small as or smaller than 

| the | ed — 0.0141. | 
Tae li. 


UU a, ~0.015 -0.010 -0.005 0.005 0.010 0.015 0.020 


Shape: Approximately Normal 
Center: Mean = 0 
Variability: SD = 0.0058 


PB, - B- = simulated difference in sample proportions of subjects 
who have heart attacks 


FIGURE 10.3 Dotplot of the values of p\G-—pAC 
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‘aie 8, 


from each of 500 simulated random reassignments of subjects to treatment groups 


in the Helsinki Heart Study, assuming no treatment effect. 


p\(1-p)n1+p\(1-p)n2=0.0343(1-0.0343)2051+0.0343(1-0.0343)2030 
[pG—b) pU-p)  [0.0343(1—0.0343) _0.0343(1—0.0343) 
\ “a ° wa 2051 2030 
=0.0057= 0.0057 


In the Helsinki Heart Study, the difference in the proportions of 
subjects who had a heart attack in the gemfibrozil and placebo groups 
was 0.0273-0.0414=0-0273 — 0.0414 =—9 .9141—0.0141, How likely is 
it that a difference this large or larger would happen just by chance 
when H0/Z0 is true? Figure 10.3 provides a rough answer: 5 of the 500 
random reassignments yielded a difference in proportions less than or 
equal to —0.0141—9-0141, That is, our estimate of the P-value is 0.01. 
This is quite close to the 0.0068 P-value that we calculated in the 
preceding example. Neither of these values is the exact P-value. To get 
the exact P-value in an experiment, you would have to consider the 
difference in sample proportions for all possible random assignments. 
This approach is called a permutation test. 


CHECK YOUR UNDERSTANDING 


To study the long-term effects of preschool programs for poor 
children, researchers designed an experiment. They recruited 123 
children who had never attended preschool from low-income families in 
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Michigan. Researchers randomly assigned 62 of the children to attend 
preschool (paid for by the study budget) and the other 61 to serve as a 
control group who would not go to preschool. One response variable of 
interest was the need for social services as adults. Over a 10-year period, 
38 children in the preschool group and 49 in the control group have 


needed social services.= 


iy 


4 


Do these data provide convincing evidence that preschool reduces the 
later need for social services for children like the ones in this study? 
Justify your answer. 


. Based on your conclusion to Question 1, could you have made a Type 


I error or a Type II error? Explain your reasoning. 


. Should you generalize the result in Question 1 to all children from 


low-income families who have never attended preschool? Why or 
why not? 


Section 10.1 Summary 


Choose independent SRSs of size n1™1 from Population 1 with 
proportion of successes p1?1 and of size n2’2 from Population 2 with 
proportion of successes p2?2. The sampling distribution of p1—p/2 
P: — P2 has the following properties: 
« Shape: Approximately Normal if the Large Counts condition is met: 
nip1,n1(1—-p1), n2p2™1P1> ™ (1—pi), 2P2 and n2(1-p2) 
nz (1 — 2) are all at least 10. 


= Center: The mean is up\1—p\2=p1-p2"r: a aa ot 
« Variability: The standard deviation is 
op/1-pA2=p1(1-p1)n1+p2(1-p2)n2 P1—-P2 V nl n2 as 


long as the 10% condition is met when sampling without replacement: 
n1<0.10N1% < 9-101 and n2<0.10N2"2 < 9.10N2. 


Confidence intervals and significance tests to compare the proportions 
p1P1 and p2P2 of successes for two populations or treatments are based 
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on the difference pA1—pA2P1 P2 between the sample proportions. 


Before estimating or testing a claim about pl—p2?! ~ P2, check that 
these conditions are met: 


= Random: The data come from two independent random samples or 
from two groups in a randomized experiment. 
10%: When sampling without replacement, n1<0.10N1 
n, <0.10N) and n2<0.10N2"2 < 0.10N2_ 


: Large Counts: The counts of “successes” and “failures” in each 
sample or group—n1p\11P 1 ,ni(i-pA1)™ (1— pi), n2pA2"2P2 , 
n2(1—p/2)"2 (1 — P2)__are all at least 10. 

When conditions are met, a C% confidence interval for p1—p2P! ~ P2 is 

(p41—p2)+z*p41(1-p’1)n1+pA2(1-p’2)n2 


- 2 y , » |P1(l-b1) . bo (1-B9) 
(p) — Po) +2"4| a Van ae 


where z*2° is the standard Normal critical value with C% of its area 


between —Z* Zz" and DZ j This is called a two-sam le Z interval for 
p 
pl _ p2P1 =a. 


Significance tests of HO: p1—p2=0/0 : Pi — P2 = 9 use the combined 
(pooled) sample proportionp’? in the formula for the standard 
deviation of the statistic: 


p/=count of successes in both samples combinedcount of individuals in 
both samples combined=X1+X2n1+n2 
.. count ofsuccesses in bothsamplescombined | X1+X9 
= count of individuals in both samples combined — n1+ng 
When conditions are met, the two-sample z test for p1 — p2?1 ~ P2 
uses the standardized test statistic 
| (61 —P2)—0 


[pQ—p)  p(1—p) 
———— 
z=(p\1-p42)-Op\(1-p’)n1+p\(1-p’)n2 \ ny ny 


with P-values calculated from the standard Normal distribution. 


Be sure to follow the four-step process whenever you construct a 
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confidence interval or perform a significance test for comparing two 


proportions. 


10.1 Technology Corners (> 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


23. Constructing a confidence interval for a difference in 


proportions Page 629 


24. Performing a significance test for a difference in 


proportions Page 633 


Section 10.1 Exercises 


1. pg 624 |») I want red! A candy maker offers Child and Adult bags 
of jelly beans with different color mixes. The company claims that the 
Child mix has 30% red jelly beans, while the Adult mix contains 15% 
red jelly beans. Assume that the candy maker’s claim is true. Suppose 
we take a random sample of 50 jelly beans from the Child mix and a 
separate random sample of 100 jelly beans from the Adult mix. Let 
pAcPc ' and pAAPA be the sample proportions of red jelly beans from 
the Child and Adult mixes, respectively. 


a. What is the shape of the sampling distribution of p\C-pAA 
Po ~ Pa? Why? 

b. Find the mean of the sampling distribution. 

c. Calculate and interpret the standard deviation of the sampling 
distribution. 


2. Literacy A researcher reports that 80% of high school graduates, but 
only 40% of high school dropouts, would pass a basic literacy test.2 
Assume that the researcher’s claim is true. Suppose we give a basic 
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literacy test to a random sample of 60 high school graduates and a 


separate random sample of 75 high school dropouts. Let pAGPa and 


pADP D be the sample proportions of graduates and dropouts, 
respectively, who pass the test. 


a. 


What is the shape of the sampling distribution of p\G-—p4D 
Pc — Pp? Why? 


Find the mean of the sampling distribution. 


Calculate and interpret the standard deviation of the sampling 
distribution. 


I want red! Refer to Exercise 1. 


a. 


Find the probability that the proportion of red jelly beans in the 
Child sample is less than or equal to the proportion of red jelly 
beans in the Adult sample, assuming that the company’s claim is 
true. 


Suppose that the Child and Adult samples contain an equal 
proportion of red jelly beans. Based on your result in part (a), 
would this give you reason to doubt the company’s claim? Explain 
your reasoning. 


Literacy Refer to Exercise 2. 


a. 


Find the probability that the proportion of graduates who pass the 
test is at most 0.20 higher than the proportion of dropouts who 
pass, assuming that the researcher’s report is correct. 


. Suppose that the difference (Graduate — Dropout) in the sample 


proportions who pass the test is exactly 0.20. Based on your result 
in part (a), would this give you reason to doubt the researcher’s 
claim? Explain your reasoning. 


pg 625 |») Don’t drink the water! The movie A Civil Action 


(1998) tells the story of a major legal battle that took place in the 
small town of Woburn, Massachusetts. A town well that supplied 
water to east Woburn residents was contaminated by industrial 

chemicals. During the period that residents drank water from this 
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well, 16 of 414 babies born had birth defects. On the west side of 
Woburn, 3 of 228 babies born during the same time period had birth 
defects. Let p1=?1 = the true proportion of all babies born with birth 
defects in west Woburn and p2=P2 = the true proportion of all babies 
born with birth defects in east Woburn. Check if the conditions for 
calculating a confidence interval for p1—p2?1 ~ P2 are met. 


Broken crackers We don’t like to find broken crackers when we 
open the package. How can makers reduce breaking? One idea is to 
microwave the crackers for 30 seconds right after baking them. 
Randomly assign 65 newly baked crackers to the microwave and 
another 65 to a control group that is not microwaved. After 1 day, 
none of the microwave group were broken and 16 of the control 
group were broken.® Let p1=P1 = the true proportions of crackers 
like these that would break if baked in the microwave and p2=P2 — 
the true proportions of crackers like these that would break if not 
microwaved. Check if the conditions for calculating a confidence 
interval for p1—p2P1 ~ P2 are met. 


Cockroaches The pesticide diazinon is commonly used to treat 
infestations of the German cockroach, Blattella germanica. A study 
investigated the persistence of this pesticide on various types of 
surfaces. Researchers applied a 0.5% emulsion of diazinon to glass 
and plasterboard. After 14 days, they randomly assigned 72 
cockroaches to two groups of 36, placed one group on each surface, 
and recorded the number that died within 48 hours. On glass, 18 
cockroaches died, while on plasterboard, 25 died. If p1?1 and p2P2 
are the true proportions of cockroaches like these that would die 
within 48 hours on glass treated with diazinon and on plasterboard 
treated with diazinon, respectively, check if the conditions for 
calculating a confidence interval for p1—p2?! ~ P2 are met. 


Digital video disks A company that records and sells rewritable 
DVDs wants to compare the reliability of DVD fabricating machines 
produced by two different manufacturers. They randomly select 500 
DVDs produced by each fabricator and find that 484 of the disks 
produced by the first machine are acceptable and 480 of the disks 
produced by the second machine are acceptable. If p1?1 and p2?P2 are 
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10. 


11. 


the proportions of acceptable DVDs produced by the first and second 
machines, respectively, check if the conditions for calculating a 
confidence interval for p1—p2?1 ~ P2 are met. 


pg 628 |») Young adults living at home A surprising number of 


young adults (ages 19 to 25) still live in their parents’ homes. The 
National Institutes of Health surveyed independent random samples 
of 2253 men and 2629 women in this age group.2 The survey found 
that 986 of the men and 923 of the women lived with their parents. 


a. Construct and interpret a 99% confidence interval for the 
difference in the true proportions of men and women aged 19 to 25 
who live in their parents’ homes. 


b. Does your interval from part (a) give convincing evidence of a 


difference between the population proportions? Justify your 
answer. 


Where’s Egypt? In a Pew Research poll, 287 out of 522 randomly 
selected U.S. men were able to identify Egypt when it was 
highlighted on a map of the Middle East. When 520 randomly 
selected U.S. women were asked, 233 were able to do so. 


a. Construct and interpret a 95% confidence interval for the 
difference in the true proportion of U.S. men and U.S. women who 
can identify Egypt on a map. 


b. Based on your interval, is there convincing evidence of a 


difference in the true proportions of U.S. men and women who can 
identify Egypt on a map? Justify your answer. 


Response bias Does the appearance of the interviewer influence how 
people respond to a survey question? Ken (white, with blond hair) 
and Hassan (darker, with Middle Eastern features) conducted an 
experiment to address this question. They took turns (in a random 
order) walking up to people on the main street of a small town, 
identifying themselves as students from a local high school, and 
asking them, “Do you support President Obama’s decision to launch 
airstrikes in Iraq?” Of the 50 people Hassan spoke to, 11 said “Yes,” 
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12. 


13. 


14. 


while 21 of the 44 people Ken spoke to said “Yes.” Construct and 
interpret a 90% confidence interval for the difference in the 
proportion of people like these who would say they support President 
Obama’s decision when asked by Hassan versus when asked by Ken. 


Quit smoking Nicotine patches are often used to help smokers quit. 
Does giving medicine to fight depression help? A randomized 
double-blind experiment assigned 244 smokers to receive nicotine 
patches and another 245 to receive both a patch and the 
antidepressant drug bupropion. After a year, 40 subjects in the 
nicotine patch group had abstained from smoking, as had 87 in the 
patch-plus-drug group. Construct and interpret a 99% confidence 
interval for the difference in the true proportion of smokers like these 
who would abstain when using bupropion and a nicotine patch and 
the proportion who would abstain when using only a patch. 


Ban junk food! A CBS News poll asked 606 randomly selected 
women and 442 randomly selected men, “Do you think putting a 
special tax on junk food would encourage more people to lose 
weight?” 170 of the women and 102 of the men said “Yes.”8 A 99% 
confidence interval for the difference (Women — Men) in the true 
proportion of people in each population who would say “Yes” is 
—0.020—-9-020 to 0.120. Does the confidence interval provide 
convincing evidence that the two population proportions are equal? 
Explain your answer. 


Artificial trees? An association of Christmas tree growers in Indiana 
wants to know if there is a difference in preference for natural trees 
between urban and rural households. So the association sponsored a 
survey of Indiana households that had a Christmas tree last year to 
find out. In a random sample of 160 rural households, 64 had a 
natural tree. In a separate random sample of 261 urban households, 89 
had a natural tree. A 95% confidence interval for the difference 
(Rural — Urban) in the true proportion of households in each 
population that had a natural tree is —0.036~—9-936 to 0.154. Does the 
confidence interval provide convincing evidence that the two 
population proportions are equal? Explain your answer. 
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16. 
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pg 631 |») Children make choices Many new products introduced 


into the market are targeted toward children. The choice behavior of 
children with regard to new products is of particular interest to 
companies that design marketing strategies for these products. As part 
of one study, randomly selected children in different age groups were 
compared on their ability to sort new products into the correct product 
category (milk or juice).2 Here are some of the data: 


Age group N Number who sorted correctly 
4-to 5-year-olds 50 10 
6-to 7-year-olds 53 28 


Researchers want to know if a greater proportion of 6- to 7-year-olds 
can sort correctly than 4- to 5-year-olds. 


a. State appropriate hypotheses for performing a significance test. Be 
sure to define the parameters of interest. 


b. Check if the conditions for performing the test are met. 


Steroids in high school A study by the National Athletic Trainers 
Association surveyed random samples of 1679 high school freshmen 
and 1366 high school seniors in Illinois. Results showed that 34 of the 
freshmen and 24 of the seniors had used anabolic steroids. Steroids, 
which are dangerous, are sometimes used in an attempt to improve 
athletic performance.22 Researchers want to know if there is a 
difference in the proportion of all Illinois high school freshmen and 
seniors who have used anabolic steroids. 


a. State appropriate hypotheses for performing a significance test. Be 
sure to define the parameters of interest. 


b. Check if the conditions for performing the test are met. 


Shrubs and fire Fire is a serious threat to shrubs in dry climates. 
Some shrubs can resprout from their roots after their tops are 
destroyed. Researchers wondered if fire would help with resprouting. 
One study of resprouting took place in a dry area of Mexico.! The 
researchers randomly assigned shrubs to treatment and control 
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groups. They clipped the tops of all the shrubs. They then applied a 
propane torch to the stumps of the treatment group to simulate a fire. 
All 12 of the shrubs in the treatment group resprouted. Only 8 of the 
12 shrubs in the control group resprouted. 


a. State appropriate hypotheses for performing a significance test. Be 
sure to define the parameters of interest. 


b. Check if the conditions for performing the test are met. 


Ticks Lyme disease is spread in the northeastern United States by 
infected ticks. The ticks are infected mainly by feeding on mice, so 
more mice result in more infected ticks. The mouse population, in 
turn, rises and falls with the abundance of acorns, their favored food. 
Experimenters studied two similar forest areas in a year when the 
acorn crop failed. To see if mice are more likely to breed when there 
are more acorns, the researchers added hundreds of thousands of 
acorns to one area to imitate an abundant acorn crop, while leaving 
the other area untouched. The next spring, 54 of the 72 mice trapped 
in the first area were in breeding condition, versus 10 of the 17 mice 


trapped in the second area.t4 


a. State appropriate hypotheses for performing a significance test. Be 
sure to define the parameters of interest. 


b. Check if the conditions for performing the test are met. 


19. pg 632 |) Children make choices Refer to Exercise 15. 


a. Explain why the sample results give some evidence for the 
alternative hypothesis. 


b. Calculate the standardized test statistic and P-value. 


c. What conclusion would you make? 


20. Steroids in high school Refer to Exercise 16. 


a. Explain why the sample results give some evidence for the 
alternative hypothesis. 


b. Calculate the standardized test statistic and P-value. 
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22. 
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c. What conclusion would you make? 


pg 635 |») Bag lunch? Phoebe has a hunch that older students at 


her very large high school are more likely to bring a bag lunch than 
younger students because they have grown tired of cafeteria food. She 
takes a simple random sample of 80 sophomores and finds that 52 of 
them bring a bag lunch. A simple random sample of 104 seniors 
reveals that 78 of them bring a bag lunch. 


a. Do these data give convincing evidence to support Phoebe’s hunch 
at the a=0.05a = 0.05 significance level? 


b. Interpret the P-value from part (a) in the context of this study. 


Are teenagers going deaf? In a study of 3000 randomly selected 
teenagers in 1990, 450 showed some hearing loss. In a similar study 
of 1800 teenagers reported in 2010, 351 showed some hearing loss.42 


a. Do these data give convincing evidence that the proportion of all 
teens with hearing loss has increased at the a=0.010 = 0.01 
significance level? 


b. Interpret the P-value from part (a) in the context of this study. 


Preventing peanut allergies A recent study of peanut allergies—the 
LEAP trial—explored the relationship between early exposure to 
peanuts and the subsequent development of an allergy to peanuts. 
Infants (4 to 11 months old) who had shown evidence of other kinds 
of allergies were randomly assigned to one of two groups. Group 1 
consumed a baby-food form of peanut butter. Group 2 avoided peanut 
butter. At 5 years old, 10 of 307 children in the peanut-consumption 
group were allergic to peanuts, and 55 of 321 children in the peanut- 


avoidance group were allergic to peanuts.14 


a. Does this study provide convincing evidence of a difference at the 
a= 0.05 = 0.05 significance level in the development of peanut 
allergies in infants like the ones in this study who consume or 
avoid peanut butter? 


b. Based on your conclusion in part (a), which mistake— a Type I 
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error or a Type II error—could you have made? Explain your 
answer. 


c. Should you generalize the result in part (a) to all infants? Why or 
why not? 


24. Lowering bad cholesterol Which of two widely prescribed drugs— 
Lipitor or Pravachol—helps lower “bad cholesterol” more? In an 
experiment, called the PROVE-IT Study, researchers recruited about 
4000 people with heart disease as subjects. These volunteers were 
randomly assigned to one of two treatment groups: Lipitor or 
Pravachol. At the end of the study, researchers compared the 
proportion of subjects in each group who died, had a heart attack, or 
suffered other serious consequences within two years. For the 2063 
subjects using Pravachol, the proportion was 0.263. For the 2099 
subjects using Lipitor, the proportion was 0.224.142 


a. Does this study provide convincing evidence at the a=0.05 
a = 0.05 significance level of a difference in the effectiveness of 
Lipitor and Pravachol for people like the ones in this study? 


b. Based on your conclusion in part (a), which mistake—a Type I 
error or a Type II error—could you have made? Explain your 
answer. 


c. Should you generalize the result in part (a) to all people with heart 
disease? Why or why not? 


25. Preventing peanut allergies Refer to Exercise 23. 


a. Construct and interpret a 95% confidence interval for the 
difference between the true proportions. Assume that the 
conditions for inference are met. 


b. Explain how the confidence interval provides more information 
than the test in Exercise 23. 


26. Lowering bad cholesterol Refer to Exercise 24. 


a. Construct and interpret a 95% confidence interval for the 
difference between the true proportions. Assume that the 
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conditions for inference are met. 


b. Explain how the confidence interval provides more information 
than the test in Exercise 24. 


Exercises 27 and 28 involve the following setting. Some women would like 
to have children but cannot do so for medical reasons. One option for these 
women is a procedure called in vitro fertilization (IVF), which involves 
fertilizing an egg outside the woman’s body and implanting it in her 
uterus. 


27. Prayer and pregnancy Two hundred women who were about to 
undergo IVF served as subjects in an experiment. Each subject was 
randomly assigned to either a treatment group or a control group. 
Several people (called intercessors) prayed intentionally for the 
women in the treatment group, although they did not know the 
women, a process known as intercessory prayer. Prayers continued 
for 3 weeks following IVF. The intercessors did not pray for the 
women in the control group. Here are the results: 44 of the 88 women 
in the treatment group got pregnant, compared to 21 out of 81 in the 
control group.t® 
Is the pregnancy rate significantly higher for women who received 
intercessory prayer? To find out, researchers perform a test of HO: 
pl=p2Ho : pi = P2 versus Ha: p1>p2Ha : Pi > P2, where p1P1 and 
p2P2 are the actual pregnancy rates for women like those in the study 
who do and don’t receive intercessory prayer, respectively. 


a. Name the appropriate test and check that the conditions for 
carrying out this test are met. 


b. The appropriate test from part (a) yields a P-value of 0.0007. 
Interpret this P-value in context. 


c. What conclusion should researchers draw at the a=0.05a@ = 0.05 
significance level? 


d. The women in the study did not know whether they were being 
prayed for. Explain why this is important. 
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Acupuncture and pregnancy A study reported in the medical 
journal Fertility and Sterility sought to determine whether the ancient 
Chinese art of acupuncture could help infertile women become 
pregnant./Z A total of 160 healthy women who planned to have IVF 
were recruited for the study. Half of the subjects (80) were randomly 
assigned to receive acupuncture 25 minutes before implanting the 
embryo and again 25 minutes after the implant. The remaining 80 
women were assigned to a control group and instructed to lie still for 
25 minutes after the embryo transfer. Results are shown in the table. 


Acupuncture group Control group 
Pregnant 34 21 
Not pregnant 46 59 
Total 80 80 


Is the pregnancy rate significantly higher for women who received 
acupuncture? To find out, researchers perform a test of HO: p1=p2 
Ho : pi = P2 versus Ha: p1>p24o : Pi > P2, where p1P1 and p22 
are the actual pregnancy rates for women like those in the study who 
do and don’t receive acupuncture, respectively. 


a. Name the appropriate test and check that the conditions for 
carrying out this test are met. 


b. The appropriate test from part (a) yields a P-value of 0.0152. 


Interpret this P-value in context. 


c. What conclusion should researchers draw at the a=0.05a@ = 0.05 
significance level? 


d. The women in the study knew whether or not they received 


acupuncture. Explain why this is important. 


Texting and driving Does providing additional information affect 
responses to a survey question? Two statistics students decided to 
investigate this issue by asking different versions of a question about 
texting and driving. Fifty mall shoppers were divided into two groups 
of 25 at random. The first group was asked version A and the other 
half were asked version B. Here are the actual questions: 
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e Version A: A lot of people text and drive. Are you one of them? 


e Version B: About 6000 deaths occur per year due to texting and 
driving. Knowing the potential consequences, do you text and 
drive? 

Of the 25 shoppers assigned to version A, 16 admitted to texting and 

driving. Of the 25 shoppers assigned to version B, only 12 admitted to 

texting and driving. 


a. State appropriate hypotheses for performing a significance test. Be 
sure to define the parameters of interest. 


b. Explain why you should not use the methods of this section to 
calculate the P-value. 


c. We performed 100 trials of a simulation to see what differences 
(Version A-Version B) in proportions would occur due only to 
chance variation in the random assignment, assuming that the 
question asked doesn’t matter. A dotplot of the results is shown 
here. What is the estimated P-value? 


d. What conclusion would you draw? 


o e e e e 
wt e e e o e 


e 
T T I T T T T 
04 03 -O2 -0.1 0 0.1 0.2 0.3 0.4 


Simulated difference (Version A — Version B) in sample 
proportion of yes answers 


30. Botox benefits? You may have heard that Botox (botulinum toxin 
type A) is used for cosmetic surgery, but could it have other 
beneficial uses? A total of 31 patients who suffered chronic low-back 
pain were randomly assigned to receive 200 units of either Botox or 
saline solution through 5 injections at 5 different locations in their 
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backs. The saline injection was not expected to reduce pain but was 
given as a placebo treatment. Pain relief was defined as a patient’s 
pain level being reduced to less than half of the original pain level 
after 8 weeks. Here are the results: 


Botox Saline Total 
Pain relief 10 3 13 
No pain relief fs) 13 18 


Total 15 16 31 


a. State appropriate hypotheses for performing a significance test. Be 
sure to define the parameters of interest. 


b. Explain why you should not use the methods of this section to 
calculate the P-value. 


c. We performed 100 trials of a simulation to see what differences in 
proportions would occur due only to chance variation in the 
random assignment, assuming that the type of injection (Botox or 
saline) doesn’t matter. A dotplot of the results is shown here. What 
is the estimated P-value? 


d. What conclusion would you draw? 


eeceeceeoesees 

Seeeecececeesees sessecoosasesese 
eeeeeeeeeooeoe eeeeeeeoeeoes 
eeeseesoeones 


T T T T T T T T 
03 02 -0.1 0.0 0.1 0.2 0.3 0.4 0.5 


Simulated difference (Botox — Saline) in proportion of 
subjects that experience pain relief 


Multiple Choice Select the best answer for Exercises 31—34. 


Exercises 31—33 refer to the following setting. A sample survey interviews 
SRSs of 500 female college students and 550 male college students. 
Researchers want to determine whether there is a difference in the 
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proportion of male and female college students who worked for pay last 
summer. In all, 410 of the females and 484 of the males say they worked 
for pay last summer. 


31. Let pMP™ and pFPF be the proportions of all college males and 
females who worked last summer. The hypotheses to be tested are 


a. HO: pM-pF=040 : pu ~— pr = 9 versus Ha: pM-pF~0 
H, : PM PF x 0 


b. HO: pM-pF=040 : pa — pr = 9 versus Ha: pM—pF>0 
Hi, > PM PF > 0. 


c. HO: pM-pF=040 : pa — pr = 9 versus Ha: pM-pF<0 
Hy: py — pr <0. 


d. HO: pM-pF>040 : pu ~— pr > 9 versus Ha: pM-pF=0 
A, > PM PF = 0. 


e. HO: pM-pF~0/40 : pw — Pr # 9 versus Ha: pM-pF=0 
H,: pm — pr = 0. 


32. The researchers report that the results were statistically significant at 
the 1% level. Which of the following is the most appropriate 
conclusion? 


a. Because the P-value is less than 1%, fail to reject Hoo. There is 
not convincing evidence that the proportion of male college 
students in the study who worked for pay last summer is different 
from the proportion of female college students in the study who 
worked for pay last summer. 


b. Because the P-value is less than 1%, fail to reject HO/40. There is 
not convincing evidence that the proportion of all male college 
students who worked for pay last summer is different from the 
proportion of all female college students who worked for pay last 
summer. 


c. Because the P-value is less than 1%, reject HO. There is 
convincing evidence that the proportion of all male college 
students who worked for pay last summer is the same as the 
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33. 


34. 


proportion of all female college students who worked for pay last 
summer. 


d. Because the P-value is less than 1%, reject HO/40. There is 
convincing evidence that the proportion of all male college 
students in the study who worked for pay last summer is different 
from the proportion of all female college students in the study who 
worked for pay last summer. 


e. Because the P-value is less than 1%, reject HO. There is 
convincing evidence that the proportion of all male college 
students who worked for pay last summer is different from the 
proportion of all female college students who worked for pay last 
summer. 


Which of the following is the correct margin of error for a 99% 
confidence interval for the difference in the proportion of male and 
female college students who worked for pay last summer? 


a. 2.5760.851(0.149)550+0.851(0.149)500 


s /0.851(0.149) 0.851(0.149) 
2.576 Vn + 
/0.851(0.149) 
2.5764 / ——_—— 
b. 2.5760.851(0.149)1050 V 1050 


c. 2.5760.880(0.120)550+0.820(0.180)500 
} 0.880(0.120) ; 0.820(0.180) 


2.576 V 550 y 500 


d. 1.9600.851(0.149)550+0.851(0.149)500 


1.960 / 0.851(0.149) 0.851(0.149) 
ga 550 500 


e. 1.9600.880(0.120)550+0.820(0.180)500 


/ 0.880(0.120) —  0.820(0.180) 


1.960 \V 550 ' 500 


In an experiment to learn whether substance M can help restore 
memory, the brains of 20 rats were treated to damage their memories. 
First, the rats were trained to run a maze. After a day, 10 rats 
(determined at random) were given substance M and 7 of them 
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succeeded in the maze. Only 2 of the 10 control rats were successful. 
The two-sample z test for the difference in the true proportions 


a. gives 2=2.25, P<0.022 = 2.25, P < 0.02, 
b. gives z=2.60, P<0.0052 = 2-60, P < 0.005, 


c. gives 2z=2.25, P<0.04 but not<0.02 
z= 2.25, P < 0.04 but not < 0.02. 


d. should not be used because the Random condition is violated. 
e. should not be used because the Large Counts condition is violated. 
Recycle and Review 


Exercises 35 and 36 refer to the following setting. Thirty randomly 
selected seniors at Council High School were asked to report the age (in 
years) and mileage of their main vehicles. Here is a scatterplot of the data: 


200,000 | ¢ Represents 2 points P 
& 
ve 
xe e 
S 100,000 4 . ars : 
= e e 7 & 
as - 
ok ee 
0- e e 
I I I T T 
0.0 2.5 5.0 15 10.0 
Age (years) 


We used Minitab to perform a least-squares regression analysis for these 
data. Part of the computer output from this regression is shown here. 


Predictor Coef Stdev t-ratio P 
Constant -13832 8773 = 156 0.126 
Age 14954 1546 9.67 0.000 
§ = 22723 R-sq = 77.0% R-sq(adj) = 76.1% 


35. Drive my car (3.2) 


a. What is the equation of the least-squares regression line? Be sure 
to define any symbols you use. 
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b. Interpret the slope of the least-squares line. 


c. One student reported that her 10-year-old car had 110,000 miles on 
it. Find and interpret the residual for this data point. 


36. Drive my car (3.2, 4.3) 


a. Explain what the value of r2r” tells you about how well the least- 
squares line fits the data. 


b. The mean age of the students’ cars in the sample was x =5% = 5 
years. Find the mean mileage of the cars in the sample. 


c. Interpret the value of ss. 


d. Would it be reasonable to use the least-squares line to predict a 
car’s mileage from its age for a Council High School teacher? 
Justify your answer. 
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SECTION 10.2 Comparing Two Means 


LEARNING TARGETS By the end of the section, you should be able to: 


e Describe the shape, comes and variability of the sampling 
0.200 = 20.0%F —Z2 


305 
distribution of x 1—-x Sine 


Determine whether the conditions are met for doing inference about 
a difference between two means. 


Construct and interpret a confidence interval for a difference 
between two means. 


Calculate the standardized test statistic and P-value for a test about 
a difference between two means. 


Perform a significance test about a difference between two means. 


In the preceding section, we developed methods for comparing two 
proportions. What if we want to compare the mean of some quantitative 


variable for the individuals in Population 1 and Population 2? Our 
305 


0.200 = 20.0% 
parameters of interest are the population means 111526 — 1 and 


~ 0.200 = 20.9442 Once again, the best approach is to take 


independent random samples from each population and to compare the 
305 305 


= 0.200 = 20.0% 
sample means x 11526 Ti and x 21526 


Suppose we want to compare the average effectiveness of two 
treatments in a randomized experiment. In this case, the parameters p11 


305 305 
= 0.200 = 20.0% = 0.200 = 20.0%,,. 
1526 PF) and 21526 #2 are the true mean responses 


for Treatment 1 and Treatment 2, respectively. We use the mean response 


305 20.0% 305 
°£1 and x 21526 , to 


in the two groups, x 11526 
make the comparison. Here’s a table that summarizes these two situations: 


= 0.200 = 20.0% %, 


= 0.200 = 0.200 = 20.0% Fo 


Population Parameter Statistic Sample size 
or 
treatment 


1 
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i os 305 


‘ oy. or 

117526 = 0.200 = 20.0% 41 Fe55 = 0-200 = 20.0% 147555 0-200 = 20.0% 

2 305 _ 0.200 = 20.0% 08 0.200 = 20.0% 305 _ 9.200 = 20.0% 
1121526 rare x “£2 n21526 


We compare the populations or treatments by doing inference about 


305 
the difference p1—-121526 ~ 0.200 = 20.0% 441 — #2 between the parameters. The 


statistic that estimates this difference is the difference between the two 


305 
—, n= 0.200 = 20.0%, — FZ. 
sample means, x 1-x 21526 °L1 — X2 


305 
1526 7 0-200 = 20.0% %E1 — Bo 


distribution. 


.To use x 1-x 2 


for inference, we must know its sampling 


The Sampling Distribution of a Difference Between Two 
Means 


“ud 
To explore the sampling distribution of x 1-x 21526 


let’s start with two Normally distributed Seton having known means 
and standard deviations. Based on information from the U.S. National 
Health and Nutrition Examination Survey (NHANES), the heights of 10- 
year-old girls can be modeled by a Normal distribution with mean 


305 
UG=56.41526 — 0.200 = 20.0% 4G = 56.4 inches and standard deviation oG=2.7 


305° 


= 0.200 = 20.0%%, — Fy 
d 


2.7 inches. The heights of 10- years -old boys can be 


modeled by a Normal distribution with mean pB=55. 7iga6 ~ 0-200 = 20.0% 


305 anival 
tp = 55.7 inches and standard deviation oB=3.81526 "= °°°"op = 3.8 


inches.48 The table summarizes this information. 


Monkey Business Images/Shutterstock 


Population Shape Mean Standard dey 
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Hyear ong Approximately ae = (0.200 = 20.0%,,.. — OG=2.,. 72 = 0.200 = 20.0% 
i G=56.41526 0.0 %1G = 56.4 7 is26 
girls Normal EH 26 . 


in 
10-year-old Approximately 305 _ 9.200 = 20.0%,,,. — saa Geeta 
B=55.7 1526 ae sate “UB — Asie -O1526 
boys Normal a 926 fe n" 


Suppose we take independent SRSs of 12 girls and 8 boys of this age and 


measure their heights. What can we say about the difference x G-x B 


305 
=2 ‘3 —, 
1526-200 = 20.0% — ZB in the average heights of the sample of girls and 


the sample of boys? 


In Chapter 7, we saw that the sampling distribution of a sample mean 


305 
———-= 0.200 = 20.0%= 
X 1526 © has the following properties: 


Shape: (1) If the pepulavon distribution is Normal, then so is the sampling 


= 0.200 = 20.0 %- oF 


distribution of x 15 1528 ; (2) a the population distribution isn’t 


Normal, the sampling distribution of x 1526 eh ers 0s will be 


approximately Normal if the sample size is large enough (say, n>30 
305 
me ee 0 ) by the central limit theorem (CLT). 


305 
—_, = 0.200 = 20.0% Bea aes 
Center: HX =[11526 ‘Center : uz = 


305 
in a = ‘ —— = 0.200 = 20.0% 
Variability: ox =on oi n<0.10N 1526 ' 
Variability : oz =—if n<0.10N 
Jn 
___ 305 0.200 = 20.0 %= = 
For the sampling distributions of x G1526 "~~ ""G and x B 
305 
= 0.200 = 20.0%, - ; 
1526 "© B in this case: 
305 
‘ ee ee —_.—— = 0.200 = 20.0%% 
Sampling distribution of x _G1526 "LG 
Shape Approximately Normal, because the population distribution is 
approximately Normal 
305 . 
Center 30° = 0.200 = 20.0% uz, = UG = 56.4 inches 


ux G=pG=56.41526 — 
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Variability 305 _ 9 900 = 20.0%0z, = = = 74 = 0.78 
ox G=oGnG=2.712=0.781526 ~ 0-200 = 20.0%" 26 ~ mg ~ Via 


inch because 305 _ = % of all 10-year-old girls 
12<101526 ~ 0-200 = 20.0% 9 < 10 0 x 8 


in the United States. 
_, _ 
What about the sampling distribution of x G-x B1526 
<G¢ — £B ? We used software to take an SRS of 12 ten-year-old girls and 8 


ten-year-old boys. Our first set of samples gave x G=56.09 


305 305 
——_= 0.200 = 20.0%s, — ; = — _= 0.200 = 20.0% 
1526 ‘EG = 56.09 inches and x B=54.681526 , 


<p = 54.68 inches, resulting in a difference of x G 


= 0.200 = 20.0% 


305 
-~x B=56.09-54,68=1.411526 ~ °:2© = 20-9%F— — Fy = 56.09 — 54.68 = 1.41 


inches. A red dot for this value appears in Figure 10.4. The dotplot shows 
the results of repeating this process 1000 times. 


4 2 0 2 4 6 


FIGURE 10.4 Simulated sampling distribution of the difference in sample means 


30° _ 9.200 = 20.0% 


1526 


1401 


Lg —<B 


"305 
Te) = 0.200 = 20.0% 
"305. 
a556 —ueeis 00. 0.0% 


HG > 56.4 
nches and oG=2.7 


305 
1526 = 0.200 = 20.0% 


aq = 2.7 


"305 — 
1526 


Be 
from an approxim 


305 
1526 


—— = ().200 = 20.0% 


305 _ 0.200 = - 20.0% 


[Lp = 90.7 


inches and oB=3.8 


305 
= 0.200 = 20.0% 
1526 0.200 = 20.0% 


OB — 3.8 


inches. 


The figure suggests that the sampling distribution of x G-x B 


305 _ 0,200 = 20.0%%, — F 
1526 _"“<G — £B has an approximately Normal shape. This makes 


sense from what you learned in Section 6.2 because we are subtracting two 


305 
: . — .——= 0.200 = 20.0% 7 — 
independent random variables, x G1526 “€G andx B 


305 
= 0.200 = 20.0% % eee as 
1526 *€B , that have approximately Normal distributions. 


The mean of the sampling distribution is 0.7. The true mean height of 
305 


all 10-year-old girls is pG=56.41526 — st Oe = = 4 inches and the 


true mean height of all 10-year-old boys is pB=55. 7 siiaiaiimlia cal 


305 
LB = 55.7 inches. We expect the difference x G-x B1526 


Zc — ©B to center ot ae actual difference in the population means, G 


= 0.200 = 20.0% 


—\B=56.4-55.7=0. 715357 0200 = 20.0% yg — pp = 56.4 — 55.7 = 0.7 inch, 
The standard deviation of the sampling distribution is 1.55 inches. It can 
be found using the formula 
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oG2nG+oB2nB=2.7212+3.828=1., 55i5n ane 20 


D) D) 

[% 8 2.72 3.82 

i Sy pe ae 
NG NB 12 8 


That is, the difference (Girls — Boys) in the sample mean heights typically 
varies by about 1.55 inches from the true difference in means of 0.7 inch. 


THE SAMPLING DISTRIBUTION OF x 1-x 2 
a = 0.200 = 20.0%F, — Bo 


305 
Choose an SRS of size n11526— °70=99%ny from Population 1 with 


05 
= 0.200 = 20.0% = 0.200 = 20.0% 
mean 17596 “1 and standard deviation 611526 


305 
= 0.200 = 20.0%. 
71 and an independent SRS of size n21526 one 


305 
Population 2 with mean 21526) 77”? = 


305 
~~ 0.200 = 20.0% 
021526 “02 . Then: 


from 


20.0%u2 and standard deviation 


305 
0.200 = 20.0%, — 
The sampling distribution of x 1-x 21526~ *B1 — Ba is 


Normal if both population distributions are Normal. It is 
approximately Normal if both sample sizes are large (n1>30 and 


305 
n2>30)1526— 0.200 = 20.0% (n, > 30 and n, > 30) 


or if one population is 
Normally distributed and the other sample size is large. 


305 


The mean of the sampling distribution ofx 1-x lime 


Z1— £2 is [Lx ~ |-x J= U1 wigs 0.200 = 20.0% pz, =e = p41 — pe 


The standard deviation of the sampling distribution of x 1-x 2 
305 


2 — 0.200 = 20.0%. _ =. - 
jae AE — Ba is 


305 o3,-z, = 
ox 1-x 2=012n1+o22n21526~ °700= 200% 
as Jong as the 10% condition is met for both samples: n1<0.10N1 


{n= 0200 = 20.0%) < 0.10N1 and n2<0.10N2i508~ 0200 = 20.0% 


no < 0.10N> : 
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Note that the formula for the standard deviation of the sampling 
distribution is only correct for independent random samples from the 
corresponding populations. As in Section 10.1, we will adjust the Random 
condition for inference about a difference between two means to reflect 
this added requirement. The standard deviation of the sampling 
distribution tells us how much the difference in sample means will 
typically vary from the difference in the population means if we repeat the 
random sampling process many times. 


Think About It 


WHERE DO THE FORMULAS FOR THE MEAN AND 
ee aa cai OF THE SAMPLING DISTRIBUTION 


= 0.200 = 20.0%, _ >. 

OF x 1-x_ vss Z1-£2 COME FROM? Both x 1 
305 0.200 = 20.09 305 
1526 ‘Ey and x 21526 


0.200 = 20.0%, ; 
%2 are random variables. 


That is, their values would vary in repeated independent SRSs of size 


305 305 
° — 9.200 = = 


20.02% = 0.200 = 20.0% 
n11526 ny and n21526 2 


. Independent random 


305 
= 0.200 = 20.09 
samples yield independent random variables x 11526 Ey 
305 305 
— ——= 0.200 = 20.0% %. ie Sy 
and x 21526 %2 . The statistic x 1—x 21526 
“1 ~ “2 is the difference of these two independent random variables. 


In Chapter 6, we learned that for any two random variables X and Y, 


0.200 = 20.0% 


305 
Se SOD es Sos 
UX-Y=yX-py 1526 Mx-y = Mx — by 
30° = 0.200 = 20.0% = 
For the random variables x 11526 ~~ ~° “1 andx 2 
305 
= 0.200 = 20.0% 
1526 LQ , we have 
305 
—, — = — = 0.200 = 20.0% 
ux 1-x 2=px 1-px 2=p1-p21526 
Mz, IQ — Mz, oe Mz, = ff — 2 


We also learned in Chapter 6 that for independent random variables 
X and Y, 
_ 
OX-Y2=0X2+0¥ 21526 = 0200 = 0.0% 0% _y = Ox + oy 
305 


: —,——= 0.200 = 
For the random variables x 11526 


20.0% = 
71 and x 2 
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305 = 0.200 = 20.0%= 
1526 ° © “2, wehave 


ox 1-x 22=0x 12+ox 22=(61n1)2+(o62n2)2=012n1+022n2 


2 2 2 2 
o o oe Co 
nae 7a 2 \ a] \ ag) my 


ao 
305 — = =ygfiyl 
So ox 1-x2=012n1+622n2i526~ 0200 = 20.0%7 21-22 ~ Ym Fa 


When the conditions are met, we can use the Normal density curve 


shown in Figure 10.5 to model the sampling distribution of x 1-x 2 


5 
30° _ 9.200 = 20.0%%, — Fo 


1526 . Note that this would allow us to calculate 
— __ 305 _ sifeatelit ee - cian 
probabilities involving x 1-x 21526 © 0.200 = 20.0%, — 2 with a Normal 


distribution. 


Values of x, — X, 


FIGURE 10.5 Select independent SRSs from two populations having means 11 


305 
1526 0.200 = 20.0% 
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‘305 - , 
To 0.200 = 20.0% 


"805 
a" 0.200 = 20.0% 


1526 0.200 = 20.0% 


"805 
Toh 0.200 = 20.0% 


. When the conditions are met, the sampling distribution of the difference x 1-x 2 


A. 0.200 = 20.0% 


EE SE 
l ye, 
is approximately Normal with mean p 1-1 2 
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305 
1526 


fy — Be 


"305 
1526 


—— = ().200 = 20.0% 


—— = ().200 = 20.0% 


Describing the sampling distribution of x 1—x 2 
305 
oq = 0.200 = 20.0%%, — Fy 


RichLegg/Getty Images 


PROBLEM: A fast-food restaurant uses an automated filling machine 
to pour its soft drinks. The machine has different settings for small, 
medium, and large drink cups. According to the machine’s 
manufacturer, when the large setting is chosen, the amount of liquid L 
dispensed by the machine follows a Normal distribution with mean 27 
ounces and standard deviation 0.8 ounce. When the medium setting is 
chosen, the amount of liquid M dispensed follows a Normal distribution 
with mean 17 ounces and standard deviation 0.5 ounce. To test the 
manufacturer’s claim, the restaurant manager measures the amount of 
liquid in each of 20 cups filled using the ues setting and 25 cups filled 


2 F - om = 
using the medium setting. Let x L-—x M1526 — 0.20 = UONE, — Farina 
the difference in the sample mean amount of liquid under the two 
settings. 


a. What is the shape of the sampling distribution of x L-x M 


305 
= 0,200 = 20.0% = 


b. Find the mean of the sampling distribution. 


c. Calculate and interpret the standard deviation of the sampling 
distribution. 


SOLUTION: 


a. Normal, because both population distributions are Normal. 
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305 scueiad 
b. px L-x M=27-17= 101526 0.200 = 20.0% 


Mz, -zy = 27 17 = 10 Aanece 


Save 305 
ux 1-x 2=p1-121526 


= 0.200 = 20.0%Uz, Zz. = [1 — pe 


305 : ieee 
c. ox L—x M =0.80220+0.50225 = 0.205126. °°") = °°” 
/ 0.802 0.507 _ 
Ozr,—rM — Vy 0 + 95 — 0.205 ounce 


305 ae 
> — 0,200 = 20.0%72,-z2 = 


ox 1-x 2=012n1+022n21526 


305 
——= 0.200 = 20.0% 
The difference (Large cup — Medium cup) 1526 i 


(Large cup — Medium cup) jn the sample mean amounts of liquid 
typically varies by about 0.2 ounce from the true difference in means of 


10 ounces. 


Note that we do not have to check the 10% condition because we are not 


sampling without replacement from a finite population. 


FOR PRACTICE, TRY EXERCISE 37 


305 ; ‘ o 
Confidence Intervals for p11 — p2i526° °°" °°" — He 


When data come from two independent random samples or two groups in a 


randomized experiment (the Random condition), the statistic x 1-x 2 
305 

1526 
305 
1526 


= 0.200 = 20.0%7, — %, - 
“1 ~ “2 is our best guess for the value of p1—p2 


= 0.200 = 20.0% 4, — po 


. Before constructing a confidence interval for a 
difference in means, we must check that the conditions for performing 
inference are met. 
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CONDITIONS FOR CONSTRUCTING A CONFIDENCE 
INTERVAL ABOUT A DIFFERENCE IN MEANS 


e Random: The data come from two independent random samples or 
from two groups in a randomized experiment. 


© 10%: When sampling without replacement, n1<0.10N1 
305, aieeas 305 _,, serail 
7526 = 0-200 = 20.0%, < 0.10Ni and n2<0.10N2 i520 ~ 9-200 = 20.0% 


no < 0.10N> . 


e Normal/Large Sample: For each sample, the corresponding 
population distribution (or the true distribution of response to the 


treatment) is Normal or the sample size is large n>30 
305 


1526 00 = 20.0n > 30 | For each sample, if the population 
(treatment) distribution has unknown shape and n<30 
305 

= 0.200 = 20.0% 


1526 n < 30, a graph of the sample data shows no strong 
skewness or outliers. 


Recall from Chapter 4 that the Random condition is important for 
determining the scope of inference. Random sampling allows us to 
generalize our results to the populations of interest; random assignment in 
an experiment permits us to draw cause-and-effect conclusions. 


EXAMPLE |Do bigger apartments cost more money? 
No] 


Checking conditions 
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NOW RENTING 


Justin Sullivan/Getty Images 


PROBLEM: A college student wants to compare the cost of one- and 
two-bedroom apartments near campus. She collects the following data 
on monthly rents (in dollars) for a random sample of 10 apartments of 
each type. 


1 bedroom 900 650 600 505 450 550 515 495 650 395 
2 bedroom 995 500 580 650 675 675 750 500 495 670 


305 


—— = 0.200 = 2 
Let p1=1526 0-40 


0.00441 = the true mean monthly rent of all one- 


305 
—— = 0.200 = 20.0% = 
bedroom apartments near campus and p12=1526 ‘#2 = the true 


mean monthly rent of all two-bedroom apartments near campus. Check 


if the conditions for calculating a confidence interval for p1-12 
305 
1526 


0.200 = 20.0% 414 — pe are met 


SOLUTION: 


e Random: Independent random samples of 10 one-bedroom 
apartments and 10 two-bedroom apartments near campus. / 


305 


© 10%: We can assume that 10<10%1526 "°° 7°°"10 < 10% % of 
all one-bedroom apartments near campus and that 10<10% 


i Te 
i526 "9 = *°.°°10 < 10% of all two-bedroom apartments near 


campus. / 


Be sure to mention independent random samples from the 
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| populations of interest when checking the Random condition. | 


e Normal/Large Sample? The sample sizes are small, but the dotplots 
don’t show any outliers or strong skewness. / 


Because there is no strong skewness or outliers in either sample, it is 
plausible that the population distributions of monthly rent for one- 


bedroom and two-bedroom apartments near the campus are Normal. 


B @ ese td a $ 
1 bedroom 
e 
oe S 
ce ee e*e¢ e 


Monthly rent (+) 


FOR PRACTICE, TRY EXERCISE 41 


If the conditions are met, we can use our familiar formula to calculate a 


305 
. —— = 0.200 = 20.0% 1, — py. 
confidence interval for p1—p21526 fo at Be 


305 
statistic+(critical value) - (standard deviation of statistic) 1526 
statistic + (criticalvalue) - (standarddeviationofstatistic) 


= 0.200 = 20.0% 


As mentioned earlier, the standard deviation of the sampling distribution 
305 


—, —,——s 0.200 = 20.0%. _ =, . 
of x 1-x 21526 "£1 — £2 js 


a) 
305 Oo: - = Plage 
ox 1-x 2=012n1+022n2 1596 = 0-200 = 20.0% ny ng 


305 
Because we usually don’t know the values of 011526 — 


305 
= 0.200 = 20.0%,,. : ia gi 
1526 "02 , we replace them with the sample standard deviations 


305 305 
2 — 0.200 = 20.0% « 2 = 0.200 = 20.0% « 
$11526 "81 and $21526 °S9 
305 


error of x 1-x 21526 


0.200 = 20.0%, and o2 


. The result is the standard 
= 0.200 = 20.0%%, — Fo . 


~-x 305 _ 9.200 20.0% 71 2 
SEx 1-x 2=s12n1+s22n21526 
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This value estimates how much the difference in sample means will 
typically vary from the difference in the true means if we repeat the 
random sampling or random assignment many times. 


305 

In the unlikely event that both population standard deviations 611526 — 
305 

71 and 021526 — 


0.200 = 20.0% 
0.200 = 20.0%. . 
°O2 are known, the interval becomes 


ee 
0.200 = 20.0% 071 — 22) +2 


305 
(x 1-x 2)+z*012n1+022n21526 — 
This interval is known as a two-sample z interval for a difference in means. It is rarely 


used in practice. 


When the Normal/Large Sample condition is met, we find the critical 
305 


——_ = 0.200 = 20.0% + 
value t*1526 _ 


for a given confidence level using Table B or 


305 
—— = 0.200 = 20.0% 4, — pio : 
technology. Our confidence interval for 11—p121526 "HA ~ F2 is 


therefore 

305 
statistict+(critical value) - (standard deviation of statistic)1526 — 
statistic + (criticalvalue) - (standard deviation of statistic) 


0.200 = 20.0% 


= = | 
0.200 = 20.0%= (1 — 2) +t \ 


305 
=(x 1-x 2)+t*s12n1+s22n21526 — “os 
This is often called a two-sample t interval for a difference between two 
means. 


There is just one issue left to resolve: What df should we use to find 


305 
——_ = 0.200 = 20.0% + _... 
the t*is2e 72" = 2°-°"t* Critical value? It turns out that there are two 


practical options. 


Option 1 (Technology): Use the t distribution with degrees of freedom 
calculated from the data by the following formula. Note that the df given 
by this formula is usually not a whole number. This option results in 
confidence intervals with a margin of error that is approximately correct 
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for the stated confidence level. A significance test using this option gives a 
P-value that is approximately correct. 


Statisticians B. L. Welch and F. E. Satterthwaite discovered this fairly remarkable 
formula in the 1940s. 


305 a 
df=(s12n1+s22n2)21n1-1(s12n1)2+1n2-1(s22n2)21526 "700 = *9.” 


oy 
+ 
ny ng 
df = 
2\ 2 2\ 2 
1 $7 1 85 
ny <1 ny ng—1 no 


Option 2 (Conservative): Use the ¢ distribution with degrees of freedom 


305 sass 
equal to the smaller of n1-11526 © 0.200 = 20.0%, — 1 and n2-1 


305 
= 0.200 = 20.0%. ; ; 
1526 *n2 — 1 | With this option, the resulting confidence interval 


has a margin of error as large as or larger than is needed for the desired 
confidence level. A significance test using this option gives a P-value 
greater than or equal to the true P-value. 


Simulation studies reveal that the two-sample t procedures are most accurate when 


the sizes of the two samples are equal and the population (treatment) distributions 
have similar shapes. In planning a two-sample study, choose equal sample sizes 


whenever possible. 


As you can imagine, Option 2 was much more popular in the days 
when Table B and a four-function calculator were the main calculation 
tools. 


TWO-SAMPLE t INTERVAL FOR A DIFFERENCE 
BETWEEN TWO MEANS 


305 


a = 0.200 = 20.0% 
When the conditions are met, a C%1526 


“C% confidence 
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305 


—— = 0.200 = 20.0%, — 
interval for p1—-p121526 “1 


H2 is 


= = 305 0.200 = 20 pol @l a 2) tt *' 
(x 1-x 2)+t*s12n1+s22n21526 
305 

where t*1526 


= 0.200 = 20.0%,. . -_ : 
“t* is the critical value with C% 
305 305 


1526200 = 20-9C % of its area between —t*1526 700 = 7°-°—t& and tx 
305 

—— = 0.200 = 20.0%. eels 
i526 "9 ~ °°.“ for the t distribution with degrees of freedom using 

either Option 1 (technology) or Option 2 (the smaller of n1—-1 

305 ck 

596 = 0-200 = 20.0%) — 1 and 19-1 1596 = 0-200 = 20.0% 9 — 1 ) 


Let’s return to the apartment-rental example. Recall that the college 
0.200 = 20.0% 


305 
student took independent random samples of n1=101526 ~ 


05 ‘ F o 
mi = 10 one-bedroom apartments and n2=101526— a ws Re = 10 two- 
bedroom apartments. Here are summary statistics on the monthly rents (in 
dollars) of these apartments: 


Group name n Mean SD Min Q, Med Q3 Max 


1: 1-bedroom 10.531 82.792 395 495 510.0 600 650 
2: 2-bedroom 10 =~ 609 89.312 495 500 622.5 675 750 


We already confirmed that the a are met. 


Using Option 1, df=17. g98isn0 0.200 = 20.0% af — 17.898 and the 90% 


305 
confidence interval is ( —144.802i526~ "°° = 7°-°”—144.802 | -11.198 


305 
i526 00 = 0.811.198 ) | See the Technology Corner after the next 


example for details on how to obit this interval on the TI-83/84. Using 


= 0.200 = 20.0%"; — 1 and n2-1=9 


Option 2, df=the smaller of n1- ie 55 


305 
= 0.200 = 20.0 %, = =, ete Z 
1526 ‘ng ~1=9 . The critical value for a 90% confidence level 


305 
in at distribution with 9 degrees of freedom is t*=1.833i526 °°? = 790% 
tx = 1.833 . So the resulting interval is 


(531-609)+1.83382.792210+89.312210=—78+70.591=(—148.591, —7.409) 
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/82.7922 89.3122 
(531 — 609) + 1.8334) oe | an 


10 10 
= —78 + 70.591 


Notice that this interval is wider than the one obtained from technology. 


That is, Option 1 produces a more precise estimate of 11-2 
305 eae 
1526 > 700 = 20-07%) — M2 than Option 2. 


305 


We are 90% confident that the interval from —144.8021526 7"? = 20.0% 


305 - 
— 144,802 to -11.1981526~ 0.200 = 20.0% 


305 
1526 the difference in the true mean monthly rents 


of one-bedroom and two-bedroom apartments close to the college campus. 
The interval suggests that the mean monthly rent of one-bedroom 
apartments is between $11.20 and $144.80 less than the mean monthly rent 
for two-bedroom apartments near campus. 


~11.198 dollars captures 11-p2= 


= 0.200 = 20.0 aT _ pe _—— 


305 
Because Option 1 produces more precise estimates of :1—p121526 


= 0.200 = 20.0% 


#1 ~ #2 than Option 2, we will always interpret the confidence interval obtained 


from technology. 


The following example shows how to construct and interpret a 
confidence interval for a difference in means. As usual with inference 
problems, we follow the four-step process. 


EXAMPLE | Big trees, small trees, short trees, tall trees 
Ne 


Confidence interval for py - p2 
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Danita Delimont/Getty Images 


PROBLEM: The Wade Tract Preserve in Georgia is an old-growth 
forest of longleaf pines that has survived in a relatively undisturbed state 
for hundreds of years. One question of interest to foresters who study 
the area is “How do the sizes of longleaf pine trees in the northern and 
southern halves of the forest compare?” To find out, researchers took 
random samples of 30 trees from each half and measured the diameter at 
breast height (DBH) in centimeters./2 Here are summary statistics and 
comparative boxplots of the data: 


Descriptive Statistics: North, South 


Group n Mean StDev 
North 30 23./0 17.50 
South 30 34.53 14.26 


a. Based on the graph and numerical summaries, write a few sentences 
comparing the sizes of longleaf pine trees in the two halves of the 
forest. 


b. Construct and interpret a 90% confidence interval for the difference 
in the mean DBH of longleaf pines in the northern and southern 
halves of the Wade Tract Preserve. 
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South 


DBH (cm) 
SOLUTION: 


a. Shape: The distribution of DBH in the northern sample appears 
skewed to the right, while the distribution of DBH in the southern 
sample appears skewed to the left. 


It would not be correct to say that the northern half of the forest is 
skewed to the right! Only distributions of quantitative variables (like 
DBH) can be skewed. 


Outliers: No outliers are present in either sample. 

Center: It appears that trees in the southern half of the forest have 
larger diameters. The mean and median DBH for the southern 
sample are much larger than the corresponding values for the 
northern sample. 


Furthermore, the boxplots show that more than 75% of the southern 
trees have diameters that are above the northern sample’s median. 


Variability: There is more variability in the DBH of the northern 
longleaf pines. The range, JQR, and standard deviation are all larger 
for the northern sample. 


Don’t forget to include context (the variable of interest) when 


comparing distributions of quantitative data. In this case, that’s DBH. 
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305 > minal 
b. STATE: 90% CI for pl — p21526~ 9700 = 79°" — 2) where pl = 


305 
= 0.200 = 20.0% — 2 
1526 #1 = the true mean DBH of all trees in the southern 


SUD ak anne 
half of the forest and 2 = 1526 — 0.200 = 29.92 = the true mean DBH 


of all trees in the northern half of the forest. 


Be sure to indicate the order of subtraction when defining the 


parameter. Then you can mimic the wording in your conclusion. 


305 amas 

PLAN: Two-sample t interval for 1 — 21526 — citi cad) Wael 

e Random: Independent random samples of 30 trees each from the 
northern and southern halves of the forest. / 


305, noe 
o 10%: Assume 30<10%1i520 7"? = 7°30 < 10% of all trees in the 
305, ea 
northern half of the forest and 30<10%1526°°"" = °°.°"°30 < 10% of 
all trees in the southern half of the forest. “ 


305 


¢ Normal/Large Sample: n1 = 302301526 ”0?~ °.°’'n, = 30 > 30 and 
305. 
n2 = 30>301526 > 200 = 20-9%n. = 30 > 30 
305 . : o— 305° x 
DO: xot= 24 531520 ~ 0-700 = 20-0%3, = 34.5398] = papa isag ~ °-200 = 20-04 


305, on nad 

305 305 
— —— = 0.200 = 20.0%. — 9: = 0.200 = 20.0% 
x 2 = 23.701526 °E2 = 23.70 | 59 = 17.501526 ° 
305 
$2 = 17.50 , n2 = 301526 = 0.200 = 20.07%, ny = 30 


Option 1: 2-SampTInt gives (3.9362, 17.724) using df = 55.728 
305 ni 
15267 0.200 = 20.0% f — 55.728 


Refer to Technology Comer 25 for details on how to do calculations for 


a 2-sample t interval on the T1-83/84. 


308 4 305 
Option 2: df 2 a 0.200 = 20.04% f = 29,42 = 4 Gao ses ~ 9-200 = 20.0% 
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t* = 1.699 


0.200 = 20.0% 


305 
df=the smaller of n1—1 and n2—11526 ~ 


df = thesmaller ofn,; — landn, — 1 


ssa 0.200 = 20.0% 
= (34.53 — 23.70)+1.69914.26230+17.502301526 — 


‘ ‘ ‘ j / 14.26? 17.50? 
= (34.53 — 23.70) + 1.699)/ 420 4 175 


305 
f= ——-= 0.200 = 20.0% 
(x 1-x 2)+t*s12n1+s22n21526 


= 10.83+7. oor 596 7 0-200 = 20.0% _ 10.83 + 7.00 


305 . stall 
= (3.83, 17.83)1526 9200 = 20.0%= (3.83, 17.83) 


CONCLUDE: We are 90% confident that the interval from 3.9362 to 
305 9 9) or = 

17.724 centimeters captures p11 — 2 =1526. "= °°" — H2 = the 

difference in the true mean DBH of all the southern trees and the true 


mean DBH of all the northern trees. 


FOR PRACTICE, TRY EXERCISE 45 


The 90% confidence interval in the example does not include 0. This 
gives convincing evidence that the difference in the mean diameter of 
northern and southern trees in the Wade Tract Preserve isn’t 0. However, 
the confidence ee provides more information than a simple reject or 

“= 0,200 = 2 


fail to reject H01526 
305 


values for p1—121526 . The interval suggests that the 
mean diameter of a southern trees is between 3.94 and 17.72 cm larger 


0.0% . . : 
Ho conclusion. It gives a set of plausible 
0.200 = 20.0% 41 — pu 
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than the mean diameter of the northern trees. 


We chose the parameters in the DBH example so that x 1-x 2 


305 
— = 0.200 = 20.0%7, _ *, ets F ‘ 
1526 “1 ~ ¥2 would be positive. What if we had defined 11 
305 

= 0.200 = 20.0% 
1526 #1 as the true mean DBH of the northern trees and p12 
305 


~ 0.200 = 20-049 as the true mean DBH of the southern trees? The 90% 
305 
confidence interval for p1—p21526 
305, seis | 
(-17.72, -3.94)is26 "200 = 20.9%(—17.72, — 3.94) | This interval suggests 
that the mean diameter of the northern trees is between 3.94 and 17.72 cm 
smaller than the mean diameter of the southern trees. Changing the order 


of subtraction doesn’t change the result. 


1526 


= 0.200 = 20.0% — fhe . 
M1 ~ #2 from technology is 


Notice that the interval produced by technology is narrower than the 
one calculated using the conservative method. That’s because technology 
uses the formula on page 651 to obtain a larger (and more accurate) value 
of df. 

As with other inference procedures, you can use technology to perform 
the calculations in the “Do” step. Remember that technology comes with 
potential benefits and risks on the AP® Statistics exam. 


25. Technology Corner |CONSTRUCTING A 
CONFIDENCE INTERVAL FOR A DIFFERENCE IN 
MEANS (>) 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


You can use the two-sample ¢ interval option on the TI-83/84 to 
construct a confidence interval for the difference between two means. 
Let’s confirm the results of the two previous examples with this feature. 


1. Do bigger apartments cost more money? (page 649) 


e Enter the one-bedroom monthly rents in L1 and the two-bedroom 
monthly rents in L2. 


e Press [sTAT] then choose TESTS and 2-SampTInt.... 
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e Choose Data as the input method and enter the inputs as shown. 


e Enter the confidence level: C-level: 0.90. For Pooled: choose 
“No.” We’ll discuss pooling later. 


e Highlight Calculate and press [ENTER|, 


NORMAL FLOAT AUTO REAL RADIAN MP ra] NORMAL FLOAT AUTO REAL RADIAN MP fl 


2-SampT Int 
Inept :BERe) Stats i 
Listi:Li df=17.89752842 
List2:L2 x1=531 
Freai:i x2=609 
Freq2:1 Sx1=82. 7915723 
C-Level:@.9 Sx2=89.31218655 
Pooled: Yes ni=10 
Calculate n2=190 


2. Big Trees, Small Trees, Short Trees, Tall Trees (page 652) 
e Press [sTAT] then choose TESTS and 2-SampTInt.... 


e Choose Stats as the input method and enter the summary statistics 
as shown. 


e Enter the confidence level: C-level: 0.90. For Pooled: choose 
“No.” We’ll discuss pooling later. 


e Highlight Calculate and press [ENTER|, 


NORMAL FLOAT AUTO REAL RADIAN MP fl NORMAL FLOAT AUTO REAL RADIAN MP fl 


2-SampTiInt 2-SampT int 
Inpt:Data (3.9362,17.724) 
x1:34.53 df=55. 7276914 
Sx1:14.26 x1=34.53 
ni:30 X2=23.7 
x2:23.7 Sx1=14.26 


Sx2:17.5 Sx2=17.5 
n2:3@ n1=30 
C-Level:0.9 n2=30 
+Pooled:ff) Yes 


AP® EXAM TIP 

305 
The formula for the two-sample t interval for p1-p21526 
#1 ~ #2 often leads to calculation errors by students. Also, the interval 
produced by technology is narrower than the one calculated using the 


= 0.200 = 20.0% 
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conservative method. As a result, your teacher may recommend using 
the calculator’s 2-SampTInt feature to compute the confidence interval. 
Be sure to name the procedure (two-sample t interval for 1-2 

305 
1526 


~ 0.200 = 20.04) — 2 ) in the “Plan” step and give the interval 


(3.9362, 17.724) and df (55.728) in the “Do” step. 


CHECK YOUR UNDERSTANDING 


. 


y Mr. Wilcox’s class performed an experiment to investigate whether 
drinking a caffeinated beverage would increase pulse rates. Twenty 
students in the class volunteered to take part in the experiment. All of 
the students measured their initial pulse rates (in beats per minute). Then 
Mr. Wilcox randomly assigned the students into two groups of 10. Each 
student in the first group drank 12 ounces of cola with caffeine. Each 
student in the second group drank 12 ounces of caffeine-free cola. All 
students then measured their pulse rates again. The table displays the 
change in pulse rate for the students in both groups. 


Change in pulse rate (Final pulse rate — Initial pulse Mean 

rate) change 
Caffeine 8 3 5 1 4 0 6 1 4 0 2 
No 3 =2 4 =1 eae ye =] 4 2.0 


caffeine 


1. Construct and interpret a 95% confidence interval for the difference 
in true mean change in pulse rate for subjects like these who drink 
caffeine versus who drink no caffeine. 


2. What does the interval in Question 1 suggest about whether caffeine 
increases the average pulse rate of subjects like these? Justify your 
answer. 


305, ae 
Significance Tests for pi—p2is2s °°" = 799% — be 
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An observed difference between two Salnpre means can reflect an actual 


difference in the parameters j11- 215967 een aa aad 


305 
= 0.200 = 20.0% 
1526 "2 


and p12 


, or it may just be due to chance variation in random 
sampling or random assignment. Significance tests help us decide which 
explanation makes more sense. 


Stating Hypotheses and Checking Conditions The null hypothesis has 
the general form 


HO: .1-p2=hypothesized vai” Ree Oe 


Ho: 4, — 2 = hypothesized value 


We’re often interested in situations in which the hypothesized difference is 
0. Then the null hypothesis says that there is no difference between the two 
parameters: 


HO: 1-2= os 0.200 = 20.0% Hy: uy — po = 0 


(You will sometimes see the null hypothesis written in the equivalent form 


305 oF . — 
HO: p1=p21526 © Sane AO EL gS fly = fly .) The alternative hypothesis says 
what kind of difference we expect. 


The conditions for performing a significance test about 1-2 
305 


——= 0.200 = 20.0% — [Lh : é 
1526 “#1 ~ #2 are the same as for constructing a confidence 
interval. 


CONDITIONS FOR PERFORMING A SIGNIFICANCE 
TEST ABOUT A DIFFERENCE IN MEANS 


e Random: The data come from two independent random samples or 


from two groups in a randomized experiment. 

© 10%: When sampling without replacement, n1<0.10N1 
305 eens 305 « 
7596 = 0-200 = 20.0%, < 0.10Ni and n2<0.10N2 i506 7 0-200 = 20-0% 


no < 0.10N> . 


1431 


e Normal/Large Sample: For each sample, the corresponding 
population distribution (or the true distribution of response to the 
treatment) is Normal or the sample size is large (n>30) 


305 
——= 0.200 = 20.0% 
1526 ‘(n > 30) 


. For each sample, if the population 


(treatment) distribution has unknown shape and n<30 
Te 
i526 0 ~ °°" -< 30 , a graph of the sample data shows no strong 


skewness or outliers. 


Here’s an example that illustrates how to state hypotheses and check 
conditions. 


EXAMPLE |A longer work week? » 
Stating hypotheses and checking conditions 


Bill Varie/Getty Images 


PROBLEM: Has the mean number of hours Americans work in a week 
changed? One of the questions on the General Social Survey (GSS) 
asked respondents how many hours they work each week. Responses 
from random samples of employed Americans were recorded for 1975 
and 2014—they are summarized here: 


Year Sample size Mean SD 
1975 764 38.97 hours 13.13 hours 
2014 1501 41.91 hours 14.35 hours 


305 
: ee 3 —— = 0.200 = 20.0% 
Do these data give convincing evidence at the a=0.051526 , 
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a = 0.05 significance level of a difference in the true mean number of 
work hours per week for Americans in 1975 and 2014? 


a. State appropriate hypotheses for performing a significance test. Be 
sure to define the parameters of interest. 


b. Check if the conditions for performing the test are met. 


SOLUTION: 


a. HO: 12014 - 11975 = one 0.200 = 20.0%Ho : f2014 — His7s = 0 


305 . 
Ha: 12014 — 11975#01526 0200 = 20-°%Ha : bona — Hio7s # 0 
305 


where 111975 =1526 cia 


20.0% 
‘i975 = the me mean hours worked 


per week by Americans in 1975 and 12014 =i526 °°? ~ °°” 


#2014 ~ the true mean hours worked per week by Americans in 
2014. 


You could also state the hypotheses as 


305 . canes 
HO: 12014=119751526 700 = °9-°*Ho = M2014 = M1975 


Ha: 12014%19751520~ 0.200 = 20.0%, : poo1a # M1975 


e Random? Independent random samples of 764 working Americans 
in 1975 and 1501 working Americans in 2014. “ 


305 oF. 
° 10%: 764<10%1526" 0.200 = 20.9764 < 10% of all working 
305 


Americans in 1975; 1501<10%1526 77"? = °°°1501 < 10% of 
all working Americans in 2014. / 
305 


e Normal/Large Sample? n1975 =7642301526 — ia 
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305 
mo7s5 = 764 > 30 and n2014 = 1501>301526 
no14 = 1501 > 30 ir: 


= 0.200 = 20.0% 


Be sure to mention independent random samples from the populations of 


interest when checking the Random condition. 


FOR PRACTICE, TRY EXERCISE 51 


Calculations: Standardized Test Statistic and P-Value If the conditions 


are met, we can proceed with calculations. To do a test of HO: p1-p2=0 


305 

——= 0.200 = 20.0%: 4, — to = ee ee 
1526 °Ho: 1 — M2 = 9) start by standardizing x 1-x 2 
305, on. fe 


standardized test statistic=statistic-parameterstandard deviation of statistic 
statistic— parameter 


305 ; P standardized test statistic = ——_—_—_—_—____—_—_—_- 
1526 0.200 = 20.0% standard deviation of statistic 


305 


A F a ——= 0.200 = 20.0% 
In the unlikely event that both population standard deviations 011526 , 


305 
——= 0.200 = 20.0% . 52s 
1 and 021526 "02 are known, the standardized test statistic becomes 


_ (%1—-¥2)—0 
—_ 
305 a “2 
z=(x 1-x 2)-0012n1+022n2i526~ 7=900% — ny ng 


We could then use the standard Normal distribution to find the P-value. This is 


known as a two-sample z test for a difference in means. It is rarely used in practice. 


Because we seldom know the population standard deviations o1 


305 305 
—— = 0.200 = 20.0% —— = 0.200 = 20.0% 
1526 *o1 and 021526 9 = 99%, 


= =, awe eitiieliddeec, — sum 
of x 1-x 21526 2700 = 20°" — Z2 in the denominator of the 
standardized test statistic: 


, we use the standard error 


1434 


i= 
2 2 
= moe ae 
t=(x1-x”2)—0s12n1+s22n21526 "200 = 20.0% ny ng 


When the Normal/Large Sample condition is met, we can find the P- 
value using the ¢ distribution with degrees of freedom given by Option 1 


305 a 
(technology) or Option 2 (df = smaller of n1-11526- °°” ~ 


o 
—— = 0.200 = 20.0% 
n2—11526 *n2 —1), 


20.0% _ 


EXAMPLE |A longer work week? (>) 


Calculating the standardized test statistic and P- 
value 


David Papazian/Getty Images 


PROBLEM: Refer to the previous example. The table summarizes data 
on hours worked per week from the independent random samples of 
working Americans in 1975 and 2014. 


Year Sample size Mean SD 
1975 764 38.97 hours 13.13 hours 
2014 1501 41.91 hours 14.35 hours 


We already confirmed that the conditions for performing a significance 
test are met. 


a. Explain why the sample results give some evidence for the alternative 
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hypothesis. 
b. Calculate the standardized test statistic and P-value. 


c. What conclusion would you make? 
SOLUTION: 


a. The observed difference in the sample means is x 2014-x 1975= 
305 
41.91 — 38.97 = 2.941526 — 
E2014 — Li975 = 41.91 


0.200 = 20.0% 


38.97 = 2.94 | which gives some evidence 


305 
in favor of Ha: 12014 - 11975201526 7" = 20.0% 


305 a 
Hy: p2014 — -1975 ~ 9 because 2.94 41526 200 = 20.0% gy 


b. t=(41.91 — 38.97) — 014.3521501+13.132764 = 2.940.602 = 4.88 


(41.91 — 38.97) —0 294 
t= ——__ = 002 > = 4.88 
305 ~ avai / 14.352 13.132 
T5967 0-200 = 20.0% is a 


305 4 on 1 o% 
fee) ie ees 


P-value: 
305 — 0.200 = 20.0% 
Option 1: 2-SampTTest gives t = 4.881526 — “eet = 4.88 and 


= 0.200 = 20.0% 


305 
P-value = 0.000001151526 °° 00000115 using df= 1660.58. 


305 
T526 = 0-200 = 20.0%d Ff = 1660.58. 


305 
Option 2: df =smaller1526 = 0.200 = 20.0% f = smaller of 764-1 
305 305 


1526 ~ °-200 = 20.0%764 — 1 and 1501 -1=7631526~ 
1501 — 1 = 763 


0.200 = 20.0% 


Using Table B: df = 100 gives P-value <2(0.0005) = 0.001 
oo = 0.200 = 20.0%< 2 (0.0005) — 0.001 


Using technology: tcdf(lower: 4.88, upper:1000, df:763) x2 = 
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305 ; . eo 
0.000001292.1526 > "200 = 29.9% 9 — 9.000001292. 


Refer to Technology Corner 26 on page 660 for details on how to do 


calculations for a 2-sample t test on the TI-83/84. 


05 


3 
c. Because the P-value of 0.00000115<a = 0.051526 
305 ‘ anche 
0.00000115 < a = 0.05 , we reject H01s26 °°" ~ °°” . There is 


convincing evidence of a difference in the true mean hours worked 
by Americans in 1975 and 2014. 


= 0.200 = 20.0% 


t distribution 
with df = 763 


-5) 4 -3 2 -1 0 1 2 3 4 | s 


T= -4.88 T= 4.88 


FOR PRACTICE, TRY EXERCISE 55 


What does the P-value in the example tell us? If there is no difference 
in the true mean hours worked by Americans in 1975 and in 2014, there is 
a 0.00000115 probability of getting a difference in sample means as large 
as or larger than 2.94 hours in either direction purely by the chance 
involved in random sampling. With such a small probability of getting a 


result like this just by chance when the null hypothesis is true, we have 
305, invainia 
convincing evidence to reject H01526 © iil ai | 
We can get more information about the difference between the 
population mean hours worked by Americans in 1975 and 2014 with a 


confidence interval. Technology gives the 95% confidence interval for 


305 : aio 
412014—-119751526 "700 = °°-°*"2014 — 1975 as 1.76 to 4.12 hours. That is, 


we are 95% confident that the true mean hours worked per week by 
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Americans is between 1.76 hours and 4.12 hours larger in 2014 than in 


ae 
1975. This is consistent with our “reject H01526 ~ iene! (ee 
conclusion because 0 is not included in the interval of plausible values for 


305 ee 
12014-11975 1526 700 = 29-9014 — Ha97s 


Putting It All Together: Two-Sample t Test for py — po 


Here is a summary of the details for the two-sample t test for a difference 
between two means. 


TWO-SAMPLE t TEST FOR A DIFFERENCE BETWEEN 
TWO MEANS 


Suppose the conditions are met. To test the hypothesis HO: p1-—y2=0 
—— = 0.200 = 20.0% Hy : 


Hi ~ #2 = 0 | compute the standardized test 


1526 
Statistic 


305 © . — 
t=(x 1-x 2)-0s12n1+s22n21s26~ 70 = 70.0% ny ng 


Find the P-value by calculating the probability of getting a t statistic 
this large or larger in the direction specified by the alternative 


305 
hypothesis Ha1526 = 0.200 = 20.0% FT, Use the t distribution with degrees 


of freedom approximated by Option 1 (technology) or Option 2 (the 


305 a 
= 0.200 = 20.0%, se = 0.200 = 20.0% 

smaller of n1—11526 *m — 1 and n2-11526 

m—1), 


Here’s an example that shows how to perform a two-sample t test for a 
difference between two means in a randomized experiment. 


EXAMPLE | Calcium and blood pressure » 


Significance test for a difference between two 
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rdrgraphe/Shutterstock 


PROBLEM: Does increasing the amount of calcium in our diet reduce 
blood pressure? Examination of a large sample of people revealed a 
relationship between calcium intake and blood pressure. Such 
observational studies do not establish causation. Researchers therefore 
designed a randomized comparative experiment. 


The subjects were 21 healthy men who volunteered to take part in the 
experiment. They were randomly assigned to two groups: 10 of the men 
received a calcium supplement for 12 weeks, while the control group of 
11 men received a placebo pill that looked identical. The experiment 
was double-blind. The response variable is the decrease in systolic (top 
number) blood pressure for a subject after 12 weeks, in millimeters of 
mercury. An increase appears as a negative number.” Here are the data: 


Group 1 (calcium) 7 =4 18 17 =3 =5 1.10 11 =2 
Group 2 (placebo) Se SS Oo oe ee 8 


a. Do the data provide convincing evidence that a calcium supplement 
reduces blood pressure more than a placebo, on average, for subjects 
like the ones in this study? 


b. Interpret the P-value you got in part (a) in the context of this 
experiment. 


SOLUTION: 


305 “ anne 
a. STATE: HO: pC- pP=01526~ "7° = 7°-°"Ho = Ho — Hp = 9 Ha: uC 
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305 eee 
—P>01526~ 0.200 = 20.0%H, : uc — up > 0 


305 
——= 0.200 = 20.0% 4. = 
where [C=1526 Hc ~ the true mean decrease in systolic 


blood pressure for healthy men like the ones in this study who take a 


305 
. —— = 0.200 = 20.0%), 5 — 
calcium supplement and ppP=1526 HP ~ the true mean 


decrease in systolic blood pressure for healthy men like the ones in this 

study who take a placebo. No significance level was given, so we’ll use 
|, RTE 

o=0.051526 9 = *°-°%a = 0.05 . 


Note that we did not have to check the 10% condition because the 
subjects in the experiment were not sampled without replacement from 


some larger population. 


305 


PLAN: Two sample t test for wC-pPis26 °°” ~ 20.0% uc — LP 


e Random: The 21 subjects were randomly assigned to the calcium or 
placebo treatments. 


e Normal/Large Sample: The sample sizes are small, but the dotplots 
show no strong skewness and no outliers. 


Calcium sesso cI e ee ae 
se 
e & 3 $ ee ¢ Be 
ae 
-15 -10 =< 0 5 10 15 20 


Decrease in systolic blood pressure 


Based on these graphs, it’s reasonable to believe that the true 
distributions of decrease in systolic blood pressure are Normal for 


subjects like these when taking calcium or the placebo. 


305 " an neem 
DO: «x C=5.000i1526~ "700 =99”"c = 5.000 sc=8.743 


305 305 

~~ 0.200 = 20.0%, — ~~ 0.200 = 20.0%, — = 

1526 *sc = 8.743 | nC = 101526 ‘nc = 10. p= 
305 9 ann = 20.0% _. 

—ieeis6 «cp = 0.273 Penis 


- 


5 


30 
== ——= 0.200 = 20.0%), — 
sp = 9.901 nP = 111526 C= 2IAn, = 11 
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The sample results give some evidence in favor of Ha: pC-pP>0 
305 
1526 
305 


0.200 = 20.0%], : 


uc — Bp > 0 because 5.000-(-0.273) = 5.273>0. 
aq = 0.200 = 20.0% 5.000 — (—0.273) = 5.273 > 0. 


* t=[5.000-(—0.273) ] — 08.743210+5.901211 = 5.2733.2878 = 1.604 


5.000—(—0.273)] — 0 9 
t= = SS P= 1.604 

305 9 on ne | 8.7432 5.9012 3.2878 
1526 0.200 = 20.0% \ 10 = 


P-value 


305, sihinie 
Option 1: 2- ao anp Nest gives t= 1.6041526 = 0.200 = 20.9% = 1.604)and 


P-value = 0. 064disz0 0.200 = 20.0%_ 0.0644 using df = 15.59. 
305 
1526 > 0.200 = aren — 15.59. 


Option 2: df= gine 0200 = 094% af = 9 


Using Table B: 0.05<Piszs = 0200 = 20.0% 95 < P value <0.10 
| rp 

1526 > 0.200 = 20.0 %=- 0.10 

Using technology: tcdf(lower:1.604, upper: 1000, df:9) = 0.0716 
305 = 0.200 = 20.0% _ 

1526 = 0.0716 


Refer to Technology Comer 26 below for details on how to do 


calculations for a 2-sample t test on the TI-83/84. 


305 

CONCLUDE: Because the P-value of 0.0644>a = 0.05, 1526 
305 

0.0644 > a = 0.05, we fail to reject H01526— . The 


experiment does not provide convincing evidence that the true mean 
decrease in systolic blood pressure is higher for men like these who take 
calcium than for men like these who take a placebo. 
305 
e Assuming HO: p11 - p2 =01526 H2 = Vis true, 
there is a 0.0644 probability of getting a difference (Calcium — 
Placebo) in mean blood pressure reduction for the two groups of 


= 0.200 = 20.0% 


0.200 = 20.0% F7, 
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5.273 or greater just by the chance involved in the random 
assignment. 


t distribution 
with df =9 


i -3 2 -1 0 1 | 2 3 
t= 1.604 


FOR PRACTICE, TRY EXERCISE 57 


Notice that technology gives smaller, more accurate P-values for two- 
sample t tests than the conservative method. That’s because calculators 
and software use the more complicated formula on page 651 to obtain a 
larger number of degrees of freedom. 


Why didn’t researchers find a significant difference in the calcium and 
blood pressure experiment? The difference in mean systolic blood 
pressures for the two groups was 5.273 millimeters of mercury. This seems 
like a fairly large difference. With the small group sizes of 10 and 11, 
however, this difference wasn’t large enough to reject HO: pwC-pP=0 


305 
= 0.200 = 20.0%H, + nv oe ee ; 
1526 Hy : uc ~ Lp = 9 in favor of the one-sided alternative. 


We suspect that larger groups might show a similar difference in mean 
blood pressure reduction, which would indicate that calcium has a 


significant effect. If so, then the researchers in this experiment made a 
305 

Type II error—failing to reject a false H01526 ~ . In fact, 

later analysis of data from an experiment with more subjects resulted in a 

P-value of 0.008. Sample size strongly affects the power of a test. It is 

easier to detect a difference in the effectiveness of two treatments if both 


are applied to large numbers of subjects. 


0.200 = 20.0% FT, 


26. Technology Corner |PERFORMING A 
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SIGNIFICANCE TEST FOR A DIFFERENCE IN MEANS 
Na 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


You can use the two-sample t test option on the TI-83/84 to do the 
calculations for a significance test about the difference between two 
means. Let’s confirm the results of the two previous examples with this 
feature. 


1. A longer work week? (page 657) 
e Press [STAT|, then choose TESTS and 2-SampTTest. 


e Inthe 2-SampTTest screen, specify “Stats” and adjust your other 
settings as shown. For Pooled: choose “No.” We’II discuss pooling 
shortly. 


e Highlight “Calculate” and press [ENTER|. 


NORMAL FLOAT AUTO REAL RADIAN MP 


Inpt:Data Hix 
x1:41.91 t=4. 880781802 
Sx1:14.35 p=1.158065087€e -6 


ni:1501 
X2:38.97 
S$x2:13.13 

n2: 764 

1: <u2 du2 
LPooled:f Yes 


2. Calcium and blood pressure (page 659) 


e Enter the Group 1 (calcium) data in L1 and the Group 2 (placebo) 
data in L2. 


e Press [STAT] then choose TESTS and 2-SampTTest. 


e In the 2-SampTTest screen, specify “Data” and adjust your other 
settings as shown. For Pooled: choose “No.” We’II discuss pooling 
shortly. 


e Highlight “Calculate” and press [ENTER|. 
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NORMAL FLOAT AUTO REAL RADIAN MP fl NORMAL FLOAT AUTO REAL RADIAN MP fl 


2-SampT Test 2-SampTTest 
Inept: BER) Stats H1>u2 
Listi:Li t=1.603717288 
List2:L2 pP=0. 0644196844 


Freai:1 df=15.59051297 
Frea2:1 x1=5 

H1i:4u2 <u2 X2=-@. 2727272727 
Pooled: Yes Sx1=8. 743251366 
Color: ia Sx 2=5. 900693334 
Calculate Draw 


Note: If you select “Draw” instead of “Calculate,” the appropriate t 
distribution will be displayed, showing the standardized test statistic 
and the shaded area corresponding to the P-value. 


AP® EXAM TIP 


The formula for the two-sample t statistic for a test about p1—p2 


305 
——= 0.200 = 20.0%, — py. 
‘1 ~ #2 often leads to calculation errors by students. 


Also, the P-value from technology is smaller and more accurate than 

the one obtained using the conservative method. As a result, your 

teacher may recommend using the calculator’s 2-SampTTest feature to 

perform eons: Be sure to name the procedure (two-sample t test 
o 


<= 0.200 = 20.0% 11 — pu 


for p1—p21526 ) in the “Plan” step and to report the 


305 ceca 
standardized test statistic (t=1.60)1526 — Sm DAE = LG) , P-value 
(0.0644), and df (15.59) in the “Do” step. 


Think About It 


WHY DO THE INFERENCE METHODS FOR RANDOM 
SAMPLING WORK FOR RANDOMIZED EXPERIMENTS? 


305 


—— = 0.200 = 20.0%, — 
Confidence intervals and tests for p1—121526 iii all ae 


are 


305 a. 


based on the sampling distribution of x 1-x 21526 
. But in experiments, we aren’t sampling at random from any larger 
populations. We can think about what would happen if the random 
assignment were repeated many times under the assumption that HO: 
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305 
—— = 0.200 = 20.0% . a= F F 
H1—-p2=01526 Ho : 1 — #2 = 9 is true. That is, we assume 


that the specific treatment received doesn’t affect an individual 
subject’s response. 

Let’s see what would happen just by chance if we randomly 
reassign the 21 subjects in the calcium and blood pressure experiment to 
the two groups many times, assuming the drug received doesn’t affect 
each individual’s change in systolic blood pressure. We used software 
to redo the tango assignment 1000 times. Figure 10.6 shows the value 

305 
of x C-x Pisa *0.0%Zc — Zp in each of the 1000 simulation 
trials. This distribution (sometimes referred to as the randomization 


= 0.200 = 


5 

distribution of Cire eRs RISES ARP ) has an 
approximately Normal shape with mean 0 (no difference) and standard 
deviation 3.42. This matches fairly well with the distribution we used to 
perform calculations in the example. 

In the actual experiment, the difference in the mean change in blood 
pressure in the calcium and placebo groups was 5.000-(—0.273)=5.273 
say = 0.200 = 20.0%5,000 — (—0.273) = 5.273 po. iikeivis ithaca 


difference this large or larger would happen just by chance when HO 
305 
= 0.200 = 


1526 20.0% Hy is true? Figure 10.6 provides a rough answer: 64 of 
the 1000 random reassignments (indicated by the red dots) yielded a 
difference in means greater than or equal to 5.273. That is, our estimate 
of the P-value is 0.064. This is quite close to the 0.0644 P-value that we 
obtained in the Technology Corner. Neither of these values is the exact 
P-value. To get the exact P-value in an experiment, you would consider 
the difference in sample means for all possible random assignments. 
This approach is called a permutation test. 
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: A 5 64 of the 1000 
. eee ° . 
z=: ae: random reassignments 
pad e@eeceee ee eee 
3 
3 


$ 82 Ss S see 8 see resulted in a difference 
in sample means of 
5.273 or greater. 


© eee 
a ee ee eeece 
ee seececessccooses 
oO @ 0890060 0000000600006 


~10 -8 -6 a4 -2 0 2 4 6 8 10 
¥,.—X, = simulated difference in sample mean decrease in systolic blood pressure 


FIGURE 10.6 Dotplot of the values of x C-x P 


30° _ 9.200 = 20.0% 


from each of 1000 simulated random reassignments of subjects to treatment groups 
in the calcium and blood pressure experiment, assuming no treatment effect. 


CHECK YOUR UNDERSTANDING 


How quickly do synthetic fabrics such as polyester decay in 
landfills? A researcher buried polyester strips in the soil for different 
lengths of time, then dug up the strips and measured the force required 
to break them. Breaking strength is easy to measure and is a good 
indicator of decay. Lower strength means the fabric has decayed more. 


For one part of the study, the researcher buried 10 strips of polyester 
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fabric in well-drained soil in the summer. The strips were randomly 
assigned to two groups: 5 of them were buried for 2 weeks and the other 
5 were buried for 16 weeks. Here are the breaking strengths in 
pounds:24 


Group 1 (2 weeks) 118 126 126 120 129 
Group 2 (16 weeks) 124 98 110 140 110 


Do the data give convincing evidence that polyester decays more in 
16 weeks than in 2 weeks, on average? 


THE POOLED TWO-SAMPLE t PROCEDURES (DON’T USE 
THEM!) Most software offers a choice of two-sample t procedures. One is 
often labeled “unequal” variances; the other, “equal” variances. The 
unequal variance procedure uses our formula for the two-sample t interval 
and test, with df calculated as shown on page 651. This test is valid 
whether or not the population variances are equal. 


Recall that the variance is the square of the standard deviation. 


The other choice is a special version of the two-sample t procedures 
that assumes the two population distributions have the same variance. This 
procedure combines (the statistical term is pools) the two sample variances 
to estimate the common population variance. The resulting statistic is 
called the pooled two-sample t statistic. 


The pooled t statistic has exactly the t distribution with nl+n2-2 


305 y 
-a9G = 0-200 = 20.0%, + no 


2 degrees of freedom if the two population 
variances really are equal and the population distributions are exactly 
Normal. This method offers more degrees of freedom than Option 1 
(Technology), which leads to narrower confidence intervals and smaller P- 
values. The pooled t procedures were in common use before software 
made it easy to use Option 1 for our two-sample t procedures. 


Remember, we always use the pooled sample proportion p4=X1+X2n1+n2 


X1+Xo 


mi +n2 when performing a test of HO: p1-—p2=0 


305 : 
—— = 0.200 = 20.0%p = 
1526 
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305 oon — 99.0% 305, 
= 0.200 = YH : 1 — P2 = 0. That’s because p11526 


200 — or 
1526 0.200 20.0% 
0.200 = 20.0 %n5 


305 
and p21526 — are equal if the null hypothesis is true. We estimate 


305 
; ; . —— = 0.200 = 20.0% 
this common value using the overall proportion p/ 1526 P of successes 


in the two samples combined. 


In the real world, distributions are not exactly Normal, and population 
variances are not exactly equal. In practice, the Option 1 two-sample t 
procedures are almost always more accurate than the pooled procedures. 
Our advice: Never use the pooled t procedures if you have technology that 
will carry out Option 1. 


Remember, we always use the pooled sample proportion 


305 _ 9.200 = 20.0% = ~1+*2 when performing a test of 


pA=X1+X2n1+n2 1526 © ny +ny 
305 : 
= 0,200 = 20.0% 7. - _ gq. That’s because 
HO: p1—p2=01526 ee Ey Du pO 
305 305 


= 0.200 = 20.0%, and = 0.200 = 20.0%, are equal if the null 


p11526 p2 1526 
hypothesis is true. We estimate this common value using the overall 


. 5 . 2 
proportion _2”° — 9.209 = 20.0%, of successes in the two samples combined. 
p/1526 Pp 


Think About It 


HOW DOES POOLING WORK? Suppose we have two Normally 
distributed populations with the same variance ois = 
If we take independent random samples from the two populations, both 
sample variances 5125367 
estimate o2isz - 


0.200 = 20.0% 52 ; 


0.200 = 20.0% 2 = 


— = 0.200 = 20.0% 82 
°1 and s221526 °S5 


Saag! . The best way to combine (pool) these 
two estimates is to average them with weights equal to their degrees of 
freedom. This gives more weight to the information from the larger 
sample, which is reasonable. The resulting estimator of 02 


305 
0.200 = 20.0% 2 . 
1526 “o~ is 


305 ; ; w 
sp2=(n1-1)s12+(n2—-1)s22n1+n2—21526~ °700 = 70.0% 
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— (m1 -1) sf +(ng—1) 85 

- nj +ng —2 
305 

This is called the pooled estimator of 021526 


combines the information in both samples. 


= 0.200 = 20.0% : 
“a” because it 


305 
To perform inference about p1—p121526 "°° = 7°. °% 11 — M2 in this 


setting, we start by simplifying our formula for the standard deviation of 
the statistic: 


ox 1-x 2=012n1+022n2=02n1+02n2=02(1n1+1n2)=o1n1+1n2 
305 


1526 0.200 = 20.0% 
o o Pe. 2 
1 2 o o i AX. a, | 
O%,-z, = SP — |? ae 
ny } Thy ng \ it} 1 re =) ny * aa 
305 _ 0.200 = 20.0! 
We then substitute the pooled estimate sp1526 = 20.0%8p for g 
305 305 a 
ie ns The pooled two-sample t1526 — Smee ee interval 
is 
(Z| —T2) +t folic 
& ©} — 29 Sy ————— 
= = 305_ ‘ 20.0% . , In n 
(x 1-x 2)+t*sp1n1+1n21526 — lal a \ I : 


305 


The pooled two-sample t1526 ales 


20.0’ statistic for a test of 
305 
HO: p1-p2=01526 © 0.200 = 20.0% Ho: py — 2 = 0 jg 


64-6 >) opin sine a20=200% ny ng 
For either procedure, we use a t distribution with nl1+n2-2 
oe a iy +2 degrees of freedom. 


Section 10.2 Summary 


305 
; —— = 0,200 = 20.0%, 
e Choose independent SRSs of size n11526 “M1 from 
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305 
Population 1 with mean 1111526 © 
305 305 
——-= 0.200 = 20.0%, 
011526 71 and of size n21526 — 
305 
2 with mean 121526 — 
305 
= 0.200 = 20.0% 
5g = 0-200 = 20.0% 
305 
1526 


0.200 = 20.0% _ 
“#1 and standard deviation 


0.200 = 20.0% ; 
*n2 from Population 


0.200 = 20.0% . 
#2 and standard deviation o2 


. The sampling distribution of x 1-x 2 


= 0.200 = 20.0%%, _ | 
*t1 — £2 has the following properties: 


: Shape: Normal if both population distributions are Normal; 


approximately Normal if both sample sizes are large (n1>30 and 
305 


n2>30)1526~ 7") = 2°- 0%(n; > 30and nz > 30) or if one population is 


Normally distributed and the other sample size is large. 

305 

» Center: Its mean is px 1-x 2=p1-p121526 
Mz, -Zo = M1 be : 


= 0.200 = 20.0% 


» Variability: The standard deviation is ox 1-x 2=012n1+022n2 

305 5 re 

——s y = % OF Zo 7 / 9 eng A 

i ll Vm 7 as long as the 10% condition is 
305 said 

met when sampling without replacement: n1<0.10N11526 aa 


cl 
ny <0.10M and n2<0.10N21526 © 200 = 20-9%*n2 < 0.10N2 © 


Confidence intervals and tests for the difference between the means of 
two populations or the mean responses to two treatments p11 


305 305 

= 0.200 = 20.09 = 0.200 = 20.0% 

1526 “41 and [12 1526 “#2 are based on the 
305 


difference x 1-x 21526 


- 0.200 = 20.0%, _ ZF. 
“1 ~ ©2 between the sample means. 


305 
Before estimating or testing a claim about p1—p121526 © 
#1 ~ #2. check that these conditions are met: 


0.200 = 20.0% 


: Random: The data come from two independent random samples or 
from two groups in a randomized experiment. 


© 10%: When sampling without replacement, n1<0.10N1 


305 305 
——= 0.200 = 2 ie eeninins 
1526 20.0%, <0.10M1 and n2<0.10N2i526~ "70? = 79° 


no < 0.10N> . 


: Normal/Large Sample: For each sample, the corresponding 
population distribution (or the true distribution of response to the 


1450 


treatment) is Normal or the sample size is large (n>30) 


305 


Teoq = 0-200 = 20.0%(n > 30) 


1526 . For each sample, if the population 


305 . 
Seca distribution has unknown shape and n<30is26 07"? ~ °°.” 


< 30 , confirm that a graph of the sample data shows no strong 
skewness or outliers. 


e When conditions are met, a C% confidence interval for p1-p2 
= 0.200 = 20.0% 41 — [19 jo 


305 
(x 1-x 2)+t*s12n1+s22n21526 — 
305 


ees a ee 
where t*1526 = 0.200 = 20.0 


305 
0.200 = 20.0% ,« 
“t® is the critical value with C, 1526 


on 305 
: = 0.200 = 20.0% _ 4* 
C, of its area between —t*1526 t” and t*1526— 


for the t distribution with nee of freedom from either Option 1 


0.200 = 20.0 “ops 


305 
senualoey) or Option 2 (the smaller of n1—11526 i 


= 0.200 = 20.0%, — 1 ). This is called a two-sample t 


= 0.200 = 20.0% 4, — pio 


and n2—11526 


2H 


05 
interval for 1—p121526 


305 ; ‘ o 
e To test HO: p1-p2=01526 ~ 0.200 = 20.0%H9 : 41 — H2 = 9 use a two- 


I acorns 
sample t test for 1—-p2i526 "°°? ~ *0-0"1 — #2, The standardized test 


Statistic is 


= 0.200 = 20.0% 


ie 
t=(x 1-x 2)-0s12n1+s22n21526 


P-values are calculated using the t distribution with degrees of freedom 
from either Option 1 (technology) or Option 2 (the smaller of n1—-1 


305 305 


= 0.200 = 20.0%, 0.200 = 20.0% 
ny no 1), 


1526 1 and n2—11526— 


e Be sure to follow the four-step process whenever you construct a 
confidence interval or perform a significance test for comparing two 
means. 
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10.2 Technology Corners (>) 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


25. Constructing a confidence interval for a difference in 
means Page 654 


26. Performing a significance test for a difference in means 


Section 10.2 Exercises 


37. pg 648 |») Cholesterol The level of cholesterol in the blood for all 


men aged 20 to 34 follows a Normal distribution with mean hM=188 
305 _ 0,200 = 20.0%), 4, = 188 
1526 iM = 


milligrams per deciliter (mg/dl) and 
305 » onwer 
standard deviation oM=41i526 70° = *°°”’oy = 41 mg/dl. For 14- 


year-old boys, blood cholesterol levels follow a Normal distribution 


305 
with mean pB= oo Re pe LTO mg/dl and standard 


deviation oB=30%536 ai! aa mg/dl. Suppose we select 


independent SRSs of 25 men aged 20 to 34 and 36 poye aged 14 and 
= 0.200 = 20.0%% yy 


305 
calculate ie sample mean cholesterol levels x M1526 536 
305 


= 0.200 = 20.0% 
and x Bi526 


°F B 


a. What is the shape of the sampling distribution of x M-x B 
305 . ae 7 
{506 = 0-200 = 20.0%F yy — Ep > Why? 

b. Find the mean of the sampling distribution. 


c. Calculate and interpret the standard deviation of the sampling 
distribution. 
38. How tall? The heights of young men follow a Normal distribution 


305 Po 
with mean tM=69.3 1526 = 0.200 = 20.9 = 69.3 inches and standard 
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39. 


40. 


41. 


305 - 
deviation OM=2.81526 — "ay = 2.8 inches. The heights of 


young women follow a Normal distribution with mean p»W=64.5 
305 - ; 
= 0.200 = 20.0% yy = 64.5 


0.200 = 2 


inches and standard deviation oW=2.5 


305 sien 
1526 0 = 20-9’ = 2.5 inches. Suppose we select independent 
SRSs of 16 young men and 9 young women and calculate the sample 


305 305 


—~ p= = 0.200 = 20.0% % ype = 0.200 = 20.0% % 
mean heights x M1526 EM and x Wi526 ry 


a. What is the shape of the sampling distribution of x M-x W 


305 
9 » or — _ 
1596 = 0200 = 200% ay — Ew > Why? 


b. Find the mean of the sampling distribution. 


c. Calculate and interpret the standard deviation of the sampling 
distribution. 


Cholesterol Refer to Exercise 37. 


a. Find the probability of getting a difference in sample means x M 


305 
= 0.200 = 


-x Bi526 *0.0°E m — ©B that’s less than 0 mg/dl. 


b. Should we be surprised if the sample mean cholesterol level for the 


14-year-old boys exceeds the sample mean cholesterol level for the 
men? Explain your answer. 


How tall? Refer to Exercise 38. 


a. Find the probability of getting a difference in sample means x M 


305 
= 0.200 = 2 


Types 0.0%7 ., — Fu, : 
—x W1526 “M ~ “W that’s greater than 2 inches. 


b. Should we be surprised if the sample mean height for the young 


men is at least 2 inches greater than the sample mean height for the 
young women? Explain your answer. 


pg 649 |») Shoes How many pairs of shoes do teenagers have? To 


find out, a group of AP® Statistics students conducted a survey. They 
selected a random sample of 20 female students and a separate 
random sample of 20 male students from their school. Then they 
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42. 


43. 


recorded the number of pairs of shoes that each student reported 
having. Here are their data: 


Males 14 7 6 59 12 38 8 7 10 10 

Females 50 260 260 381 57 19 240 22 2338 

Males 10 = 11 4 5 22 ii 59 10 35 7 

Females 13 500 13 340 2B 830 9 13S 15S 51 
= = 0.200 = 2 


0.0% . 
#1 = the true mean number of pairs of shoes 


305 


that male students at the school have and 1121526 — pan care lig 
true mean number of pairs of shoes that female students at the school 
have. Check if the conditions for calculating a confidence interval for 


Let 1111526 


a 


305 
1-2 1526 


= 0.200 = 20.0% 41 — 2 are met. 
Who texts more? For their final project, a group of AP® Statistics 
students wanted to compare the texting habits of males and females. 
They asked a random sample of students from their school to record 
the number of text messages sent and received over a 2-day period. 
Here are their data: 


Males 127 44 28 83 0 6 78 6 

Females 112 203 102 54 379 305 179 24 

Males 5 213 73 20 214 28 11 

Females 127 65 41 27 298 6 130 0 
305 


0.200 = 20.0% 
#1 = the true mean number of texts sent by 


305 ee 
male stu- dents at the school and 1121526 ~ 0.200 = 20.942 = the true 


mean number of texts sent by female students at the school. Check if 
the conditions for calculating a confidence interval for 11-2 

305 
1526 


Let 11526 — 


= 0.200 = 20.0% );, — yp. 
#1 ~ #2 are met. 


Household size How do the numbers of people living in households 
in the United Kingdom (U.K.) and South Africa compare? To help 
answer this question, we chose independent random samples of 50 
students from each country. Here is a dotplot of the household sizes 
reported by the students in the survey: 
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South Africa 


Place 


U.K. 
SHCHHSSSHSSHSSSSSSSSSEEe | Geoseseooooseoos 


MN —| @eeeeeeeeeeeeoe 


eee 
eeese 
eoesee 


o-7 


Household size 


305 


Let pUKis26~ "7" = 


20.0%uuK = the true mean number of people 


305 
living in U.K. households and pSA1526 © 0.200 = 20.0%4454 = the true 


mean number of people living in South African households. Check if 


the conditions for calculating a confidence interval for tUK-pSA 
305 
1526 200 = 20.0% uu — MSA are met. 


Long words Mary was interested in comparing the mean word length 
in articles from a medical journal and an airline’s in-flight magazine. 
She counted the number of letters in the first 400 words of an article 
in the medical journal and in the first 100 words of an article in the 
airline magazine. Mary then used statistical software to produce the 
histograms shown. 


Medical journal Airline magazine 


Percent 


0246 8101214 2 4 6 8 101214 
Word length 


305 


Let pMJis26~ "700 =? 


°0%uMJ = the true mean length of all words in 


305 
the medical journal and pAM1526 © ~ 0.200 = 20.0tam = the true mean 


length of all words in the airline —* Check if the conditions 


305 . 
for calculating a confidence interval for pMJ-pAMis26— "7? = °°.°% 
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45. 


46. 


!éMJ — FAM are met. 


pg 652 |») Is red wine better than white wine? Observational 


studies suggest that moderate use of alcohol by adults reduces heart 
attacks and that red wine may have special benefits. One reason may 
be that red wine contains polyphenols, substances that do good things 
to cholesterol in the blood and so may reduce the risk of heart attacks. 
In an experiment, healthy men were assigned at random to drink half 
a bottle of either red or white wine each day for two weeks. The level 
of polyphenols in their blood was measured before and after the 2- 
week period. Here are the percent changes in polyphenols for the 
subjects in each group:22 


Red wine 3.5 8.1 7.4 4.0 0.7 49 84 7.0. 5.5 
White wine 3.1 05 -38 41 06 2.7 19 -5.9 0.1 


a. A dotplot of the data is shown, along with summary statistics. 
Write a few sentences comparing the distributions. 


Red wine = coos 3 


‘ A a J e* ee¢ ¢ @@¢ o 
White wine — T T T T 


-10 5 0 5 10 
Percent change in polyphenols 
Group name n Mean SD Min Qi Med Q3 Max 


Red wine 9 5.500 2.517 0.7 3.75 5.5 7.75 84 
White wine 9 0.233 3.292 -5.9 =<2.20 05 290 4.1 


b. Construct and interpret a 90% confidence interval for the 


difference in true mean percent change in polyphenol levels for 
healthy men like the ones in this study when drinking red wine 
versus white wine. 


Tropical flowers Different varieties of the tropical flower Heliconia 
are fertilized by different species of hummingbirds. Researchers 
believe that over time, the lengths of the flowers and the forms of the 
hummingbirds’ beaks have evolved to match each other. Here are 
data on the lengths in millimeters for random samples of two color 


varieties of the same species of flower on the island of Dominica:22 
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H. caribaea red 
41.90 42.01 41.93 43.09 41.17 41.69 39.78 40.57 
39.63 42.18 40.66 37.87 39.16 37.40 38.20 38.07 
38.10 37.97 38.79 38.23 38.87 37.78 38.01 


H. caribaea yellow 
36.78 37.02 36.52 36.11 36.03 35.45 38.13 37.10 
35.17 36.82 36.66 35.68 36.03 34.57 34.63 


a. A dotplot of the data is shown, along with summary statistics. 
Write a few sentences comparing the distributions. 


e +4 eee $ eee ese © 
Red = : 
e. 
J ee 
ee ¢ 6e@ @ ece8 oO 


Yellow 4 T T T ] T T T T T 
34 35 36037 38 39 40 41 42 43 44 
Length (mm) 


Group n Mean SD Min Qi Med Q3 Max 
Red 23 39.698 1.786 37.4 38.07 39.16 41.69 43.09 


Yellow 15 3618 0.975 34.57 35.45 36.11 3682 38.13 


b. Construct and interpret a 95% confidence interval for the 
difference in the true mean lengths of these two varieties of 
flowers. 


Paying for college College financial aid offices expect students to 
use summer earnings to help pay for college. But how large are these 
earnings? One large university studied this question by asking a 
random sample of 1296 students who had summer jobs how much 
they earned. The financial aid office separated the responses into two 
groups based on gender, so these can be viewed as independent 
samples. Here are the data in summary form:24 


Group n _ 305 _ 9.200 = 20.0% x Sx 
XxX 1526 4 


Males 675 $1884.52 $1368.37 
Females 621 $1360.39 $1037.46 


a. How can you tell from the summary statistics that the distribution 
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48. 


49. 


of earnings in each group is strongly skewed to the right? The use 
of two-sample t procedures is still justified. Why? 


b. Construct and interpret a 90% confidence interval for the 


difference between the true mean summer earnings of male and 
female students at this university. 


c. Interpret the 90% confidence level in the context of this study. 


Beta blockers In a study of heart surgery, one issue was the effect of 
drugs called beta blockers on the pulse rate of patients during surgery. 
The available subjects were randomly assigned into two groups. One 
group received a beta blocker; the other group received a placebo. 
The pulse rate of each patient at a critical point during the operation 
was recorded. Here are the data in summary form: 


Grou n 305 9 2 0% Ss 
p x 16g6 0.200 = 20.0 oF x 


Beta blocker 30 65.2 7.8 
Placebo 30 70.3 8.3 


a. The distribution of pulse rate in each group is not Normal. The use 


of two-sample t procedures is still justified. Why? 


b. Construct and interpret a 99% confidence interval for the 


difference in mean pulse rates for patients like these who receive a 
beta blocker or a placebo. 


c. Interpret the 99% confidence level in the context of this study. 


Reaction times Catherine and Ana wanted to know if student athletes 
(students on at least one varsity team) have faster reaction times than 
non-athletes. They took separate random samples of 33 athletes and 
30 non-athletes from their school and tested their reaction time using 
an online reaction test, which measured the time (in seconds) between 
when a green light went on and the subject pressed a key on the 
computer keyboard. A 95% confidence interval for the difference 
(Non-athlete — Athlete) in the mean reaction time was 0.018+0.034 


305 


i526 2200 = 20.0% 018 + 0.034 seconds. 
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a. Does the interval provide convincing evidence of a difference in 
the true mean reaction time of athletes and non-athletes? Explain 
your answer. 


b. Does the interval provide convincing evidence that the true mean 


reaction time of athletes and non-athletes is the same? Explain 
your answer. 


50. Bird eggs A researcher wants to see if birds that build larger nests lay 


ol. 


larger eggs. She selects two random samples of nests: one of small 
nests and the other of large nests. Then she weighs one egg (chosen at 
random if there is more than one egg) from each nest. A 95% 
confidence interval for the difference (Large — Small) between the 
mean mass (in grams) of eggs in small and large nests is 1.6+2.0 

529 = 0-200 = 20.0% § 49.9 


a. Does the interval provide convincing evidence of a difference in 


the true mean egg mass of birds with small nests and birds with 
large nests? Explain your answer. 


b. Does the interval provide convincing evidence that the true mean 


egg mass of birds with small nests and birds with large nests is the 
same? Explain your answer. 


pg 656 |») Sorting the music Student researchers Adam, Edward, 


and Kian wondered if music would affect performance for certain 
tasks. To find out, they had student volunteers sort a shuffled set of 
26 playing cards by face value and by color. Nineteen of the 38 
volunteers were randomly assigned to listen to music during the 
sorting, while the others listened to no music. Here are parallel 
boxplots of the time in seconds that it took to sort the cards for the 
students in each group: 
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a2. 


53. 


2 

~ a7 7 

— 

cs) 

E 

Bo 

22 

a 
° 
vA 


0 20 40 60 80 100 120 140 160 
Time to sort (sec) 


Do these data give convincing evidence of a difference in the true 


ad 


305 
mean sorting times at the a=0.101526 
significance level? 


= (0.2 — 9 % 
0.200 = 20.0 ~ = 0.10 


a. State appropriate hypotheses for performing a significance test. Be 
sure to define the parameters of interest. 


b. Check if the conditions for performing the test are met. 


Ice cream For a Statistics class project, Jonathan and Crystal held an 
ice-cream-eating contest. They randomly selected 29 males and 35 
females from their large high school to participate. Each student was 
given a small cup of ice cream and instructed to eat it as fast as 
possible. Jonathan and Crystal then recorded each contestant’s gender 
and time (in seconds), as shown in the dotplots. 


: . se 
Male a ee ee 8s #e¢ eee s 


Gender 


15 3 45 60 75 90 105 


Time to eat ice cream (sec) 


o 


Do these data give convincing evidence of a difference in the 


305 
=.= 0,200 = 20.0% ce 
population means at the a=0.101526 a = 0.10 significance 


level? 


a. State appropriate hypotheses for performing a significance test. Be 
sure to define the parameters of interest. 


b. Check if the conditions for performing the test are met. 


Happy customers As the Hispanic population in the United States 
has grown, businesses have tried to understand what Hispanics like. 
One study interviewed separate random samples of Hispanic and 
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Anglo customers leaving a bank. Customers were classified as 
Hispanic if they preferred to be interviewed in Spanish or as Anglo if 
they preferred English. Each customer rated the importance of several 
aspects of bank service on a 10-point scale.22 Here are summary 
results for the importance of “reliability” (the accuracy of account 
records, etc.): 


Grou n 305 ‘ anne s 

p x isae 0.200 = 20.0 om x 
Anglo 92 6.37 0.60 
Hispanic 86 5.91 0.93 


Researchers want to know if there is a difference in the mean 
reliability ratings of all Anglo and Hispanic bank customers. 


a. State appropriate hypotheses for performing a significance test. Be 
sure to define the parameters of interest. 


b. Check that the conditions for performing the test are met. 


Does music help or hinder memory? Many students at Matt’s 
school claim they can think more clearly while listening to their 
favorite kind of music. Matt believes that music interferes with 
thinking clearly. To find out which is true, Matt recruits 84 volunteers 
and randomly assigns them to two groups. The “Music” group listens 
to their favorite music while playing a “match the animals” memory 
game. The “No Music” group plays the same game in silence. Here 
are some descriptive statistics for the number of turns it took the 
subjects in each group to complete the game (fewer turns indicate a 
better performance): 


Group Sample size Mean 
Music 305 _ 0.200 = 20.0%. — 49 — 305 _ 9.200 = 20.0% 
n1=421526 “ean, = 42 x-1=15.8331526 canis 
305 305 
—— = 0.200 = 20.0%, — z = —— = 0.200 = 20.0% 
n2=47 1526 no — 42 x 2=13.7141526 E2 = 13.714 


music 


Matt wants to know if listening to music affects the average number 
of turns required to finish the memory game for students like these. 
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a. State appropriate hypotheses for performing a significance test. Be 
sure to define the parameters of interest. 


b. Check if the conditions for performing the test are met. 
99. Dg 657 |») Happy customers Refer to Exercise 53. 


a. Explain why the sample results give some evidence for the 
alternative hypothesis. 


b. Calculate the standardized test statistic and P-value. 


c. What conclusion would you make? 


56. Does music help or hinder memory? Refer to Exercise 54. 


a. Explain why the sample results give some evidence for the 
alternative hypothesis. 


b. Calculate the standardized test statistic and P-value. 
c. What conclusion would you make? 


57. pg 659 |) Fish oil To see if fish oil can help reduce blood 


pressure, males with high blood pressure were recruited and randomly 
assigned to different treatments. Seven of the men were assigned to a 
4-week diet that included fish oil. Seven other men were assigned to a 
4-week diet that included a mixture of oils that approximated the 
types of fat in a typical diet. Each man’s blood pressure was 
measured at the beginning of the study. At the end of the 4 weeks, 
each volunteer’s blood pressure was measured again and the 
reduction in diastolic blood pressure was recorded. These differences 
are shown in the table. Note that a negative value means that the 
subject’s blood pressure increased.2© 


Fish oil 8 12 10 14 2 0 
Regular oil -6 0 1 2 -3 -4 


a. Do these data provide convincing evidence that fish oil helps 
reduce blood pressure more, on average, than regular oil? 
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b. Interpret the P-value from part (a) in the context of this study. 


58. Baby birds Do birds learn to time their breeding? Blue titmice eat 
caterpillars. The birds would like lots of caterpillars around when 
they have young to feed, but they must breed much earlier. Do the 
birds learn from one year’s experience when to time their breeding 
next year? Researchers randomly assigned 7 pairs of birds to have the 
natural caterpillar supply supplemented while feeding their young and 
another 6 pairs to serve as a control group relying on natural food 
supply. The next year, they measured how many days after the 
caterpillar peak the birds produced their nestlings.22 The investigators 
expected the control group to adjust their breeding date the following 
year, whereas the well-fed supplemented group had no reason to 
change. Here are the data (days after caterpillar peak): 


Control 4.6 2. “Vel 6.0 46 -1.2 
Supplemented 15.5 113 54 165 11.3 11.4 7.7 


a. Do the data provide convincing evidence that birds like these that 
have to rely on the natural food supply produce their nestlings 
closer to the caterpillar peak, on average, than birds like these that 
have the caterpillar supply supplemented? 


b. Interpret the P-value from part (a) in the context of this study. 


59. Who talks more—men or women? Researchers equipped random 
samples of 56 male and 56 female students from a large university 
with a small device that secretly records sound for a random 30 
seconds during each 12.5-minute period over 2 days. Then they 
counted the number of words spoken by each subject during each 
recording period and, from this, estimated how many words per day 
each subject speaks. The female estimates had a mean of 16,177 
words per day with a standard deviation of 7520 words per day. For 
the male estimates, the mean was 16,569 and the standard deviation 
Was 9108. Do these data provide convincing evidence at the a=0.05 
Tg a = 005 significance level of a difference in the 
average number of words spoken in a day by all male and all female 


students at this university? 
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62. 


63. 


Gray squirrel In many parts of the northern United States, two color 
variants of the Eastern Gray Squirrel— gray and black—are found in 
the same habitats. A scientist studying squirrels in a large forest 
wonders if there is a difference in the sizes of the two color variants. 
He collects random samples of 40 squirrels of each color from a large 
forest and weighs them. The 40 black squirrels have a mean weight of 
20.3 ounces and a standard deviation of 2.1 ounces. The 40 gray 
squirrels have a mean weight of 19.2 ounces and a standard deviation 
of 1.9 ee Do these data provide convincing evidence at the 
a=0.011526 x = 0.01 significance level of a difference in 
the mean weights of all gray and black Eastern Gray Squirrels in this 
forest? 


= 0.200 = 20.0% 
Cc 


Who talks more—men or women? Refer to Exercise 59. 


a. Construct and interpret a 95% confidence interval for the 
difference between the true means. If you already defined 
parameters and checked conditions in Exercise 59, you don’t need 
to do them again here. 


b. Explain how the confidence interval provides more information 


than the test in Exercise 59. 


Gray squirrel Refer to Exercise 60. 


a. Construct and interpret a 99% confidence interval for the 
difference between the true means. If you already defined 
parameters and checked conditions in Exercise 60, you don’t need 
to do them again here. 


b. Explain how the confidence interval provides more information 


than the test in Exercise 60. 


Teaching reading An educator believes that new reading activities in 
the classroom will help elementary school pupils improve their 
reading ability. She recruits 44 third-grade students and randomly 
assigns them into two groups. One group of 21 students does these 
new activities for an 8-week period. A control group of 23 third- 
graders follows the same curriculum without the activities. At the end 
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of the 8 weeks, all students are given the Degree of Reading Power 
(DRP) test, which measures the aspects of reading ability that the 
treatment is designed to improve. Here are parallel boxplots of the 
data:=2 


Activities 


Group 


Control 


DRP score 


a. Write a few sentences comparing the DRP scores for the two 
groups. 


After checking that the conditions for inference are met, the 


305 
—— = 0.200 = 20.0% 
educator performs a test of HO: pA-pC=01526 
305 
ee —— = 0.200 = 20.0% 
Ao: #4 ~ to = 9 versus Ha: wA-pC>01526 


: 305 ihe 
Hq: wa ~ Uc > 9 where 1A 1526 “ “FA = the true mean 


DRP score of third-graders like these who do the new reading 

305 iis 
activities and C1526 “HC = the true mean DRP score of 
third-graders like these who follow the same curriculum without 


the activities. Computer output from the test is shown. 


0.200 = 2 


= 0.200 = 20.07 


Two-sample T for DRP score 


Group N Mean StDev SE Mean 
Activities 21 51.5 11.0 2.4 
Control 23 41.5 17.1 3.6 


T-Value = 2.31 P-Value= 0.013 DF= 37 


b. What conclusion should the educator make at the a=0.05 
1526 700 = 20-.°q — 0.05 significance level? 


c. Can we conclude that the new reading activities caused an increase 
in the mean DRP score? Explain your answer. 


d. Based on your conclusion in part (b), which type of error—a Type 
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I error or a Type II error—could you have made? Explain your 
answer. 


64. Does breast-feeding weaken bones? Breast-feeding mothers secrete 
calcium into their milk. Some of the calcium may come from their 
bones, so mothers may lose bone mineral. Researchers compared a 
random sample of 47 breast-feeding women with a random sample of 
22 women of similar age who were neither pregnant nor lactating. 
They measured the percent change in the bone mineral content 
(BMC) of the women’s spines over 3 months. Here are comparative 
boxplots of the data:22 


Breast- 
feeding | i 
Not | | 
pregnant 


-10 8 4 + 2 0 2 4 
Percent change in BMC 


a. Write a few sentences comparing the percent changes in BMC for 
the two groups. 


After checking that the conditions for inference are met, the 
305 

researchers perform a test of HO: pBF-pNP=01526 
re 


= 0.200 = 20.0% 


305 
a ‘anit ——= 0.200 = 20.0% 4 pp 
Hq: HBr — HNP <9 | where pBFi526 HBF = the true 


mean percent change in BMC for breast-feeding women and pNP 


305 
0.200 = 20.0% 4 rp 
1526 HNP = the true mean percent change in BMC for 


women who are not pregnant or lactating. Computer output from 
the test is shown. 


Two-sample T for BMC change 


Group N Mean StDev SE Mean 
Breastfeed 47 —3.59 2.51 0.37 
Notpregnant 22 0.31 1.30 0.28 


T-Value = -8.50 P-Value = 0.000 DF = 66 
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b. What conclusion should the researchers make at the a=0.05 
305 ; pane 
1526 700 = 20-.°q — 0.05 significance level? 
c. Can we conclude that breast-feeding causes a mother’s bones to 


weaken? Why or why not? 


d. Based on your conclusion in part (b), which type of error—a Type 
I error or a Type II error—could you have made? Explain your 
answer. 


65. A better drug? In a pilot study, a company’s new cholesterol- 


66. 


reducing drug outperforms the currently available drug. If the data 
provide convincing evidence that the mean cholesterol reduction with 
the new drug is more than 10 milligrams per deciliter of blood 
(mg/dl) greater than with the current drug, the company will begin the 
expensive process of mass-producing the new drug. For the 14 
subjects who were assigned at random to the current drug, the mean 
cholesterol reduction was 54.1 mg/dl with a standard deviation of 
11.93 mg/dl. For the 15 subjects who were randomly assigned to the 
new drug, the mean cholesterol reduction was 68.7 mg/dl with a 
standard deviation of 13.3 mg/dl. Graphs of the data reveal no outliers 
or strong skewness. 


a. Carry out an appropriate significance test. What conclusion would 
you draw? (Note that the null hypothesis is not HO: p1-yp2=0 
305 7 
1526 0.200 — 20.0% Hy = [4] [lo = 0 .) 


b. Based on your conclusion in part (a), could you have made a Type 


I error or a Type IJ error? Justify your answer. 


Down the toilet A company that makes hotel toilets claims that its 
new pressure-assisted toilet reduces the average amount of water used 
by more than 0.5 gallon per flush when compared to its current 
model. To test this claim, the company randomly selects 30 toilets of 
each type and measures the amount of water that is used when each 
toilet is flushed once. For the current-model toilets, the mean amount 
of water used is 1.64 gal with a standard deviation of 0.29 gal. For the 
new toilets, the mean amount of water used is 1.09 gal witha 
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standard deviation of 0.18 gal. 


a. 


Carry out an appropriate significance test. What conclusion would 
you draw? (Note that the null hypothesis is not HO: p1-—y»2=0 
305 e 


Based on your conclusion in part (a), could you have made a Type 
I error or a Type IJ error? Justify your answer. 


Rewards and creativity Do external rewards—things like money, 
praise, fame, and grades—promote creativity? Researcher Teresa 
Amabile suspected that the answer is no, and that internal motivation 
enhances creativity. To find out, she recruited 47 experienced creative 
writers who were college students and divided them at random into 
two groups. The students in one group were given a list of statements 
about extrinsic reasons (E) for writing, such as public recognition, 
making money, or pleasing their parents. Students in the other group 
were given a list of statements about intrinsic reasons (I) for writing, 
such as expressing yourself and enjoying playing with words. Both 
groups were then instructed to write a poem about laughter. Each 
student’s poem was rated separately by 12 different poets using a 


creativity scale.22 These ratings were averaged to obtain an overall 
creativity score for each poem. The table shows summary statistics 
for the two groups. 


Group name n Mean SD 
Intrinsic 24 19.883 4.44 


Extrinsic 23 15.739 9.293 


We used software to randomly reassign the 47 subjects to the two 
groups 100 times, assuming the treatment received doesn’t affect each 
individual’s creativity rating. A dotplot of the simulated difference 


(Intrinsic—Extrinsic)1526 — 


305 


0.200 = 20.0%(Intrinsic — Extrinsic) j, mean 


creativity rating is shown. 
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5 4 343 2-1 0 1 @ 3 @ § 
Simulated difference (Intrinsic — Extrinsic) in 
mean creativity rating 
a. Why did researchers randomly assign the subjects to the two 
treatment groups? 


b. Estimate and interpret the P-value. 
c. What conclusion would you make? 


d. Based on your conclusion in part (c), could you have made a Type 
I error or a Type II error? Justify your answer. 


Sleep deprivation Does sleep deprivation linger for more than a day? 
Researchers designed a study using 21 volunteer subjects between the 
ages of 18 and 25. All 21 participants took a computer-based visual 
discrimination test at the start of the study. Then the subjects were 
randomly assigned into two groups. The 11 subjects in one group 
were deprived of sleep for an entire night in a laboratory setting. The 
10 subjects in the other group were allowed unrestricted sleep for the 
night. Both groups were allowed as much sleep as they wanted for the 
next two nights. On Day 4, all the subjects took the same visual 
discrimination test on the computer. Researchers recorded the 
improvement in time (measured in milliseconds) from Day 1 to Day 4 
on the test for each subject.2! The table shows summary statistics for 
the two groups. 


Group Mean SD 
Unrestricted sleep 19.62 12.17 
Sleep-deprived 3.90 14.73 


We used software to randomly reassign the 21 subjects to the two 
groups 100 times, assuming the treatment received doesn’t affect each 
individual’s time improvement on the test. A dotplot of the simulated 
difference (Unrestricted — Sleep-deprived) in mean time improvement 
is shown. 
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Simulated difference (Unrestricted — Sleep-deprived) 
in mean time improvement 
a. Explain why the researchers didn’t let the subjects choose whether 
to be in the sleep-deprivation group or the unrestricted sleep group. 


b. Etimate and interpret the P-value. 
c. What conclusion would you make? 


d. Based on your conclusion in part (c), could you have made a Type 
I error or a Type II error? Justify your answer. 


Multiple Choice Select the best answer for Exercises 69-72. 


69. A random sample of 30 words from Jane Austen’s Pride and 
Prejudice had a mean length of 4.08 letters with a standard deviation 
of 2.40. A random sample of 30 words from Henry James’s What 
Maisie Knew had a mean length of 3.85 letters with a standard 
deviation of 2.26. Which of the following is a correct expression for 
the 95% confidence interval for the difference in mean word length 
for these two novels? 


a. (4. 08-3. 85)+2. 576(2. 4030+2. 2630)i520 526 0.200 = 20.0% 


(4.08 — 3.85) + 2.576 (242 + 228) 
V30 | «30 


305 i ; “ 
b. (4.08-3.85)+2.045(2.4030+2.2630)i526 °° = *0.°" 
2.40 2.26 
(4.08 — 3.85) + 2.045 (34 O 4 238) 
= 0.200 = 20.0% 


c. (4.08-3.85)+2.045(2.40229+2. 26229)i5%0 
(4.08 — 3.85) + 2.045 (V ai" =) 


0.200 = 20.0% 


d. (4.08-3.85)+2.045(2.40230+2. 26230)i520~ 
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30 30 


(4.08 — 3.85) + 2.045 (= 7. | 
e. (4.08-3.85)+2.576(2.40230+2. 26230)i820 = 0.200 = 20.0% 
(4.08 — 3.85) + 2.576 (V 2.407 2.26? ) 


30 30 


Exercises 70—72 refer to the following setting. A study of road rage asked 
random samples of 596 men and 523 women about their behavior while 
driving. Based on their answers, each person was assigned a road rage 
score on a scale of 0 to 20. The participants were chosen by random digit 
dialing of phone numbers. The oo performed a test of the 


following hypotheses: H0:M= uF igs 0.200 = 20.0% Hy : tM = HF versus 

Ha: uM Fisa0 an AH, = tie > Be 

70. Which of the following describes a Type II error in the context of this 
study? 


a. Finding convincing evidence that the true means are different for 
males and females, when in reality the true means are the same 


b. Finding convincing evidence that the true means are different for 
males and females, when in reality the true means are different 


c. Not finding convincing evidence that the true means are different 
for males and females, when in reality the true means are the same 


d. Not finding convincing evidence that the true means are different 
for males and females, when in reality the true means are different 


e. Not finding convincing evidence that the true means are different 
for males and females, when in reality there is convincing evidence 
that the true means are different 


71. The P-value for the stated hypotheses is 0.002. Interpret this 
value in the context of this study. 


a. Assuming that the true mean road rage score is the same for males 
and females, there is a 0.002 probability of getting a difference in 
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Sample means equal to the one observed in this study. 


b. Assuming that the true mean road rage score is the same for males 
and females, there is a 0.002 probability of getting a difference in 
sample means at least as large in either direction as the one 
observed in this study. 


c. Assuming that the true mean road rage score is different for males 
and females, there is a 0.002 probability of getting a difference in 
sample means at least as large in either direction as the one 
observed in this study. 


d. Assuming that the true mean road rage score is the same for males 
and females, there is a 0.002 probability that the null hypothesis is 
true. 


e. Assuming that the true mean road rage score is the same for males 
and females, there is a 0.002 probability that the alternative 
hypothesis is true. 


72. Based on the P-value in Exercise 71, which of the following must be 
true? 
305 e 
a. A 90% confidence interval for HM-pF1526 — Sas I ae 
will contain 0. 


305 
b. A 95% confidence interval for j~M-pF 1526 
will contain 0. 


‘ ‘ or 
= 0.200 = 20.0 “1 in4 — pir 


305 
c. A 99% confidence interval for phM- AF I536 Soames AN aa le 
will contain 0. 
305 
d. A 99.9% confidence interval for hM-pF1526 
will contain 0. 


= 0.200 = 20.0% py) — pup 
e. It is impossible to determine whether any of these statements is 


true based only on the P-value. 


Recycle and Review 
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74. 


Quality control (2.2, 5.3, 6.3, 7.3) Many manufacturing companies 
use Statistical techniques to ensure that the products they make meet 
standards. One common way to do this is to take a random sample of 
products at regular intervals throughout the production shift. 


Assuming that the process is working properly, the mean 
305 


— = 0.200 = 20.0%— . 
measurement x 1526 © from a random sample varies 
— 305 _ 0.200 = 20.0% 
according to a Normal distribution with mean px 1526 ~~~ “Hz 


305 
and standard deviation ox 1526 . For each question 
that follows, assume that the process is working properly. 


= 0.200 = 20.0 %q— 


a. What’s the probability that at least one of the next two sample 


305 
means will fall more than 20x 1526 — 
305 


mean [xX 1526 


5 an A _ 
0.200 = 20.0%20— from the target 


= 0.200 = 20.0 %,,— 
0.200 = 20.0% p,— ? 


b. What’s the probability that the first sample mean that is greater 


305 
— ——— = 0.200 = 20.0% ,,— = 
than ux +20x 1526 Hz + 20% is the one from the 


fourth sample taken? 


Plant managers are trying to develop a criterion for determining 

when the process is not working properly. One idea they have is to 

look at the 5 most recent sample means. If at least 4 of the 5 fall 
305 sid 

outside the interval (x —ox , ux tox )isae 70? = 700% 

(Mz — 9%, Mz +97) | they will conclude that the process isn’t 


working. 


c. Find the probability that at least 4 of the 5 most recent sample 


means fall outside the interval, assuming the process is working 
properly. Is this a reasonable criterion? Explain your reasoning. 


Stop doing homework! (4.3) Researchers in Spain interviewed 7725 
13-year-olds about their homework habits—how much time they 
spent per night on homework and whether they got help from their 
parents or not—and then had them take a test with 24 math questions 
and 24 science questions. They found that students who spent 
between 90 and 100 minutes on homework did only a little better on 
the test than those who spent 60 to 70 minutes on homework. Beyond 
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100 minutes, students who spent more time did worse than those who 
spent less time. The researchers concluded that 60 to 70 minutes per 
night is the optimum time for students to spend on homework.“? Is it 
appropriate to conclude that students who reduce their homework 
time from 120 minutes to 70 minutes will likely improve their 
performance on tests such as those used in this study? Why or why 
not? 
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SECTION 10.3 Comparing Two Means: Paired Data 


LEARNING TARGETS By the end of the section, you should be able to: 


e Analyze the distribution of differences in a paired data set using 
graphs and summary Statistics. 


Construct and interpret a confidence interval for a mean difference. 
Perform a significance test about a mean difference. 


Determine when it is appropriate to use paired t procedures versus 
two-sample t procedures. 


Section 10.2 showed you how to perform inference about the difference 
between two means when data come from two independent random 
samples or two groups in a randomized experiment. What if we want to 
compare means in a setting that involves measuring a quantitative variable 
twice for the same individual or for two individuals who are much alike? 
For instance, a researcher studied a random sample of identical twins who 
had been separated and adopted at birth. In each case, one twin (Twin A) 
was adopted by a high-income family and the other (Twin B) by a low- 
income family. Both twins were given an IQ test as adults. Here are their 


scores:22 


Evan Hurd Photography/Getty Images 


Pair 1 2 3 4 5 6 7 8 9 10 11 12 


Twin A’s IQ (high-income 128 104 108 100 116 105 100 100 103 124 114 112 
family) 


Twin B’s IQ (low-income 120 99 99 94 111 97 99 94 104 114 113 100 
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family) 


Notice that these two groups of IQ scores did not come from 
independent samples of people who were raised in low-income and high- 
income families. The data were obtained from pairs of very similar people 
(identical twins), one living with a low-income family and the other living 
with a high-income family. This set of IQ scores is an example of paired 
data. 


DEFINITION Paired data 


Paired data result from recording two values of the same quantitative 
variable for each individual or for each pair of similar individuals. 


This section focuses on how to analyze paired data and how to perform 
inference about a true mean difference. 


Analyzing Paired Data 


The graph in Figure 10.7 shows a parallel dotplot of the IQ scores from the 
study of identical twins. We can see that the nalts raised in high-income 


households had a higher mean IQ (x A=109. S536 59g = 9-200 = 20.043, = 109.5 
) than the twins raised in low-income households (x B=103.667 
305 


——= 0.200 = 20.0%5 
1526 0.200 = 20.0 ‘ER 


= 103.667). There is a similar amount of variability in 
305 
IQ scores for me two groups of twins: sA=9.471526 07? = 7 0%s, = 9.47 


and sB=8. 66in2e 0.200 = 20.0%s = 8.66 But with so much overlap 
between the groups, the difference in means does not seem statistically 
significant. 


Twin A ~ 
(high-income family) — a 
Twin B 7“ 
(low-income family) ——**;—__;*_}___I 
90 95 100105 110 115 120 125 130 
IQ score 


FIGURE 10.7 Parallel dotplots of the IQ scores for pairs of identical twins raised 
in high-income (Twin A) and low-income (Twin B) households. 


1476 


The previous analysis ignores the fact that these are paired data. Let’s 
look at the difference in IQ scores for each pair of twins. 


Pair 1 2 3 4 5 6 7 8 9 10 11 «12 


Twin A’s IQ (high- 128 104 108 100 116 105 100 100 103 124 114 112 
income family) 


Twin B’sIQ(low- 120 99 99 94 111 97 99 94 104 114 113 100 
income family) 


Difference (A — B) 8 5 ES 6 5 8 1 6 -1 10 1 i 


The dotplot in Figure 10.8 displays these differences. Almost all of the 
differences (11 out of 12) are positive. This graph gives strong evidence 
that identical twins raised in a high-income household have higher IQ 
scores as adults, on average, than identical twins raised in a low-income 
household. 


& . 
° ° 6 6 ee 6 ° 
T T T T T T 
—2 0 2 4 6 8 10 12 


Difference (Twin A — Twin B) in IQ scores 
FIGURE 10.8 Dotplot of difference in IQ scores for each pair of twins. 


The mean difference in IQ scores is 


305 ; saritecies 
x diff=x A-B=8+5+9+ --- +1+1212=7012=5.833 pointsis2s "= °?-°* 


8+5+9+-+-+14+12 70 
Laitt = TAB SS Ti = 5.833 points 


This value tells us that the IQ score of the twin in each pair who was raised 
in a high-income household is 5.83 points higher than the twin who was 
raised in a low-income household, on average. 


The standard deviation of the difference in IQ scores is sdiff=3.93 
305 


1526-200 = 20.0% sai = 3.93 points. This value is much smaller than the 
standard deviations we computed earlier when we (incorrectly) viewed the 
305 


two groups of twins as unrelated: sA=9.471526 0700 = 90-0%s, = 9.47 


points and sB=8. 661520 0.200 = 20.0%5, = 8.66 points. Oo Remember: 


The proper method of analysis depends on how the data are produced. 


ANALYZING PAIRED DATA 
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To analyze paired data, start by computing the difference for each pair. 
Then make a graph of the differences. Use the mean difference x diff 


305 
—— = 0.200 = 20.0%% 
Tr ia 


305 
1526 


diff and the standard deviation of the differences sdiff 
= 0.20 


0 = 20.0%... ae 
*Sdiff as summary Statistics. 


EXAMPLE | Math and music (>) 
Analyzing paired data 


gawrav/Getty Images 


PROBLEM: Does music help or hinder performance in math? Student 
researchers Abigail, Carolyn, and Leah designed an experiment using 30 
student volunteers to find out. Each subject completed a 50-question 
single-digit arithmetic test with and without music playing. For each 
subject, the order of the music and no music treatments was randomly 
assigned, and the time to complete the test in seconds was recorded for 
each treatment. Here are the data, along with the difference in time for 
each subject: 


Student 2 Ae? iG 7 SU eo POU tS eS 


Time with 83 119 77 75 64 106 70 69 60 76 47 97 68 77 48 
music (sec) 

Time 70106 71 67 59 112 83 69 65 83 38 90 76 68 50 
without 

music (sec) 

Difference 13°13 /6.°8 5 =6-13 90 6=5."=7 - 9 97 =8 9 =) 
(Music — 

Without 

music) 
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Student 16 17° 18 19 20 21 22. 23 24 25 26 27 28 29 30 
Time with 78113 71 77 #37 #SO 58 52 47 71146 44 53 57 39 
music (sec) 


Time 739359) 70 39) 52) G0 54 951 60 141 AN) 56 53 37 
without 
music (sec) 


Difference BS 20) 12) 7 2 2) 2 2 e  , 24 S AD 
(Music — 
Without 
music) 


a. Make a dotplot of the difference (Music — Without music) in time for 
each subject to complete the test. 


b. Describe what the graph reveals about whether music helps or 
hinders math performance. 


c. Calculate the mean difference and the standard deviation of the 
differences. Interpret the mean difference. 


SOLUTION: 


AO -15 -170 =F 0 5 10 15 20 


Difference (Music — Without music) 
in time (sec) 


b. There is some evidence that music hinders performance on the math 
test. 17 of the 30 subjects took longer to complete the test when 
listening to music. 


305 
ie — é E ——= 0.200 = 20.0% 
c. Mean: x diff=x Music — Without=2.81526 


305 
Eaitt = FMusic — Without = 2.8 seconds, SD: sdiff=7.491526 
saitt = 7-49 seconds. 

The time it took these 30 students to complete the arithmetic quiz 


with music was 2.8 seconds longer, on average, than the time it took 
without the music. 


= 0.200 = 20.0% 


To get these summary statistics using the TI-83/84, start by typing the 
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| difference values into L1. Then do 1-Var Stats. | 


FOR PRACTICE, TRY EXERCISE 75 


There are two ways that a statistical study involving a single 
quantitative variable can yield paired data: 


_. Researchers can record two values of the variable for each individual. 


’. The researcher can form pairs of similar individuals and record the 
value of the variable once for each individual. 


We have seen one example of each method so far. The observational study 
of identical twins’ IQ scores used Method 2, with the pairs consisting of 
identical twins who were raised separately—one in a high-income 
household and one in a low-income household. The experiment 
investigating whether music helps or hinders learning used Method 1, with 
each subject taking a 50-question single-digit arithmetic test twice—once 
with music playing and once without—in a random order. We referred to 
this type of experiment as a matched pairs design in Chapter 4. Note that it 
is also possible to carry out a matched pairs experiment using Method 2 if 
the researcher forms pairs of similar subjects and randomly assigns each 
treatment to exactly one member of every pair. 


305 sinei@®. 
Confidence Intervals for jidiffiszs~ °?°° ~ °°°°4#i 


When paired data come from a random sample or a randomized 
0.200 = 20.0% 3% 


05 
experiment, the statistic x diffi526 


305 
; = 0.200 = 20.0%), .. ; 
the true mean difference pdiff1526 “diff, Before constructing a 


confidence interval for a mean difference, we must check that the 

conditions for performing inference are met. Aside from the paired data 
requirement, these conditions are the same as the ones for constructing a 
> _ 0.200 = 20.0% py 


diff is a point estimate for 


a 
confidence interval for 11526 — in Section 8.3. 


CONDITIONS FOR CONSTRUCTING A CONFIDENCE 
INTERVAL ABOUT A MEAN DIFFERENCE 
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e Random: Paired data come from a random sample from the 
population of interest or from a randomized experiment. 


o 10%: When sampling without replacement, ndiff<0.10Ndiff 
305 _ 
1526 > 0.200 = 20.0 “ony 3 6¢ < 0.10. Noite. 


e Normal/Large Sample: The population distribution of differences 
(or the true distribution of differences in response to the treatments) 
is Normal or the number of differences in the sample is large 


Sees Be 
(ndiff>30)1526 © 0.200 = 20.0%(nait > 30) Tf the population (true) 
distribution of differences has unknown shape and the number of 


differences in the sample is less than 30, a graph of the sample 
differences shows no strong skewness or outliers. 


The Random condition reminds us that we need paired data to 


305 
construct a confidence interval for pdiffi526 ~ , diff, Both the 
10% and Normal/Large Sample conditions emphasize that the proper 
method of analyzing paired data is to focus on the differences within each 
pair. When the conditions are met, we can proceed to calculations. 


200 = 20.0% py 


All of the inference procedures you have learned so far require that 
individual observations be independent. That’s why we check the 10% 
condition when sampling without replacement. For paired data, the two 
values of the response variable in each pair are generally not independent. 
After all, we are measuring the same quantitative variable twice for one 
individual or for two very similar individuals. Knowing the value of the 
variable for one of the measurements should help us predict the value for 
the other measurement. What is important is that the difference values be 
independent. 


You learned how to construct a confidence interval for a population 
305 


—— = 0.200 = 20.0% 
mean [11526 ee 


in Chapter 8. Assuming that the population 

305 

ve = 0.200 = 20.0% 

standard deviation 01526 
305 

one-sample t1526 


formula is 


7 is unknown, we can calculate a 


= 0.200 = 20.0%, . 
t interval for the mean. The appropriate 


statistic+(critical value) ‘(standard deviation of statistic)=x +t*sxn 
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5 
305 _ 0.200 = 20.0% 


1526 
statistic -+ (critical value) - (standard deviation of statistic) 
8 
=F+t*— 
fn 


30 _ 9.200 = 20.0% 
We find the critical value t*1526 ~~ “"t:« for a given confidence level 


305 _ 0,200 = 20.0% 
from a ¢ distribution with df=n-11526. “df = n — 1 ysing Table B 
or technology. 


Now we want to estimate the true mean difference jdiff 


308 _ 0.200 = 20.0% 
i526 °~Sté‘zt*™:*S diff aed ON a Single sample of differences calculated 


from paired data. All we have to do is modify the above formula to fit the 
new Setting: 


statistic+(critical value) : (standard deviation of statistic) =x diff 
305 . 
+t*sdiffndiff 1526 °?00 = 200% 
305 : 
statistic + (critical value) - (standard deviation of statistic) ji > 0.200 = 20.0% 
Sdiff 
diff 
This can be referred to as a one-sample t interval for a mean difference 
or as a paired t interval for a mean difference. 


= Faig + t° 


ONE-SAMPLE t INTERVAL FOR A MEAN DIFFERENCE 
(PAIRED t INTERVAL FOR A MEAN DIFFERENCE) 


When the conditions are met, a C% confidence interval for pdiff 
305 wee 
1596 = 0-200 = 20.0% waite ia 
ES , *diff 
7 305 _ 9 200 = 20.0% af <— 
x diff+t*sdiffndiffis2 ““ =~” diff 
305. _ 9.200 = 20.0% 

where t*1526.-” = ~~ &° is the critical value with C% of its area 


7 4 
305 305 _ 6.200 = 20.0% 


a = = oO a * 
between —t¥is26 270 = 20-°—¢* and tis26 t* for the t 


305 sbeee 
distribution with ndiff—1i526 > °7°0 = ?-°*naise — 1 degrees of freedom. 
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As with any inference procedure, follow the four-step process. 


EXAMPLE | Which twin is smarter? (») fi. 
4 STEP 
Confidence interval for a mean difference 


Jodi Cobb/Getty Images 


PROBLEM: The data from the random sample of identical twins are 
shown again in the following table. Construct and interpret a 95% 
confidence interval for the true mean difference in IQ scores among 
twins raised in high-income and low-income households. 


Pair P27 34S 9657 78 9 10 1 A2 


Twin A’s IQ (high-income 128 104 108 100 116 105 100 100 103 124 114 112 
family) 


Twin B’s IQ (low-income 120 99 99 94111 97 99 94 104 114 113 100 
family) 


Difference (A — B) SO td Bato Ue oe, Bh eon bw ay Led 


SOLUTION: 


Follow the four-step process! 


| rE 

STATE: 95% CI for diff i526 °°? ~ °°°“maitt = the true mean 
difference (High income — Low income) in IQ scores for pairs of 
identical twins raised in separate households. 


| Be sure to indicate the order of subtraction when defining the parameter. | 
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| Then you can mimic the wording in your conclusion. | 


| 
PLAN: One-sample t interval for pdiffi526 — sila 7 


e Random? Random sample of 12 pairs of identical twins, one raised in 
a high-income household and the other in a low-income household. “ 


o 10%: Assume 12<10% of all pairs of identical twins raised in 
separate households. “ 


e Normal/Large Sample? The number of differences is small, but the 
dotplot doesn’t show any strong skewness or outliers. / 


% es e 
s 2 e. ¢ *¢ ¢ & 
<< 0 Za 4 6 8 10 2 


Difference (Twin A — Twin B) in IQ scores 


305 z 
DO: x diff = 5.833, Sdiff = 3.93, ndiff = 121526 "20? ~ 20.0% 
z diff = 5.833, Sait = 3.93, Tiff = 12 

305 
With 95% confidence and df = 12-1 = 11, t* = 2.201. 1526 
df = 12 — 1 = 11, t* = 2.201. 


The TInterval function on the TI-83/84 gives (3.338, 8.329). 


505 ann on nub. 83° 3.93 
eae era isco = 9 shibisni eases Via 


= 0.200 = 20.0% 


to 


is . diff 
Ce lees 


305 aad 
x diff+txsdiffndiffiszs °°? = 20°” V diff 


a 
= 5.833 +2.4971526  °-700 = 20-0 5 8334 2.497 
305 
= (3.336 8.330)i520 0.200 = 20.0%= (3.336, 8.330) 


CONCLUDE: We are 95% confident that the interval from 3.336 to 
8.330 captures the true mean difference (High income — Low income) 


305 where a ; 
1596 = 0-200 = 20.0% (High income — Low income) 


in IQ scores among pairs 
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of identical twins raised in separate households. 


FOR PRACTICE, TRY EXERCISE 79 


The 95% confidence interval in the example suggests that IQs are 
between 3.336 and 8.330 points higher, on average, for twins raised in 
high-income households. However, we can’t conclude that household 
income level caused an increase in average IQ score because this was an 
observational study, not an experiment. 


CHECK YOUR UNDERSTANDING 


( 


,, The data from the matched pairs experiment investigating whether 
music helps or hinders math performance (page 675) are reproduced 
here. 


Student 1 2°03 4 5 6 7 8 -9 10 11 12 13 1445 


Time with music 83119 77 75 64106 70 69 60 76 47 97 68 77 48 
(sec) 


Time without 70 106 71 67 59112 83 69 65 83 38 90 76 68 50 
music (sec) 


Difference (Music 13 13 6 8 5 -6-13 0-5-7 9 7 -8 9 -2 
— Without music) 


Student 16-17 18 18 20 21 (22°23 24525. 26 27 28 29° 30 


Time with music 78 113 71 77 37 50 58 52 47 71146 44 53 57 39 
(sec) 


Time without 73 93 59 70 39 52 60 54 51 60141 40 56 53 37 
music (sec) 


Difference (Music 5 20 12 7 -2 -2 -2 -2 -4 11 5 4-3 4 2 
-Without music) 


1. Construct and interpret a 90% confidence interval for the true mean 
difference. 


2. What does the interval in Question 1 suggest about whether music 
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helps or hinders math performance? Explain your answer. 


305 ae 
Significance Tests for jidiffiszs— °?°° ~ °°°*#itr 
When paired data come from a random sample or a randomized 
experiment, we may want to perform a significance test about the true 


05 0 
mean difference pdiffis2e ”"° ~ °°°*“aitt, The null hypothesis has the 
general form 

05 2 
HO: yidiff=hypothesized valueis2s "70? = 70°” 


Ho : faite = hypothesized value 


We’ ll focus on situations where the hypothesized difference is 0. Then the 
null hypothesis says that the true mean difference is 0: 


305 ; ; - 
HO: pdiff=O1526~ "70 = 70-°”Ho = faite = 0 


The alternative hypothesis says what kind of difference we expect. 


The conditions for performing a significance test about diff 


200 = 20.0%), 4. i 
1526 “diff are the same as the ones for constructing a 
confidence interval for a mean difference. 


CONDITIONS FOR PERFORMING A SIGNIFICANCE 
TEST ABOUT A MEAN DIFFERENCE 


e Random: Paired data come from a random sample from the 
population of interest or from a randomized experiment. 


© 10%: When sampling without replacement, ndiff<0.10Ndiff 
305 - 
1526 > 0.200 = 20.0 “ony a 6¢ < 0.10. Noite. 


e Normal/Large Sample: The population distribution of differences 
(or the true distribution of differences in response to the treatments) 
is Normal or the number of differences in the sample is large 

| 
(ndiff=30)1526 — 0.200 = 20.0%(nair > 30) Tf the population (true) 
distribution of differences has unknown shape and the number of 
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differences in the sample is less than 30, a graph of the sample 
differences shows no strong skewness or outliers. 


When conditions are met, we can carry out a one-sample t test for a 
mean difference (also known as a paired t test for a mean difference). 
The standardized test statistic is 


ead i 
Sdiff 
305, ae 
t=x diff—Osdiffndiffiszs "7"? = °°°” diff 


We can use Table B or technology to find the P-value from the t 


305, ae 
distribution with df=ndiff-1is26— °°"? ~ °°°*df = n 


aiff — 1 


ONE-SAMPLE t TEST FOR A MEAN DIFFERENCE 
(PAIRED t TEST FOR A MEAN DIFFERENCE) 


Suppose the conditions are met. To test the hypothesis HO: pdiff=0 
ae 7 
i526 700 = 20-0" Ho = aitt = 9 compute the standardized test statistic 


a la 


= S diff 
t=x diff-Osdiffndiffisae ~ 0-200 = 20.9% diff 


Find the P-value by calculating the probability of getting a t statistic 
this large or larger in the direction specified by the alternative 

308 _ 0.200 = 20.0% 
hypothesis Hais26 ~~ °H, 
305 


——= 0.200 = 20.0%, — 
1526 *naitt — 1 degrees of freedom. 


. Use the t distribution with ndiff-1 


As with any inference procedure, be sure to follow the four-step 
process when performing a test about a mean difference. 


EXAMPLE | Is caffeine dependence real? » 


Significance test about a mean difference 
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MrPants/Getty Images 


PROBLEM: Researchers designed an experiment to study the effects 
of caffeine withdrawal. They recruited 11 volunteers who were 
diagnosed as being caffeine dependent to serve as subjects. Each subject 
was barred from coffee, colas, and other substances with caffeine for the 
duration of the experiment. During one 2-day period, subjects took 
capsules containing their normal caffeine intake. During another 2-day 
period, they took placebo capsules. The order in which subjects took 
caffeine and the placebo was randomized. At the end of each 2-day 
period, a test for depression was given to all 11 subjects. Researchers 
wanted to know whether being deprived of caffeine would lead to an 


increase in depression.=4 


The table displays data on the subjects’ depression test scores. 
Higher scores show more symptoms of depression. 


Subject 2S eS eae Oe fee oe ctl) aed 
Depression (caffeine) B05 eA Oe BS Oar Oy 2a ily of 
Depression (placebo) 1623 5 7 14 24 6 3 15 12 0 


Do the data provide convincing evidence at the a=0.05 


305 
——. = 0.200 = 20.0% ee . 
1526 a = 0.05 significance level that caffeine withdrawal 


increases depression score, on average, for subjects like the ones in this 
experiment? 


SOLUTION: 


a | 
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| Follow the four-step process! | 


Pr 
STATE: HO: pdiff = 01526~ 0790 = °9-°”"°Ho : Maite = 0 


Start by calculating the difference in depression test scores for each 


subject. We chose the order placebo — caffeine for subtraction so that 
most values would be positive. 


305 
* ~ 9.200 = 20.0% . 
Ha: pidiff+01526 0.200 20.0 oH, ° Leaift } 0 


305 
é = 0.200 = 20.0% 1) 4: ¢¢ 4 
where idiff1526 Mdiff= the true mean difference (Placebo — 
Caffeine) in depression test score for subjects like these. Because no 
305 0.200 20.0 Oo 
significance level is given, we’ll use a= 0.051526 “a = 0.05, 


305 
PLAN: Paired t test for ydiffis2s °°"? ~ °° “Maite 


e Random: Researchers randomly assigned the treatments — placebo 
then caffeine, caffeine then placebo—to the subjects. / 


e Normal/Large Sample: The sample size is small, but the histogram of 
differences doesn’t show any outliers or strong skewness. “ 


Note that we do not have to check the 10% condition because we are not 
sampling without replacement from a finite population. 


Change in depression 
(Placebo - Caffeine) 


Because there is no strong skewness or outliers, it is plausible that the 


true distribution of difference (Placebo — Caffeine) in depression test 


scores for subjects like these is Normal. 
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DO: x diff = 7,3641525~ 92% Ean = 7.364, caift= 6.918 
305 305 
1526 ~ 200 = 20.0% sare = 6.918 neiff = 111506 ~ 200 = 20% naiee = 11 


The sample result gives some evidence in favor of Ha:pdiff>0 
= 0.200 = 20.0% 


305 8 
1596 7 0-200 = 20.04H7, : praise > O hecause 7.364>01596 


7.364 > 0, 


305 . nalievae 
e t= 7.364 — 06.91811=3.53156_ "= *°°* vil 


305, ro 
t=x diff-Osdiffndiffisas ~ °?00 = 200% 


305 ; ; = 
e P-value df =11—1= 101526 070 =7°°“df = 11 — 1 = 10 


t distribution 
with df = 10 


305 
The TTest function on the TI-83/84 gives t=3.531526 = Ree aU 


305 
t = 3.53 and P-value =0.00271526 ~ 2 = 2°-°*— 0.0027 


Using Table B: P-value is between 0.0025 and 0.005. 
Using Technology: tcdf(lower: 3.53, upper: 1000, df: 10)= 0.0027 


1526 0.200 = 20.0% __ — (),0027 


CONCLUDE: Because the P-value of 0.0027<a = 0. oSi526 seaciaencaitd 
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305 
0.0027 < a = 0.05, we reject H01526 . We have 
convincing evidence that caffeine withdrawal increases depression test 


score, on average, for subjects like the ones in this experiment. 


0.200 = 20.0 % Ay 


FOR PRACTICE, TRY EXERCISE 85 


Why did the researchers randomly assign the order in which subjects 
received placebo and caffeine in the example? Researchers want to be able 
to conclude that any statistically significant change in depression score is 
due to the treatments themselves and not to some other variable. One 
obvious concern is the order of the treatments. Suppose that caffeine were 
given to all the subjects during the first 2-day period. What if the weather 
were nicer on these 2 days than during the second 2-day period when all 
subjects were given a placebo? Researchers wouldn’t be able to tell if a 
large increase in the mean depression score is due to the difference in 
weather or due to the treatments. Random assignment of the caffeine and 
placebo to the two time periods in the experiment should help ensure that 
no other variable (like the weather) is systematically affecting subjects’ 
responses. 


Because researchers randomly assigned the treatments, they can make 
an inference about cause and effect. The data from this experiment provide 
convincing evidence that depriving caffeine-dependent subjects like these 
of caffeine causes an increase in depression scores, on average. 


The significance test in the example led to a simple decision: reject HO 


305 
= 0.200 = 20.0% ; 
1526 Ho. We know from past experience that a confidence 


interval gives more information than a test—it provides the entire set of 
plausible values for the parameter based on the data. A 90% confidence 


a 
y 


305 
. = 0.200 = 20.0% pp s:c¢ « 
interval for pdiffi526 Hadiff js 


Notice that we used a 90% confidence level here, which corresponds to the one-sided 


significance test with a = 0.05. 


x diff+t*sdiffndiff=7.364+1.8126.91811=7.364+3.780=(3.584,11.144) 


305 
”_ — 9.200 = 20.0% 
1526 
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si 6.918 
diff _ 7.364 + 1.812———- = 7.364 + 3.780 = (3.584,11.144) 
J" aiff vil 


E gift  t * 


We are 90% confident that the interval from 3.584 to 11.144 captures the 
true mean difference (Placebo — Caffeine) in depression test score for 
caffeine-dependent individuals like the ones in this study. The interval 
suggests that caffeine deprivation results in an average increase in 
depression test score of between 3.584 and 11.144 points for subjects like 
these. 


CHECK YOUR UNDERSTANDING 


(\ 


y Consumers Union designed an experiment to test whether nitrogen- 
filled tires would maintain pressure better than air-filled tires. They 
obtained two tires from each of several brands and then randomly 
assigned one tire in each pair to be filled with air and the other to be 
filled with nitrogen. All tires were inflated to the same pressure and 
placed outside for a year. At the end of the year, Consumers Union 
measured the pressure in each tire. The pressure loss (in pounds per 
square inch) during the year for the tires of each brand is shown in the 
table.22 


Courtesy of Judy Starnes 


Brand Air Nitrogen |Brand Air Nitrogen 


BF Goodrich Traction 7.6 Vs Pirelli P6 Four Seasons 4.4 4.2 
T/A HR 


Bridgestone HP50 3.8 25 Sumitomo HTR H4 1.4 2,1 
(Sears) 
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Bridgestone Potenza 3.7 1.6 Yokohama Avid H4S 4.3 3.0 
G009 


Bridgestone Potenza 4.7 1.5 BF Goodrich Traction 5.9 3.4 


RE950 T/A V 

Bridgestone Potenza 2.1 1.0 Bridgestone Potenza 4.1 Dass 

EL400 RE950 

Continental Premier 4.9 3.1 Continental 5.0 3.4 

Contact H ContiExtreme Contact 

Cooper Lifeliner 5.2 35 Continental 4.8 ee, 

Touring SLE ContiProContact 

Dayton DaytonaHR 3.4 a2 Cooper Lifeliner Sie 2.9 
Touring SLE 

Falken Ziex ZE-512 4.1 3.3 General Exclaim UHP 6.8 27. 

Fuzion Hrl 27 22 Hankook Ventus V4 Ball 1.4 
H105 

General Exclaim 3.1 3.4 Michelin Energy 2.9 15 
MXV4 Plus 

Goodyear Assurance 3.8 3.2 Michelin Pilot Exalto 6.6 22 

Tripletred A/S 

Hankook Optimo H418 3.0 0.9 Michelin Pilot HX 22 2.0 
MXM4 

Kumho Solus KH16 6.2 3.4 Pirelli P6 Four Seasons =. 2.5 zy 

Michelin Energy 2.0 1.8 Sumitomo HTRt 4.4 Bil 

MxvV4 

Michelin Pilot XGT iE 0.7 

H4 


Paired Data or Two Samples? 


In Section 10.2, we used two-sample t procedures to compare the mean 
change in blood pressure for healthy men who take calcium and for 
healthy men who take a placebo. These methods require data that come 
from independent random samples from the two populations of interest or 
from two groups in a randomized experiment (as was the case in the 
preceding example). When the conditions are met, we can perform 


305 
. : —— = 0.200 = 20.0% — ty - 
inference about the difference 1—p121526 #1 ~ #2 in the true 


means. 


1493 


In this section, we used paired t procedures to compare the mean 
change in depression scores for caffeine-dependent individuals when 
taking caffeine versus when taking a placebo. These methods require 
paired data that come from a random sample from the population of 
interest or from a randomized experiment (as was the case in this example 
because the same 11 subjects received both treatments). When the 
conditions are met, we can perform inference about the true mean 


20.07 aise o@ The proper inference method 


depends on how the data were produced. 


305 
: * ee = 0.200 = 
difference pdiffi526 


EXAMPLE | Are you all wet? (>) 


Two samples or paired data? 


PROBLEM: In each of the following settings, decide whether you 
should use two-sample t procedures to perform inference about a 
difference in means or paired t procedures to perform inference about a 
mean difference. Explain your choice. 


a. Before exiting the water, scuba divers remove their fins. A maker of 
scuba equipment advertises a new style of fins that is supposed to be 
faster to remove. A consumer advocacy group suspects that the time 
to remove the new fins may be no different than the time required to 
remove old fins, on average. Twenty experienced scuba divers are 
recruited to test the new fins. Each diver flips a coin to determine if 
they wear the new fin on the left foot and the old fin on the right foot, 
or vice versa. The time to remove each type of fin is recorded for 
every diver. 


b. To study the health of aquatic life, scientists gathered a random 
sample of 60 White Piranha fish from a tributary of the Amazon 
River during one year. The average length of these fish was 
compared to a random sample of 82 White Piranha from the same 
tributary a decade ago. 


c. Can a wetsuit deter shark attacks? A researcher has designed a new 
wetsuit with color variations that are suspected to deter shark attacks. 
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To test this idea, she fills two identical drums with bait and covers 
one in the standard black neoprene wetsuit and the other in the new 
suit. Over a period of one week, she selects 16 two-hour time periods 
and randomly assigns 8 of them to the drum in the black wetsuit. The 
other 8 are assigned to the drum with the new suit. During each time 
period, the appropriate drum is submerged in waters that sharks 
frequent and the number of times a shark bites the drum is recorded. 


SOLUTION: 


a. Paired t procedures. The data come from two measurements of the 
same variable (time to remove fin) for each diver. 


b. Two-sample t procedures. The data come from independent random 
samples of White Piranha in two different years. 


If the sample sizes are different, it can’t be paired data. 


c. Two-sample t procedures. The data come from two groups in a 
randomized experiment, with each group consisting of 8 time periods 
in which a drum with a specific wetsuit (standard or new) was 
randomly assigned to be submerged. 


FOR PRACTICE, TRY EXERCISE 91 


When designing an experiment to compare two means, a completely 
randomized design may not be the best option. A matched pairs design 
might be a better choice, as the following activity shows. 


ACTIVITY Get your heart beating! 


Are standing pulse rates higher, on average, than sitting pulse rates? In 
this activity, you will perform two experiments to try to answer this 
question. 


Experiment #1: Completely randomized design 


1495 


1. 


Your teacher will randomly assign half of the students in your class to 
stand and the other half to sit. Once the two treatment groups have 
been formed, students should stand or sit as required. Then they 
should measure their pulses for 1 minute. Have the subjects in each 
group record their data on the board. 


. Analyze the data for the completely randomized design. Make 


parallel dotplots and calculate the mean pulse rate for each group. Is 
there some evidence that standing pulse rates are higher, on average? 
Explain your answer. 


Experiment #2: Matched pairs design 


3. 


To produce paired data in this setting, each student should receive 
both treatments in a random order. Because you already sat or stood 
in Step 1, do the opposite now. As before, everyone should measure 
his or her pulse for 1 minute after the treatment is imposed (i.e., once 
everyone is standing or sitting). Then each subject should calculate 
his or her difference (Standing — Sitting) in pulse rate and record this 
value on the board. 


. Analyze the data for the matched pairs design. Make a dotplot of 


these differences and calculate their mean. Is there some evidence that 
standing pulse rates are higher, on average? Explain your answer. 


. Which design provides more convincing evidence that standing pulse 


rates are higher, on average, than sitting pulse rates? Justify your 
answer. 


A statistics class with 24 students performed the “Get your heart 


beating” activity. Figure 10.9 shows a dotplot of the pulse rates for their 
completely randomized design. The mean pulse rate for the standing group 


05 


is x 1=74.83;1526~ "7° = 20.0%F) = 74.83; the mean for the sitting group is 


x 2=68.331526 


305 ee . 
= 0.200 = 20-9°F2 = 68.33 So the average pulse rate is 6.5 


beats per minute higher in the standing group. However, the variability in 
pulse rates for the two groups creates a lot of overlap in the dotplots. A 


two-sample ¢ test of HO: p1—p2=01526 


305 = 
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305 
versus Ha: 1-p2>01526 — 
ee 

i526 700 = 20-0 — 1.42 and a P-value of 0.09. These data do not provide 


convincing evidence that standing pulse rates are higher, on average, than 
sitting pulse rates for people like the students in this class. 


0.200 = 20.0%H, : pry 2>0 yields t=1.42 


Sit ee oe e 000 oe 


Stand 5 T I T T 
0 20 40 60 80 100 
Pulse rate 


FIGURE 10.9 Parallel dotplots of the pulse rates for the standing and sitting 
groups in a Statistics class’s completely randomized design. 


Position 


What about the class’s matched pairs design? Figure 10.10 shows a 
dotplot of the difference (Standing — Sitting) in pulse rate for each of the 
24 students. We can see that 21 of the 24 students recorded a positive 
difference, indicating that per standing pulse rate was higher. The mean 


305 > 
difference is x diff=6.831526 536 = 0.200 = 20.05 att = 6.83 beats per minute. A 


0.200 = 20.0% Hy : 


305 
one-sample t test of HO: pdiff=01526 © > Haiti = 9 versus 


Ha: pdiff>01526 a ae AH ano gives t=6. 4831500 soe ee 
t = 6.483 and a P-value of approximately 0. These data provide very 
convincing evidence that standing pulse rates are higher, on average, than 
sitting pulse rates for people like the students in this class. 


T T T T T T 
0 20 40 60 80 100 


Difference (Standing — Sitting) in pulse rate 
FIGURE 10.10 Dotplot of the difference (Standing — Sitting) in pulse rate for each 
student in a statistics class’s matched pairs design. 


Let’s take one more look at the two figures. Notice that we used the 
same scale for both graphs. The matched pairs design greatly reduced the 
variability in the response variable by accounting for a big source of 
variability—the differences between individual students. That made it 
easier to detect the fact that standing causes an increase in the average 
pulse rate. In other words, using a paired design resulted in more power. 
With the large amount of variability in the completely randomized design, 
we could not draw such a conclusion. 
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Section 10.3 Summary 


e Paired data result from recording two values of the same quantitative 
variable for each individual or for each pair of similar individuals. 


e To analyze paired data, start by computing the difference for each pair. 
Then make a graph of the differences. Use the mean difference x diff 


305 

= 0.200 = 20.0% ,. See ine . . 
1526 “diff and the standard deviation of the differences sdiff 
305 

= 0.200 = 20.0% e ..-- - 
1526 Sdiff as Summary Statistics. 


305 
e Before estimating or testing a claim about pdiffi526 
check that these conditions are met: 


0.200 = 20.0% py s:¢¢ 
d 


= Random: Paired data come from a random sample from the 
population of interest or from a randomized experiment. 
10%: When sampling without replacement, ndiff<0.10Ndiff 
305 B 
1526 = (0.200 = 20.0 “on aise C 0. 10.Naisr ; 
«= Normal/Large Sample: The population distribution of differences (or 
the true distribution of differences in response to the treatments) is 
Normal or the number of differences in the sample is large (ndiff>30) 


305 
: 2 % er me 
1526 0.200 = 20.0% (raise > 30) 


. If the population (true) distribution of 
differences has unknown shape and the number of differences in the 
sample is less than 30, a graph of the sample differences shows no 
strong skewness or outliers. 


e When the conditions are met, a C% confidence interval for the true 


305 
; «pea = 0.200 = 20.0% pp aiee « 
mean difference pdiff1526 diff is 
Sdiff 
= + 1 
_ eae nine Oe eres 
x diff+t*sdiffndiffis2 ~~ ~~” V "diff 


305 


——_ = 0,200 = 20.0% 
where t*1526 t 


is the critical value for a t distribution with 


305 
sor 4 = 0.200 = 20.0% df — n.. ’ 
df=ndiff—11526 df = naite — 1 and C% of its area between 
| 305 
—t¥1526. SOT” and ¥1526~ . This is called a one- 
sample t interval for a mean difference or a paired t interval for a 


mean difference. 


0.200 = 20.0 % 4s 
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> 


305 
e A significance test of HO: pdiff=01526 
called a one-sample t test for a mean difference or a paired t test for 

a mean difference. The standardized test statistic is 


_ = aitt—9 
S diff 


305 ; mieveie 
t=x diff—Osdiffndiffisas~ °-20°=209* J" diff 


When the Normal/Large Sample condition is met, find the P-value using 


the t distribution with degrees of freedom equal to ndiff—1 
| 
apg = 0-200 = 20.0% poe — 1 


e The proper inference method depends on how the data were produced. 


305 
= 0.200 = 20.0% 
For paired data, use one-sample t procedures for diff1526 


Hdiff, For quantitative data that come from independent random samples 
from two populations of interest or from two groups in a randomized 


305 
experiment, use two-sample t procedures for p1—p121526 © 
ae ae 


0.200 = 20.0% 


Section 10.3 Exercises 


75. pg 674 |p) Groovy tires Researchers were interested in comparing 


two methods for estimating tire wear. The first method used the 
amount of weight lost by a tire. The second method used the amount 
of wear in the grooves of the tire. A random sample of 16 tires was 
obtained. Both methods were used to estimate the total distance 
traveled by each tire. The table provides the two estimates (in 


thousands of miles) for each tire.2° 


Tire Weight Groove Diff. 


Tire Weight Groove Diff. 


1 45.9 35.7 10.2 a 30.4 23.1 7.3 
2 41.9 39.2 2d 10 Z/a3 23.7 3.6 
3 37.0 ole 6.4 11 20.4 20.9 -0.5 
4 33.4 28.1 5.3 12 24.5 16.1 8.4 
5 31.0 24.0 7a 13 20.9 19.9 1.0 


6 30.5 28.7 1.8 14 18.9 ae Bel 
7 30.9 200) 5.0 15 13.7 11.5 O20 


a. Make a dotplot of the difference (Weight — Groove) in the estimate 
of wear for each tire using the two methods. 


b. Describe what the graph reveals about whether the two methods 
give similar estimates of tire wear, on average. 


c. Calculate the mean difference and the standard deviation of the 
differences. Interpret the mean difference. 


76. Well water Trace metals found in wells affect the taste of drinking 


77. 


water, and high concentrations can pose a health risk. Researchers 
measured the concentration of zinc (in milligrams/liter) near the top 
and the bottom of 10 randomly selected wells in a large region. The 
data are provided in the following table.22 


Well 1 2 3 4 S) 6 7 8 9 10 
Bottom 0.430 0.266 0.567 0.531 0.707 0.716 0.651 0.589 0.469 0.723 
Top 0.415 0.238 0.390 0.410 0.605 0.609 0.632 0.523 0.411 0.612 


Difference 0.015 0.028 0.177 0.121 0.102 0.107 0.019 0.066 0.058 0.111 


a. Make a dotplot of the difference (Bottom — Top) in the zinc 
concentrations for these 10 wells. 


b. Describe what the graph reveals about whether the amount of zinc 


at the top and bottom of the wells is the same, on average. 


c. Calculate the mean difference and the standard deviation of the 


differences. Interpret the mean difference. 


Flight times Emirates Airline offers one outbound flight from Dubai, 
United Arab Emirates, to Doha, Qatar, and one return flight from 
Doha to Dubai each day. An experienced Emirates pilot suspects that 
the Dubai-to-Doha outbound flight typically takes longer. To find out, 
the pilot collects data about these flights on a random sample of 12 
days. The table displays the flight times in minutes.22 


1500 


Day 1 203 4 85 6 7 8 § 10 11 12 


Outbound (Dubai 75 42 62 63 54 46 52 50 42 46 43 52 


305 _ 6.200 = 20.0% 
OT a a 


Doha) 


Return (Doha 42 37 37 44 42 40 41 44 41 42 48 48 
pe 0.200 = 20.0% _., 
— 1526 ae 


Dubai) 


a. Explain why these are paired data. 


b. A dotplot of the difference (Outbound — Return) in flight time for 
each day is shown. Describe what the graph reveals about whether 
the outbound or return flight takes longer, on average. 


| ne tua San T T T I 
-5 0 5 bb asa wD SB 
Difference (Outbound — Return) 


in flight time (min) 
c. Calculate the mean difference and the standard deviation of the 
differences. Interpret the standard deviation. 


78. Literacy Do males have higher literacy rates than females, on 
average, in Islamic countries? The following table shows the percent 
of men and women who were literate in 24 largely Muslim nations at 
the time of this writing.22 


Country Male literacy (%) Female literacy (%) 
Afghanistan 43.1 12.6 
Algeria 86.0 86.0 
Azerbaijan 99.9 Oa 
Bangladesh 62.0 53.4 
Egypt 82.2 65.4 
Indonesia 97.0 89.6 
Iran 91.2 82.5 
Iraq 89.0 73.6 
Jordan 96.6 90.2 
Kazakhstan 99.8 99.3 
Kyrgyzstan 99.3 98.1 


1501 


79. 


80. 


81. 


Lebanon 93.4 86.0 


Libya 98.6 90.7 
Malaysia 95.4 90.7 
Morocco 76.1 97.6 
Pakistan 67.0 42.0 
Saudi Arabia 90.4 81.3 
Syria 86.0 73.6 
Tajikistan 99.8 99.6 
Tunisia 95.1 80.3 
Turkey 99.3 98.2 
Turkmenistan 99.3 98.3 
Uzbekistan 99.6 99.0 
Yemen 81.2 46.8 


a. Explain why these are paired data. 


b. A dotplot of the difference (Male — Female) in literacy rate for 
each country is shown. Describe what the graph reveals about 
whether males have higher literacy rates than females in these 
countries, on average. 


P-+4 oO . Seco & eee cm a e eo 
rut’ tH wf Tt rE ow we oe 
0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32 34 36 


Difference (Male — Female) in literacy rate 


c. Calculate the mean difference and the standard deviation of the 
differences. Interpret the standard deviation. 


pg 677 |) Groovy tires Refer to Exercise 75. Construct and 


interpret a 95% confidence interval for the true mean difference 
(Weight — Groove) in the estimates of tire wear using these two 
methods in the population of tires. 


Well water Refer to Exercise 76. Construct and interpret a 95% 
confidence interval for the true mean difference (Bottom — Top) in 
the zinc concentrations of the wells in this region. 


Does playing the piano make you smarter? Do piano lessons 
improve the spatial-temporal reasoning of preschool children? A 
study designed to investigate this question measured the spatial- 
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82. 


83. 


84. 


85. 


temporal reasoning of a random sample of 34 preschool children 
before and after 6 months of piano lessons. The difference (After — 
Before) in the reasoning scores for each student has mean 3.618 and 


standard deviation 3.055.%2 


a. Construct and interpret a 90% confidence interval for the true 
mean difference. 


b. Based on your interval from part (a), can you conclude that taking 


6 months of piano lessons would cause an increase in preschool 
students’ average reasoning scores? Why or why not? 


No annual fee? A bank wonders if omitting the annual credit card fee 
for customers who charge at least $2400 in a year will increase the 
amount charged on its credit cards. The bank makes this offer to an 
SRS of 200 of its credit card customers. It then compares how much 
these customers charge this year with the amount that they charged 
last year. The mean increase in the sample is $332, and the standard 
deviation is $108. 


a. Construct and interpret a 99% confidence interval for the true 
mean increase. 


b. Based on the interval from (a), can you conclude that dropping the 


annual fee would cause an increase in the average amount spent by 
this bank’s credit card customers? Why or why not? 


Flight times Refer to Exercise 77. Explain why it is not appropriate 
to perform a paired t test about the true mean difference (Outbound — 
Return) in flight times between Dubai and Doha. Note: See Exercise 
94 for an alternative approach to perform a significance test in this 
Setting. 


Literacy Refer to Exercise 78. Explain why it is not appropriate to 
perform a paired t test about the true mean difference (Male — 


305 o 
Female) i526 = 0.200 = 20.0 ‘(Male _ Female) 


countries. 


in literacy rates in Islamic 


pg 680 |») Drive-thru or go inside? Many people think it’s faster 


to order at the drive-thru than to order inside at fast-food restaurants. 
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86. 


To find out, Patrick and William used a random number generator to 
select 10 times over a 2-week period to visit a local Dunkin’ Donuts 
restaurant. At each of these times, one boy ordered an iced coffee at 
the drive-thru and the other ordered an iced coffee at the counter 
inside. A coin flip determined who went inside and who went to the 
drive-thru. The table shows the times, in seconds, that it took for each 
boy to receive his iced coffee after he placed the order.“! 


Visit Inside time Drive-thru time 
1 62 55 
2 63 50 
3 325 321 
4 105 110 
5 135 124 
6 55 54 
7 92 90 
8 75 69 
9 203 200 

10 100 103 


Do these data provide convincing evidence at the a=0.05 
305 
1526 


= 0.200 = 20.0% ‘ : 
a = 0.05 level of a difference in the true mean 


service time inside and at the drive-thru for this Dunkin’ Donuts 
restaurant? 


Better barley Does drying barley seeds in a kiln increase the yield of 
barley? A famous experiment by William S. Gosset (who discovered 
the ¢ distributions) investigated this question. Eleven pairs of adjacent 
plots were marked out in a large field. For each pair, regular barley 
seeds were planted in one plot and kiln-dried seeds were planted in 
the other. A coin flip was used to determine which plot in each pair 
got the regular barley seed and which got the kiln-dried seed. The 
following table displays the data on barley yield (pound per acre) for 


each plot.44 
Plot Regular Kiln 
1 1903 2009 
2 1935 1915 
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3 1910 2011 
4 2496 2463 
fs) 2108 2180 
6 1961 1925 
4 2060 2122 
8 1444 1482 
J 1612 1542 
10 1316 1443 
11 1511 1535 


Do these data provide convincing evidence at the a=0.05 
305 
1526 


~ 0.200 = 20-0 = 0.05 level that drying barley seeds in a kiln 


increases the yield of barley, on average? 


Music and memory Does listening to music while studying help or 
hinder students’ learning? Two statistics students designed an 
experiment to find out. They selected a random sample of 30 students 
from their medium-sized high school to participate. Each subject was 
given 10 minutes to memorize two different lists of 20 words, once 
while listening to music and once in silence. The order of the two 


word lists was determined at random; so was the order of the 
305 

treatments. The difference (Silence — Music) 1526 

(Silence — Music) 


0.200 = 20.0% 


in the number of words recalled was recorded for 
each subject. The mean difference was 1.57 and the standard 
deviation of the differences was 2.70. 


a. If the result of this study is statistically significant, can you 
conclude that the difference in the ability to memorize words was 
caused by whether students were performing the task in silence or 
with music playing? Why or why not? 


b. Do the data provide convincing evidence at the a=0.01 


305 
——= 0.200 = 20.0% . “¢: 

1526 a = 0.01 significance level that the number of 
words recalled in silence or when listening to music differs, on 


average, for students at this school? 


c. Based on your conclusion in part (a), which type of error—a Type 
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I error or a Type II error—could you have made? Explain your 
answer. 


88. Friday the 13th Do people behave differently on Friday the 13th? 
Researchers collected data on the number of shoppers at a random 
sample of 45 grocery stores on Friday the 6th and Friday the 13th in 
the same month. Then they calculated the difference (subtracting in 
the order 6th minus 13th) in the number of shoppers at each store on 


05 : ; o . 
these 2 days. The mean difference is —46.51526 — ee en MBG arid 


the standard deviation of the differences is 178. 


a. 


0.43 


If the result of this study is statistically significant, can you 
conclude that the difference in shopping behavior is due to the 
effect of Friday the 13th on people’s behavior? Why or why not? 


. Do these data provide convincing evidence at the a=0.05 


305 
1526 x = 0.05 level that the number of shoppers at 
grocery stores on these 2 days differs, on average? 


= 0.200 = 20.0 7 


Based on your conclusion in part (a), which type of error—a Type 
I error or a Type II error—could you have made? Explain your 
answer. 


89. Music and memory Refer to Exercise 87. 


a. 


b. 


Construct and interpret a 99% confidence interval for the true 
mean difference. If you already defined the parameter and checked 
conditions in Exercise 87, you don’t need to do them again here. 


Explain how the confidence interval provides more information 
than the test in Exercise 87. 


90. Friday the 13th Refer to Exercise 88. 


a. 


Construct and interpret a 90% confidence interval for the true 
mean difference. If you already defined parameters and checked 
conditions in Exercise 88, you don’t need to do them again here. 


Explain how the confidence interval provides more information 
than the test in Exercise 88. 
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91. pg 683 |») Two samples or paired data? In each of the following 


settings, decide whether you should use two-sample t procedures to 
perform inference about a difference in means or paired t procedures 


to perform inference about a mean difference. Explain your choice.“4 


a. To test the wear characteristics of two tire brands, A and B, each of 
50 cars of the same make and model is randomly assigned Brand A 
tires or Brand B tires. 


b. To test the effect of background music on productivity, factory 
workers are observed. For one month, each subject works without 
music. For another month, the subject works while listening to 
music on an MP3 player. The month in which each subject listens 
to music is determined by a coin toss. 


c. How do young adults look back on adolescent romance? 
Investigators interviewed a random sample of 40 couples in their 
mid-twenties. The female and male partners were interviewed 
separately. Each was asked about his or her current relationship 
and also about a romantic relationship that lasted at least 2 months 
when they were aged 15 or 16. One response variable was a 
measure on a numerical scale of how much the attractiveness of 
the adolescent partner mattered. You want to find out how much 
men and women differ on this measure. 


92. Two samples or paired data? In each of the following settings, 
decide whether you should use two-sample t procedures to perform 
inference about a difference in means or paired t procedures to 


perform inference about a mean difference. Explain your choice.*2 


a. To compare the average weight gain of pigs fed two different diets, 
nine pairs of pigs were used. The pigs in each pair were littermates. 
A coin toss was used to decide which pig in each pair got Diet A 
and which got Diet B. 


b. Separate random samples of male and female college professors 
are taken. We wish to compare the average salaries of male and 
female teachers. 
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c. To test the effects of a new fertilizer, 100 plots are treated with the 
new fertilizer, and 100 plots are treated with another fertilizer. A 
computer’s random number generator is used to determine which 
plots get which fertilizer. 


93. Have a ball! Can students throw a baseball farther than a softball? To 
find out, researchers conducted a study involving 24 randomly 
selected students from a large high school. After warming up, each 
student threw a baseball as far as he or she could and threw a softball 
as far as he she could, in a random order. The distance in yards for 
each throw was recorded. Here are the data, along with the difference 
(Baseball — Softball) in distance thrown, for each student: 


Student Baseball Softball Difference (Baseball — Softball) 
1 65 57 8 
2 90 58 32 
3 75 66 9 
4 73 61 12 
5 79 65 14 
6 68 56 12 
Z 58 53 5 
8 41 41 
2 56 44 12 

10 70 65 5 
11 64 57 7 
12 62 60 2 
13 72 55 18 
14 50 53 =3 
15 63 54 9 
16 48 42 6 
17 34 32 2 
18 49 48 1 
19 48 45 3 
20 68 67 1 
71 30 27 3 
22 26 25 1 
23 28 25 3 
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24 26 31 =o 


a. Explain why these are paired data. 


b. A boxplot of the differences is shown. Explain how the graph gives 
some evidence that students like these can throw a baseball farther 
than a softball. 


ea 
20 40 
Difference (Baseball — Softball) 
in distance thrown (yd) 


r 
0 


c. State appropriate hypotheses for performing a test about the true 
mean difference. Be sure to define any parameter(s) you use. 


d. Explain why the Normal/Large Sample condition is not met in this 
case. 


a 
The mean difference (Baseball—Softball)1526 — ial 


(Baseball — Softball) 


305 
x diff=6,541836 0.200 = 20.0% F aie = 6.54 yards. Is this a 
surprisingly large result if the null hypothesis is true? To find out, 
we can perform a simulation assuming that students have the same 
ability to throw a baseball and a softball. For each student, write 
the two distances thrown on different note cards. Shuffle the two 
cards and designate one distance to baseball and one distance to 
softball. Then subtract the two distances (Baseball—Softball) 

a 0.200 = 20.0%(Baseball — Softball) fo this for all the students 
and find the simulated mean difference. Repeat many times. Here 
are the results of 100 trials of this simulation: 


in distance thrown for these 24 students is 


Simulated mean difference (Baseball — Softball) 
in distance thrown (yd) 
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e. Use the results of the simulation to estimate the P-value. What 
conclusion would you draw? 


94. Flight times Refer to Exercise 77. The pilot suspects that the Dubai- 


to-Doha outbound flight typically takes longer. If so, the difference 


(ouon Reuse = 0.200 = 20.0%(Outbound — Return) in flight 
times will be positive on more days than it is negative. What if either 
flight is equally likely to take longer? Then we can model the 
outcome on a randomly selected day with a coin toss. Heads means 
the Dubai-to-Doha outbound flight lasts longer; tails means the Doha- 
to-Dubai return flight lasts longer. To imitate a random sample of 12 
days, imagine tossing a fair coin 12 times. 


a. Find the probability of getting 11 or more heads in 12 tosses of a 
fair coin. 


b. The outbound flight took longer on 11 of the 12 days. Based on 
your result in part (a), what conclusion would you make about the 
Emirates pilot’s suspicion? 


Multiple Choice Select the best answer for Exercises 95—98. 


95. There are two common methods for measuring the concentration of a 


96. 


pollutant in fish tissue. Do the two methods differ, on average? You 
apply both methods to each fish in a random sample of 18 carp and 
use 


a. the paired ¢ test for ucliftT638 ee eae 


b. the one-sample z test for p. 


305 


c. the two-sample t test for p1—-p2is26 "7" > A = Bie 


305 


d. the two-sample z test for pl—p215s26. 7") ~ °°° “pi — po, 


e. none of these. 


A study of the impact of caffeine consumption on reaction time was 
designed to correct for the impact of subjects’ prior sleep deprivation 
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by dividing the 24 subjects into 12 pairs on the basis of the average 
hours of sleep they had had for the previous 5 nights. That is, the two 
with the highest average sleep were a pair, then the two with the next 
highest average sleep, and so on. One randomly assigned member of 
each pair drank 2 cups of caffeinated coffee, and the other drank 2 
cups of decaf. Each subject’s performance on a standard reaction-time 
test was recorded. Which of the following is the correct check of the 
“Normal/Large Sample” condition for this significance test? 


I. Confirm graphically that the scores of the caffeine drinkers could 
have come from a Normal distribution. 


II. Confirm graphically that the scores of the decaf drinkers could 
have come from a Normal distribution. 


III. Confirm graphically that the differences in scores within each pair 
of subjects could have come from a Normal distribution. 


a. I only 

b. Il only 

c. III only 

d. I and II only 


e. I, I, and Ill 


Exercises 97 and 98 refer to the following setting. Researchers suspect that 
Variety A tomato plants have a different average yield than Variety B 
tomato plants. To find out, researchers randomly select 10 Variety A and 
10 Variety B tomato plants. Then the researchers divide in half each of 10 
small plots of land in different locations. For each plot, a coin toss 
determines which half of the plot gets a Variety A plant; a Variety B plant 
goes in the other half. After harvest, they compare the yield in pounds for 
the plants at each location. The 10 differences (Variety A — Variety B) in 
yield are recorded. A graph of the differences looks roughly symmetric 
and single-peaked with no outliers. The mean difference is x A-B=0.34 


305 a 
1526 — 0.200 = 20.0%F 4 5 — 0.34 


and the standard deviation of the differences 
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is sA-B= 0.83;506 - 


305 


0.0% 200 = 20.0% 
*sa_p = 0.83 Tet A-Bisa6 707 = 20-° 


= 0.200 = 2 


/A-B= the true mean difference (Variety A — Variety B) in yield for 
tomato plants of these two varieties. 


97. A 95% confidence interval for pA — B1526 
by 


98. 


e. 


. 0.34+1.96(0. 8310)150~ 


305 
= 0.200 = 20.0% ae 
"HA—B is given 


0.34+1.96(0. 83)i530~ 0.200 = 20.0%0.34 = 1.96 (0.83) 


0.200 = 20.0940-34 + 1.96 (2 = 


0.8 


an 


0.3421.812(0.8310)i5%0 
va z000092 = 0.200 = 20.0%0).34 = 2.262 (0.83) 


0.83 
0.34+2.262(0. 8310)i50 = 0.200 = 20.0%0.34 = 2.262 G - 


The P-value for a a os HO: pA-B= ois20 7 alata tal (car ca 


versus Ha: pA- B05 = 0.200 = 20.0%H, : 4a-B # 0 ig 0.227. Which 
of the following is the correct interpretation of this P-value? 


305 ; : or 
The probability that pA-B=O1526~ "7° = °°*a_B = 0 is 0,227. 


Given that the true mean difference (Variety A — Variety B) in 
yield for these two varieties of tomato plants is 0, the probability of 
getting a sample mean difference of 0.34 is 0.227. 


Given that the true mean difference (Variety A — Variety B) in 
yield for these two varieties of tomato plants is 0, the probability of 
getting a sample mean difference of 0.34 or greater is 0.227. 


Given that the true mean difference (Variety A — Variety B) in 
yield for these two varieties of tomato plants is 0, the probability of 
getting a sample mean difference greater than or equal to 0.34 or 


305 
less than or equal to —0.34i526 °° = 2°-°*—0.34 jg 0.227. 


Given that the true mean difference (Variety A — Variety B) in 
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yield for these two varieties of tomato plants is not 0, the 

probability of getting a sample mean difference greater than or 
305 sient 

equal to 0.34 or less than or equal to —0.341526 °°" ~ °°°”°—0.34 


is 0.227. 
Recycle and Review 


In each part of Exercises 99 and 100, state which inference procedure 
from Chapter 8, 9, or 10 you would use. Be specific. For example, you 
might say, “Two-sample z test for the difference between two 
proportions.” You do not have to carry out any procedures. 


99. Which inference method? 


a. Drowning in bathtubs is a major cause of death in children less 
than 5 years old. A random sample of parents was asked many 
questions related to bathtub safety. Overall, 85% of the sample said 
they used baby bathtubs for infants. Estimate the percent of all 
parents of young children who use baby bathtubs. 


b. How seriously do people view speeding in comparison with other 
annoying behaviors? A large random sample of adults was asked to 
rate a number of behaviors on a scale of 1 (no problem at all) to 5 
(very severe problem). Do speeding drivers get a higher average 
rating than noisy neighbors? 


c. You have data from interviews with a random sample of students 
who failed to graduate from a particular college in 7 years and also 
from a random sample of students who entered at the same time 
and did graduate within 7 years. You will use these data to 
estimate the difference in the percents of students from rural 
backgrounds among dropouts and graduates. 


d. Do experienced computer-game players earn higher scores when 
they play with someone present to cheer them on or when they 
play alone? Fifty teenagers with experience playing a particular 
computer game have volunteered for a study. We randomly assign 
25 of them to play the game alone and the other 25 to play the 
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game with a supporter present. Each player’s score is recorded. 


100. Which inference method? 


a. A city planner wants to determine if there is convincing evidence 
of a difference in the average number of cars passing through two 
different intersections. He randomly selects 12 times between 6:00 
a.m. and 10:00 p.m., and he and his assistant count the number of 
cars passing through each intersection during the 10-minute 
interval that begins at that time. 


b. Are more than 75% of Toyota owners generally satisfied with their 
vehicles? Let’s design a study to find out. We’Il select a random 
sample of 400 Toyota owners. Then we’|I ask each individual in 
the sample, “Would you say that you are generally satisfied with 
your Toyota vehicle?” 


c. Are male college students more likely to binge drink than female 
college students? The Harvard School of Public Health surveys 
random samples of male and female undergraduates at four-year 
colleges and universities about whether they have engaged in binge 
drinking. 


d. A bank wants to know which of two incentive plans will most 
increase the use of its credit cards and by how much. It offers each 
incentive to a group of current credit card customers, determined at 
random, and compares the amount charged during the following 6 
months. 


Exercises 101 and 102 refer to the following setting. Coaching companies 
claim that their courses can raise the SAT scores of high school students. 
Of course, students who retake the SAT without paying for coaching 
generally raise their scores. A random sample of students who took the 
SAT twice found 427 who were coached and 2733 who were uncoached.4® 
Starting with their Verbal scores on the first and second tries, we have 
these summary statistics: 


Try 1 Try 2 Gain 


305 305 305 305 305 305 _ 0.000 200% 
305 = 0.20(22. = 0,20¢ 50 = 9.20¢ 50% = 0.20¢ 50% = 0. 20¢0806722= 2" 


1526 ~ 1526 1526 1526 1526 $, 
zr Sz z Sz zr 
Coached 427 he, 
Uncoached 2733 506 101 527 101 21 52 
101. Coaching and SAT scores Let’s first ask if students who are 


102. 


coached increased their scores significantly, on average. 


a. 


b. 


You could use the information on the Coached line to carry out 
either a two-sample t test comparing Try 1 with Try 2 or a paired 
t test using Gain. Which is the correct test? Why? 


Carry out the proper test. What do you conclude? 


Coaching and SAT scores What we really want to know is whether 
coached students improve more than uncoached students, on 
average, and whether any advantage is large enough to be worth 
paying for. Use the information above to answer these questions: 


a. How much more do coached students gain, on average, compared 


to uncoached students? Construct and interpret a 99% confidence 
interval. 


Does the interval in part (a) give convincing evidence that 
coached students gain more, on average, than uncoached 
students? Explain your answer. 


. Based on your work, what is your opinion: Do you think 


coaching courses are worth paying for? 
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_Mé _. FRAPPY! FREE RESPONSE 


AP® PROBLEM, 
YAY! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, concise 
response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you 
use, because you will be scored on the correctness of your methods 
as well as on the accuracy and completeness of your results and 
explanations. 


Will using name-brand microwave popcorn result in a greater 
percentage of popped kernels than using store-brand microwave 
popcorn? To find out, Briana and Maggie randomly selected 10 bags 
of name-brand microwave popcorn and 10 bags of store-brand 
microwave popcorn. The chosen bags were arranged in a random 
order. Then each bag was popped for 3.5 minutes, and the percentage 
of popped kernels was calculated. The results are displayed in the 
following table. 


Name-brand 95 88 84 94 81 90 97 93 91 86 
Store-brand 91 89 82 82 77 78 84 86 86 90 


Do the data provide convincing evidence that using name-brand 


microwave popcorn will result in a greater mean percentage of 
popped kernels? 
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After you finish, you can view two example solutions on the book’s 
website (highschool.bfwpub.com/tps6e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 
“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide you with a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own 


score. 
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Chapter 10 Review 


Section 10.1: Comparing Two Proportions 


In this section, you learned how to construct confidence intervals and 
perform significance tests for a difference between two proportions. 
Inference for a difference in proportions is based on the sampling 


0.200 = 20.0%. — 7 a 
*P1 ~ P2, When the conditions are 


305 
met, the sampling distribution of p\1—pA2i1s26 °° = 2.°%p, ~ 
305 


approximately Normal with a mean of pp/41—-p42=p1-p21526 
Mp,-p, — Pl 


305 
distribution of p41—p421526 © 
" P2 is 
= 0.200 = 20.0% 


P2 anda standard deviation of 


op’1—-p42=p1(1-p1)n1+p2(1- p2)n2i536 7 ll iis 


/m(-r) P2(1—p2) 
8 i Le 


Pi-P2 VV ny n2 


The conditions for inference about a difference in proportions are the 
same for confidence intervals and significance tests. The Random 
condition says that the data must be from two independent random 
samples or two groups in a randomized experiment. The 10% condition 
says that each sample size should be less than 10% of the corresponding 
population size when sampling without replacement. The Large Counts 


condition says that the number of successes and number of failures from 
305 


each sample/group s should be at least 10. That is, nlp\11526~ ae ae 


. 305 
mp; , ni(1- pA) i526 0.200 = 20.07 OTL) (1 meee n2 p21526— - 0.200 = 20.0%, Any Po 
305 om ‘ 305, 
, n2(1—pA2)i526~ 970° = 20-0%n2 (1 — Po) are all >101526~ 
A confidence interval for a difference between two proportions 
provides an interval of plausible values for the true difference in 
proportions. The formula is 


0.200 = 20.0% 19 


305, ena 
(p\1-p2)+z*p\1(1-p41)n1+pA2(1-pA2)n2i1526 9709 = 29.0% 
~  .\, 4 /Pi(l-P1) , P2(1-P2) 

(P) — Pa) +2 a a a 


ny n9 
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The logic of confidence intervals, including how to interpret the 
confidence interval and the confidence level, is the same as it was in 
Chapter 8, when you first learned about confidence intervals. 


Likewise, a significance test for a difference between two proportions 
uses the same logic as the significance tests you learned about in Chapter 
9. We start by assuming the null owes’ is true and asking how likely it 
would be to get evidence for Ha1526 1 as strong as or 
stronger than the observed result in a study by chance alone. If it is 
plausible that a difference in proportions could be the result of sampling 
variability or the chance variation due to random assignment, we do not 
have convincing evidence that the alternative hypothesis is true. However, 
if the difference is too big to attribute to chance, there is convincing 
evidence that the alternative hypothesis is true. For a test of HO: p1—p2=0 


0.200 = 20.0 4H, 


305 sited 
1526 > 0-200 = 20.0% Hy : py — p2 = 0 the standardized test statistic is 
308 — 0.200 = 20.0% 
z=(p41-p2)-0p(1-p/)n1+p(1—p)n2 1526 
; (P1—P2)—0 
/p(-P) , P(L-P) 
\ ny ng 
305 _ 0.200 = 20.0% 
where p1526 =~ P is the combined (pooled) proportion of 
successes: 
. %1+XoQ 
ae a 0.200 = 20 ox? ~ ny+n 
pA=X1+X2n1+n21526 nae 


Finally, you learned that the inference techniques used for analyzing a 
difference in proportions from two independent random samples work 
very well for analyzing a difference in proportions from two groups in a 
completely randomized experiment when the conditions are met. 


Section 10.2: Comparing Two Means 


In this section, you learned how to construct confidence intervals and 
perform significance tests for a difference between two means. Inference 
for a difference in means is based on the sampling distribution of x 1-x 2 


305 
0.200 = 20.0%, — 7. - 
1526 &1 ~ £2, When the conditions are met, the sampling 
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= 0.200 = 20.0% = 
distribution of x 1—x 21526 


0.0%F a ©2 is (approximately) 


Normal with a mean of px 1-x 2=y1- 27596 D2 2008 


Mz,—x2 ~ Ml ~ F2 anda standard deviation of ox 1-x 2=012n1+o022n2 


| hd rT aid 
= 0.200 = 20.0%7z,-z. — Vu * ny 

The conditions for inference about a difference in means are the same 
for confidence intervals and significance tests. The Random condition says 
that the data must be from two independent random samples or two groups 
in a randomized experiment. The 10% condition says that each sample size 
should be less than 10% of the corresponding population size when 
sampling without replacement. For each sample, the corresponding 
population distribution (or the true distribution of response to the 


305, cana: 
treatment) is Normal or the sample size is large (n >1526 Sie ae 30). 


If either population (treatment) distribution has unknown shape and n < 
30, confirm that a graph of the sample data shows no strong skewness or 
outliers. 


As in Chapters 8 and 9, inference techniques for means are based on 
the t distributions. There are two options for calculating the number of 
degrees of freedom to use. The first option is to use technology to calculate 
the degrees of freedom. The second option is to use the smaller of nl-1 


26 ~ 0-200 = 20.0% m1 — 1 and n2-115 a ae Migs The technology 
option is preferred because it is more accurate and produces a larger 
number of degrees of freedom, resulting in narrower confidence intervals 
and smaller P-values. If you are using technology, always choose the 
unpooled option when performing inference about a difference between 
two means. 


A confidence interval for a difference between two means provides an 
interval of plausible values for the true difference in means. The formula is 


0.200 = 20.0% 


305 
(x 1=x ~9) rs 12n1+s220 27558 ~ 
Use a significance test to decide between two competing eee about 


305 se 
a true difference in means. For a test of HO: 1-\12=0i526 °°"? ~ °°” 
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Ho: (1 — 2 = 0. the standardized test statistic is 


| 
t=(x 1-x 2)-0s12n1+s22n2I526 °°" =" 


Section 10.3: Comparing Two Means: Paired Data 


In this section, you learned how to analyze paired data, which result from 
measuring the same quantitative variable twice for one individual or for 
two very similar individuals. Start by finding the difference between the 
values in each pal ane make a graph of the differences. Use the mean 


difference x “aiff - = 0.200 = 20-09 ave and the standard deviation of the 


= 0.200 = 20.0% 


differences sdiff15 a diff as summary Statistics. 


You can perform inference about the true mean difference diff 
305 
1526 


= 0.200 = 20.0%); a: - a 
“diff when the conditions are met. The conditions are the 


same for confidence intervals and significance tests. The Random 
condition says that paired data must come from a random sample from the 


population of interest or from a randomized experiment. The 10% 
305 


condition says that the sample size ndiff1526 — 0.200 = 20.0 nai: should be 
less than 10% of the corresponding population of differences when 
sampling without replacement. The Normal/Large Sample condition says 
that the population a of differences is Normal or that the sample 


size is large (ndiff > 30) 1538 = = 0.200 = 20.07 (naire > 30) Tf the sample size is 
small and the population shape is unknown, graph the difference values to 
make sure there is no strong skewness or outliers. 

As in Chapters 8 and 9, inference techniques for means are based on 
the t estibudions: The appropriate number of degrees of freedom is 


df=ndiff-— lie 526 ~ 0-200 = 20.0%df = nai — 1, 
A confidence interval for a mean difference a an interval of 


plausible values for the true mean difference ucliffi638 7 ede 


The formula is 
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Fat  &° 
x ~aift-ttxsditindiffieae ~ + 0.200 = 20.0% diff 


Use a significance test to decide between me COMpaMIE hypotheses about 


a mean difference. For a test of HO: pdiff= one” mas Sips Hae 0 
the standardized test statistic is 
_ *aift—9 
= S diff 
t=x diff-Osdiffndiffisas °70 = 2°°% diff 


To decide whether two-sample t procedures for a difference between 
two means or paired t procedures for a mean difference are appropriate, 
look at how the data were produced. 


Difference in proportions p1 — p2isx-°-°*P1 — P2 

Conditions , Random: The data come from two independent random sample 
from two groups in a randomized experiment. 

o 10%: When sampling without replacement, n1<0.10N1=.».»» 


305 ‘. 
ny< 0.10.N, and n2<0.10N2 1526 0.200 = 20.0%no a 0.10.N> 


e Large Counts: The counts of “successes” and “failures” in eac! 


305 e n 
sample or group—n1p11526 — ee aD ,n1(1-p/1) 
= 0.200 = 20.0% np Bo 


05 
ise (tpi) 


n2(1- pn2) 1520 


305 
n2 p\21526— 


= 0.200 = 20.0%n2 (1 — po) are alll at least 10. 


305 


Name Two-sample _ interval for 
Z p1—p21526 


= 0.200 = 20.0%», — po 
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(TI-83/84) (2-PropZInt) 


305 
—— A1-pA2)+2*p\1(1—pA1)n1+pA2(1-pA2)n21526~ 970° = 20.0% 
p®i~p p Pp p p 
‘ 305 2 
Name Two-sample , west for p1—p2s26~ 0.200 = 20.0%p, — po 
(TI-83/84) (2-PropZTest) 
Null ae 020m 
oie Se i Soe 
Formula Z= 
2 (pi-p; 
(1-r) 
305 
(p\1—p’2)-Op\(1—p)n1+p\(1-pA)n2isa6~ 920 = 29.0% i 
305 ~  X1+X2 _ 
where pA=X1+X2n1+n21530~ 0.200 = 20.0% = ==>, value fron 


standard Normal distribution 


Related Relevant 
Example Chapter 
on Review 
Learning Target Section Page(s) Exercise(s) 


Describe the shape, center, and variability 10.1 624 R10.1 
of the sampling distribution of 


5 P P 
p\1—p21526 0.200 = 20.0%, — po° 


Determine whether the conditions aremet 10.1 625,631 #£R10.2, 
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for doing inference about a difference 
between two proportions. 


Construct and interpret a confidence 
interval for a difference between two 
proportions. 


Calculate the standardized test statistic and 
P-value for a test about a difference 
between two proportions. 


Perform a significance test about a 
difference between two proportions. 


Describe the shape, center, and variability 
of the sampling distribution of 

305 pa ae 
1x2 7596 = 9-200 = 20.0%, — Fy: 
Determine whether the conditions are met 
for doing inference about a difference 
between two means. 


Construct and interpret a confidence 
interval for a difference between two 
means. 


Calculate the standardized test statistic and 
P-value for a test about a difference 
between two means. 


Perform a significance test about a 
difference between two means. 


Analyze the distribution of differences in a 
paired data set using graphs and summary 
statistics. 


Construct and interpret a confidence 
interval for a mean difference. 


Perform a significance test about a mean 
difference. 


Determine when it is appropriate to use 
paired t procedures versus two-sample ¢ 
procedures. 
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Chapter 10 Review Exercises 


These exercises are designed to help you review the important ideas and 
methods of the chapter. 


R10.1 |») American-made cars Nathan and Kyle both work for the 


Department of Motor Vehicles (DMV), but they live in different 
states. In Nathan’s state, 80% of the registered cars are made by 
American manufacturers. In Kyle’s state, only 60% of the registered 
cars are made by American manufacturers. Nathan selects a random 


sample of 100 cars in his state and Kyle selects a random sample of 
ree . 

70 cars in his state. Let p\N-p4K1526 — een ope pe be the 

difference (Nathan’s state — Kyle’s state) in the sample proportion of 

cars made by American manufacturers. 

a. What is the shape of the sampling distribution of p\N-pAK 


305 
= 0.200 = 20.0%). — p, 
1526 Pn ~ PK? Why? 


b. Find the mean of the sampling distribution. 


c. Calculate and interpret the standard deviation of the sampling 
distribution. 


R10.2 |) Facebook As part of the Pew Internet and American Life 


Project, researchers conducted two surveys. The first survey asked a 
random sample of 1060 U.S. teens about their use of social media. A 
second survey posed similar questions to a random sample of 2003 

U.S. adults. In these two studies, 71.0% of teens and 58.0% of adults 


305 — 
used Facebook.4Z Let pTis20 7’? = 20.0%pp = the true proportion of 
p prop 


305 
all U.S. teens who use Facebook and pAis26. "> *°°*pa= the 


true proportion of all U.S. adults who use Facebook. Calculate and 
interpret a 99% confidence interval for the difference in the true 
proportions of U.S. teens and adults who use Facebook. 


R10.3 |») Treating AIDS The drug AZT was the first drug that seemed 
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effective in delaying the onset of AIDS. Evidence for AZT’s 
effectiveness came from a large randomized comparative 
experiment. The subjects were 870 volunteers who were infected 
with HIV, the virus that causes AIDS, but did not yet have AIDS. 
The study assigned 435 of the subjects at random to take 500 
milligrams of AZT each day and another 435 to take a placebo. At 
the end of the study, 38 of the placebo subjects and 17 of the AZT 
subjects had developed AIDS. 


a. Do the data provide convincing evidence at the a=0.05 


| a 
i526 V0 = °°" — 0.05 level that taking AZT lowers the 


proportion of infected people like the ones in this study who will 
develop AIDS in a given period of time? 


b. Describe a Type I error and a Type IJ error in this setting and 
give a consequence of each error. 


R10.4 |») Candles A company produces candles. Machine 1 makes 


candles with a mean length of 15 cm and a standard deviation of 
0.15 cm. Machine 2 makes candles with a mean length of 15 cm and 


a standard deviation of 0.10 cm. A random sample of 49 candles is 
305 


taken from each machine. Let x 1-x 21526 °700~ 20°"%1 — Fz he 
the difference (Machine 1 — Machine 2) in the sample mean length 


of candles. Describe the shape, center, and variability of the 
305 


0.200 = 20.0% 5, — x. 
sampling distribution of x 1-x 21526 — 17 2. 


R10.5 |») Men versus women The National Assessment of Educational 
Progress (NAEP) Young Adult Literacy Assessment Survey 
interviewed a aaa samples of 840 men and 1077 women 
aged 21 to 25 years.“8 The mean and standard deviation of scores on 
the NAEP’s test of quantitative skills ver s - 1=272.40 
305 . 

1506 0.200 = 20.0%F , = 272.40 and s1=59. ro 0.200 20.0% 5, = 59.2 


for the men on me sample. For the women, the results were 


x 2=274; un 0.200 = 20.0%5 = 274.73 and s2=57.5 


305 . _— 
1526 0.200 = 20.0 °S5 _ 57.5. 


a. Construct and interpret a 90% confidence interval for the 
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difference in mean score for male and female young adults. 


b. Based only on the interval from part (a), is there convincing 
evidence of a difference in mean score for male and female 
young adults? 


R10.6 |») Each day I am getting better in math A “subliminal” 


message is below our threshold of awareness but may nonetheless 
influence us. Can subliminal messages help students learn math? A 
group of 18 students who had failed the mathematics part of the City 
University of New York Skills Assessment Test agreed to 
participate in a study to find out. All received a daily subliminal 
message, flashed on a screen too rapidly to be consciously read. The 
treatment group of 10 students (assigned at random) was exposed to 
“Fach day I am getting better in math.” The control group of 8 
students was exposed to a neutral message, “People are walking on 
the street.” All 18 students participated in a summer program 
designed to improve their math skills, and all took the assessment 
test again at the end of the program. The following table gives data 
on the subjects’ scores before and after the program.22 
Treatment group Control group 


Pretest Posttest Difference Pretest Posttest Difference 


18 24 6 18 29 11 
18 25 Z 24 29 S) 
21 33 TZ 20 24 

18 29 11 18 26 8 
18 33 15 24 38 14 
20 36 16 22 27 S) 
23 34 11 15 22 vi 
23 36 13 19 31 ie 
21 34 13 

17 OME 10 


a. Explain why a two-sample t test and not a paired t test is the 
appropriate inference procedure in this setting. 


b. The following boxplots display the differences in pretest and 
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posttest scores for the students in the control (C) and treatment 
(T) groups. Write a few sentences comparing the performance of 
these two groups. 


= 
—i- 


i—e wu by a i ee 
0 2 4 6 8 10 12 14 16 18 


Difference (Posttest — Pretest) 
in test score 


Group 
Treatment Control 


c. Do the data provide convincing evidence at the a=0.01 


305 ‘candi 
i526? ~ *9-°"@ = 0.01 significance level that subliminal 
messages help students like the ones in this study learn math, on 
average? 


d. Can we generalize these results to the population of all students 
who failed the mathematics part of the City University of New 
York Skills Assessment Test? Why or why not? 


R10.7 |) On your mark In track, sprinters typically use starting blocks 


because they think it will help them run a faster race. To test this 
belief, an experiment was designed where each sprinter on a track 
team ran a 50-meter dash two times, once using starting blocks and 
once with a standing start. The order of the two different types of 
Starts was determined at random for each sprinter. The times (in 
seconds) for 8 different sprinters are shown in the table. 


Sprinter With blocks Standing start 
1 6.12 6.38 
2 6.42 6.52 
3 5.98 6.09 
4 6.80 6.72 
5 5.73 5.98 
6 6.04 6.27 
7 6.55 6.71 
8 6.78 6.80 


a. Make a dotplot of the difference (Standing — Blocks) in 50-meter 
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run time for each sprinter. What does the graph suggest about 
whether starting blocks are helpful? 


. Calculate the mean difference and the standard deviation of the 
differences. Explain why the mean difference gives some 
evidence that starting blocks are helpful. 


. Do the data provide convincing evidence that sprinters like these 
run a faster race when using starting blocks, on average? 


. Construct and interpret a 90% confidence interval for the true 
mean difference. Explain how the confidence interval gives more 
information than the test in part (b). 
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Chapter 10 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T10.1 A study of road rage asked separate random samples of 596 men 
and 523 women about their behavior while driving. Based on their 
answers, each respondent was assigned a road rage score on a 
scale of 0 to 20. Are the conditions for performing a two-sample t 
test satisfied? 


a. Maybe; we have independent random samples, but we should 
look at the data to check Normality. 


b. No; road rage scores on a scale from 0 to 20 can’t be Normal. 
c. No; we don’t know the population standard deviations. 


d. Yes; the large sample sizes guarantee that the corresponding 
population distributions will be Normal. 


e. Yes; we have two independent random samples and large 
sample sizes. 


T10.2 Thirty-five people from a random sample of 125 workers from 
Company A admitted to using sick leave when they weren’t really 
ill. Seventeen employees from a random sample of 68 workers 
from Company B admitted that they had used sick leave when they 
weren’t ill. Which of the following is a 95% confidence interval 
for the difference in the proportions of workers at the two 
companies who would admit to using sick leave when they weren’t 
ill? 


0.200 = 20.0% 


305 
a. 0.03+(0.28)(0.72)125+(0.25)(0.75)681526 


0.03 + V 125 68 


0.200 = 20.0% 


305 
b. 0.03+1.96(0.28)(0.72)125+(0.25)(0.75)681526 


/(0.28)(0.72) (0.25)(0.75) 
)(0.75) 


0.03 + 1.96/——— =: 
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= 0.200 = 20.0% 


c. 0.03+1.645(0.28)(0.72)125+(0.25)(0. 75) 681525 


(0.28)(0.72 (0.25)(0.75) 
0.03 + 1.6454/— + Se 


0.200 = 20.0% 


d. 0.03+1.96(0.269)(0.731)125+(0.269)(0. 731)6815207 


: /[0.269)(0.731) (0.269)(0.731) 
0.03 + 1.96 4/ —— aS 


e. 0.03+1.645(0.269)(0.731)125+(0.269)(0.731)68 


305 . en tual ye. [(0.269)(0-731) (0.209)(0.731) 
ag = 0:200 = 20.0 70.03 + 1.645 i — 


T10.3 The power takeoff driveline on tractors used in agriculture can be 
a serious hazard to operators of farm equipment. The driveline is 
covered by a shield in new tractors, but the shield is often missing 
on older tractors. Two types of shields are the bolt-on and the flip- 
up. It was believed that the bolt-on shield was perceived as a 
nuisance by the operators and deliberately removed, but the flip-up 
shield is easily lifted for inspection and maintenance and may be 
left in place. In a study by the U.S. National Safety Council, 
random samples of older tractors with both types of shields were 
taken to see what proportion of shields were removed. Of 183 
tractors designed to have bolt-on shields, 35 had been removed. Of 
the 136 tractors with flip-up oo 15 were removed. We wish to 


0.200 = 20.0% FI, : 


perform a test of HO: pB= oF isa PB ~ PF versus 


305 2 
Ha: pB>pF 1526 ial ‘PB > PF, where pB 


05 
305 _ 9.200 = 


20.0% 0.200 = 20.0% 
1526 PB and oF is 


PF are the proportions 
of all tractors with the bolt-on and flip-up shields removed, 
respectively. Which of the following is not a condition for 
performing the significance test? 


a. Both populations are Normally distributed. 
b. The data come from two independent samples. 
c. Both samples were chosen at random. 


d. The counts of successes and failures are large enough to use 
Normal calculations. 


e. Both populations are more than 10 times the corresponding 
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sample sizes. 


305 
; ——= 0.200 = 20.0% 
T10.4 A quiz question gives random samples of n=101526 


n = 10 observations from each of two Normally distributed 
populations. Tom uses a table of t distribution critical values and 9 
degrees of freedom to calculate a 95% confidence interval for the 
difference in the two population means. Janelle uses her 
calculator’s two-sample t interval with 16.87 degrees of freedom 
to compute the 95% confidence interval. Assume that both 
students calculate the intervals correctly. Which of the following is 
true? 


a. Tom’s confidence interval is wider. 
b. Janelle’s confidence interval is wider. 


Both confidence intervals are the same width. 


ie) 


pe 


There is insufficient information to determine which 
confidence interval is wider. 


e. Janelle made a mistake; degrees of freedom has to be a whole 
number. 


Exercises T10.5 and T10.6 refer to the following setting. A researcher 
wished to compare the average amount of time spent in extracurricular 
activities by high school students in a suburban school district with that in 
a school district of a large city. The researcher obtained an SRS of 60 high 
school students in a large suburban school district and found the mean time 
spent in extracurricular activities per week to be 6 hours with a standard 
deviation of 3 hours. The researcher also obtained an independent SRS of 
AO high school students in a large city school district and found the mean 
time spent in extracurricular activities per week to be 5 hours with a 
standard deviation of 2 hours. Suppose that the researcher decides to carry 


r 


305 
out a significance test of HO: psuburban=pcity1526 
Ay 


0.200 = 20.0% 


* Hsuburban — Mecity versus a two-sided alternative. 


T10.5 Which is the correct standardized test statistic? 


__ (6—-5)—0 


305 3 2 


a. 2=(6—-5)-0360+2401526 ~ °200= 20.0% = aw a0 
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b. 2=(6-5)- oaneo+2240e~ 0200 = 20.0% yore 


05 yy = 3 + 
c. t=(6-5)- 0360+240i525 Soe |= a 


305 9 9 @ fe aE 
d. t=(6-5)-0360+2401526 9200 = 200% = wo" 


305, uta [x 
e. t=(6-5)-03260+22401526 77 = 799% Vw 0 
T10.6 The P-value for the test is 0.048. A correct conclusion is to 


305 ae 
a. fail to reject H01526 — 0.200 = 20.0 Hy because 0.048<a=0.05 


305 _ 0.200 = 20.0% 
1526 ~ "0.048 < a = 0.05, There is convincing evidence 


of a difference in the average time spent on extracurricular 
activities by students in the suburban and city school districts. 


305, nda 
b. fail to reject H0O1526 — 9.200 = 2.8 Hy because 0.048<a=0.05 


305 _ 0.200 = 20.0% 
1526” ~—SttS:~«S 0.0488 < @ = 0.05, There is not convincing 


evidence of a difference in the average time spent on 
extracurricular activities by students in the suburban and city 
school districts. 


305 “ 
c. fail to reject H01526 ~ 0:200= 29.9 Hy because 0.048<a=0.05 
305 


imo 20% 04s < a= 0.05, There is convincing evidence 
that the average time spent on extracurricular activities by 
students in the suburban and city school districts is the same. 


305 7 
d. reject H01526 ~ 0.200°= 2-9 Hy because 0.048<a=0.05 
305 


1526-200 = 20.0% 948 < a = 0.05. There is not convincing 
evidence of a difference in the average time spent on 
extracurricular activities by students in the suburban and city 


school districts. 


305 —— 
e, reject H01526 — 0.200 = 20-0 Hy because 0.048<a=0.05 
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305 ee 

i526 0 = *9-""°9.048 < a = 0.05, There is convincing evidence 
of a difference in the average time spent on extracurricular 
activities by students in the suburban and city school districts. 


T10.7 At a baseball game, 42 of 65 randomly selected people own an 
iPod. At a rock concert occurring at the same time across town, 34 
of 52 randomly selected people own an iPod. A researcher wants to 
test the claim that the proportion of iPod owners at the two venues 
is different. A 90% confidence interval for the difference (Game — 
Concert) in population proportions is (—0.154, 0.138) 


305 


1526 0-200 = 20.0%(—0.154, 0.138) Which of the following gives the 


correct outcome of the researcher’s test of the claim? 


a. Because the confidence interval includes 0, the researcher can 
conclude that the proportion of iPod owners at the two venues is 
the same. 

305 

b. Because the center of the interval is —0.0081526 — 

0.008 the researcher can conclude that a higher proportion of 
people at the rock concert own iPods than at the baseball game. 


0.200 = 20.0% 


c. Because the confidence interval includes 0, the researcher 
cannot conclude that the proportion of iPod owners at the two 
venues is different. 


d. Because the confidence interval includes more negative than 
positive values, the researcher can conclude that a higher 
proportion of people at the rock concert own iPods than at the 
baseball game. 


e. The researcher cannot draw a conclusion about a claim without 
performing a significance test. 


T10.8 An SRS of size 100 is taken from Population A with proportion 0.8 
of successes. An independent SRS of size 400 is taken from 
Population B with proportion 0.5 of successes. The sampling 
distribution of the difference (A — B) in sample proportions has 


what mean and standard deviation? 
305 


—— = 01.200 = 2 
a. mean=0.3; 1526 


”-0*mean = 0.3; standard deviation =1.3 
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305 
1526 


— (1.200 = 20.0% : 
0.200 = 20.0%_ 4 


305 
—" ~ 0.200 = 20.0% = a 
b. mean=0.3;1526 mean = 0.3; standard deviation 
305 
—— = ().2 = 20.0% 
=0.401526 oa = 0.40 
ca >= 0,200 = 20.0% 
c. mean= 0. Sy” lea “mean = 0.3; standard deviation 
ee 
9) = 
=0.0471526 "7" = 0.047 
ca >= 0.200 = 20.0% ‘ 
= — aU. 0 a — . . ° 
d. mean=0. 3; 1526 mean = 0.3; standard deviation 
= (0.2 = 20. 0% _ 
=() oo22i8m6 sa 0.0022 
ca = 0.200 = 20.0% 
e. mean=0. ah 1526 ‘mean = 0.3; standard deviation 
.200 = 20.0% 
=0. oo02isa si = 0.0002 


T10.9 Are TV commercials louder than their surrounding programs? To 
find out, researchers collected data on 50 randomly selected 
commercials in a given week. With the television’s volume at a 
fixed setting, they measured the maximum loudness of each 
commercial and the maximum loudness in the first 30 seconds of 
regular programming that followed. Assuming conditions for 
inference are met, the most appropriate method for answering the 
question of interest is 


a. atwo-sample t test for a difference in means. 

b. atwo-sample t interval for a difference in means. 
c. a paired ¢t test for a mean difference. 

d. a paired t interval for a mean difference. 

e. atwo-sample z test for a difference in proportions. 


T10.10 Researchers want to evaluate the effect of a natural product on 
reducing blood pressure. They plan to carry out a randomized 
experiment to compare the mean reduction in blood pressure of a 
treatment (natural product) group and a placebo group. Then they 


305 , 
will use the data to perform a test of HO: pT- ai =e ee 
305 os 


305 


H,: pr-pup> 0 where uT 1526 = 0.200 = 2 


20.0 %rpa 
T= the true mean 
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reduction in blood pressure when taking the natural product and pP 
305 
1526 


= 0.200 = 20.0%)» ae 
P= the true mean reduction in blood pressure 


when taking a placebo for subjects like the ones in the experiment. 
The researchers would like to detect whether the natural product 
reduces blood pressure by at least 7 points more, on average, than 
the placebo. If groups of a 50 are used in the experiment, a two- 


305 
sample t test using a=0. 01526 = 0.200 = 0.0% = 0.01 will have a 


power of 80% to detect a 7-point difference in mean blood pressure 
reduction. If the researchers want to be able to detect a 5-point 
difference instead, then the power of the test 


a. would be less than 80%. 
b. would be greater than 80%. 
c. would still be 80%. 


Pu 


could be either less than or greater than 80%. 


e. would vary depending on the standard deviation of the data. 


Section IJ: Free Response Show all your work. Indicate clearly the 
methods you use, because you will be graded on the correctness of your 
methods as well as on the accuracy and completeness of your results and 
explanations. 


T10.11 Researchers wondered whether maintaining a patient’s body 
temperature close to normal by heating the patient during surgery 
would affect rates of infection of wounds. Patients were assigned at 


random to two groups: the normothermic group (core temperatures 
305 


were maintained at near normal, 36.5°C1526 mR se 5 C, 
using heating blankets) and the hypothermic group (core 
bw Perales were allowed to decrease to about 34.5°C 


1 eld meat C). If keeping patients warm during surgery 


alters the chance of infection, patients in the two groups should 
show a difference in the average length of their hospital stays. Here 
are summary Statistics on hospital stay (in number of days) for the 
two groups: 
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Group n _ = 0.200 = 20.0%— = 0.200 = 20.0% 
x 1526 r Sx 1526 x 

Normothermic 104 12.1 4.4 

Hypothermic 96 14.7 6.5 


a. Construct and interpret a 95% confidence interval for the 
difference in the true mean length of hospital stay for 
normothermic and hypothermic patients like these. 


b. Does your interval in part (a) suggest that keeping patients 
warm during surgery affects the average length of patients’ 
hospital stays? Justify your answer. 


c. Interpret the meaning of “95% confidence” in the context of this 
study. 


T10.12 A random sample of 100 of last year’s model of a certain popular 
car found that 20 had a specific minor defect in the brakes. The 
automaker adjusted the production process to try to reduce the 
proportion of cars with the brake problem. A random sample of 
350 of this year’s model found that 50 had the minor brake defect. 


a. Was the company’s adjustment successful? Carry out an 
appropriate test to support your answer. 


b. Based on your conclusion in part (a), which mistake—a Type I 
error or a Type II error—could have been made? Describe a 
possible consequence of this error. 


T10.13 “I can’t get through my day without coffee” is a common 
statement from many college students. They assume that the 
benefits of coffee include staying awake during lectures and 
remaining more alert during exams and tests. Students in a 
Statistics class designed an experiment to measure memory 
retention with and without drinking a cup of coffee 1 hour before a 
test. This experiment took place on two different days in the same 
week (Monday and Wednesday). Ten students were used. Each 
student received no coffee or one cup of coffee 1 hour before the 
test on a particular day. The test consisted of a series of words 
flashed on a screen, after which the student had to write down as 
many of the words as possible. On the other day, each student 
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received a different amount of coffee (none or one cup). 


a. One of the researchers suggested that all the subjects in the 
experiment drink no coffee before Monday’s test and one cup of 
coffee before Wednesday’s test. Explain to the researcher why 
this is a bad idea and suggest a better method of deciding when 
each subject receives the two treatments. 


b. The researchers actually used the better method of deciding 
when each subject receives the two treatments that you 
identified in part (a). For each subject, the number of words 
recalled when drinking no coffee and when drinking one cup of 
coffee is recorded in the table. Carry out an appropriate test to 
determine whether there is convincing evidence that drinking 
coffee improves memory, on average, for students like the ones 
in this study. 


Student No cup One cup 
1 24 25 
2 30 31 
3 22 23 
4 24 24 
fs) 26 27 
6 23 25 
7 26 28 
8 20 20 
9 2/7 ZF 
10 28 30 
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Chapter 10 Project Which Costs More: 
Diesel or Unleaded? 


When buying or leasing a new car, one of the factors that customers 
consider is the type of fuel it uses. Some people prefer vehicles that use 
diesel fuel, while others favor vehicles that use regular unleaded gasoline. 
Which of these two types of fuel costs more at the pump, on average? 
Does the answer to this question depend on the state where you live? 


Researchers collected data on the price per gallon (in dollars) of diesel 
fuel and regular unleaded gasoline from a random sample of gas stations in 
6 states: Colorado, Illinois, Indiana, Kansas, Missouri, and Ohio. The file 
gas prices ch 10 project.xls, which can be accessed from the book’s 
website at highschool.bfwpub.com/tps6e, contains data from a total of 82 
gas stations. Download the file to a computer for further analysis using the 
application specified by your teacher. Use the file provided to answer the 
following questions. 


. Start by calculating the difference (Diesel — Unleaded) in gas prices for 
all 82 stations, and store these values in a new column titled Difference. 


’. Make a graph to display the distribution of difference in gas prices. 
Describe the shape, center, and variability of the distribution. Are there 
any outliers? 


Construct and interpret a 95% confidence interval for the true mean 
difference. Does the interval provide convincing evidence of a 
difference in the mean price per gallon of diesel fuel and regular 
unleaded gasoline? 


—_ 
. 


|. Make a graph that compares the difference (Diesel — Unleaded) in gas 
prices for the 6 states from which the data were collected. Write a few 
sentences comparing the distributions. 


». We might expect the mean difference (Diesel — Unleaded) in gas prices 
to be the same for adjacent states. Choose 2 adjacent states from the data 
set. Then carry out an appropriate test to see if the data provide 
convincing evidence to contradict this expectation. 
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Cumulative AP® Practice Test 3 


Section I: Multiple Choice Choose the best answer. 


AP3.1 Suppose the probability that a softball player gets a hit in any 
single at-bat is 0.300. Assuming that her chance of getting a hit on 
a particular time at bat is independent of her other times at bat, 
what is the probability that she will not get a hit until her fourth 
time at bat in a game? 


4 , : 

a. (43)(0.3)1 (0. 7)31536 = 0.200 = 20.0% les (0.3) (0.7) 
: 4 3 j 

al ° 0.3)° (0.7 

— cenananyateim-nn(’) eae 
4 , | 

c. (41)(0.3)3 . 7s = 0.200 = 20.0% te 1 ) (0.3)" (0.7) 


d. (0.3)3(0. mis a" 0.200 = 20.0% (0.3)°(0.7)' 


e. (0.3)1(0.7)31525> 0.200 = 20.0% (0.3)'(0.7)° 


AP3.2 A survey asked a random sample of U.S. adults about their political 
party affiliation and how long they thought they would survive 
compared to most people in their community if an apocalyptic 
disaster were to strike. The responses are summarized in the 
following two-way table. 


Political affiliation 


Total 
Longer 314 
Expected About as 416 
survival long 
length Not as long 115 
Not sure 153 
Total 360 404 234 998 
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Suppose we select one of the survey respondents at random. Which 
of the following probabilities is the largest? 


a. P(Independent and Longer) 

b. P(Independent or Not as long) 
as 

c. P(Democrat |1526— 


= 0.200 = 20. 


0.200 = 


d. P(About as long sa 
e. P(About as long) 


AP3.3 Sports Illustrated planned to ask a random sample of Division I 
college athletes, “Do you believe performance-enhancing drugs are 
a problem in college sports?” Which of the following is the 
smallest number of athletes that must be interviewed to estimate 
the true proportion who believe performance-enhancing drugs are a 


problem within £2961506 i638 0.200 = 20.0% +2% with 90% confidence? 
a. 17 

b. 21 

c. 1680 

d. 1702 

e. 2401 


AP3.4 The distribution of grade point averages (GPAs) for a certain 
college is approximately Normal with a mean of 2.5 and a standard 
deviation of 0.6. The minimum possible GPA is 0.0 and the 
maximum possible GPA is 4.33. Any student with a GPA less than 
1.0 is put on probation, while any student with a GPA of 3.5 or 
higher is on the dean’s list. About what percent of students at the 
college are on probation or on the dean’s list? 


a. 0.6 
b. 4.7 
c. 5.4 
d. 94.6 
e. 95.3 
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AP3.5 Which of the following will increase the power of a significance 
test? 


a. Increase the Type II error probability. 
b. Decrease the sample size. 


c. Reject the null hypothesis only if the P-value is less than the 
significance level. 
305 


. “¢ —— = 0.200 = 20.0% 
d. Increase the significance level 01526 Qa, 


e. Select a value for the alternative hypothesis closer to the value 
of the null hypothesis. 


AP3.6 You can find some interesting polls online. Anyone can become 
part of the sample just by clicking on a response. One such poll 
asked, “Do you prefer watching first-run movies at a movie theater, 
or waiting until they are available to watch at home or on a digital 
device?” In all, 8896 people responded, with only 12% (1118 
people) saying they preferred theaters. You can conclude that 


a. American adults strongly prefer watching movies at home or on 
their digital devices. 


b. the high nonresponse rate prevents us from drawing a 
conclusion. 


c. the sample is too small to draw any conclusion. 


d. the poll uses voluntary response, so the results tell us little about 
all American adults. 


e. American adults strongly prefer seeing movies at a movie 
theater. 


AP3.7 A certain candy has different wrappers for various holidays. 
During Holiday 1, the candy wrappers are 30% silver, 30% red, 
and 40% pink. During Holiday 2, the wrappers are 50% silver and 
50% blue. In separate random samples of 40 candies on Holiday 1 
and 40 candies on Holiday 2, what are the mean and standard 
deviation of the total number of silver wrappers? 


a. 32, 18.4 
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b. 32, 6.06 
c. 32, 4.29 
d. 80, 18.4 
e. 80, 4.29 


AP3.8 A beef rancher randomly sampled 42 cattle from her large herd to 
obtain a 95% confidence interval for the mean weight (in pounds) 
of the cattle in the herd. The interval obtained was (1010, 1321). If 
the rancher had used a 98% confidence interval instead, the interval 
would have been 


a. wider with less precision than the original estimate. 

b. wider with more precision than the original estimate. 

c. wider with the same precision as the original estimate. 
d. narrower with less precision than the original estimate. 
e. narrower with more precision than the original estimate. 


AP3.9 School A has 400 students and School B has 2700 students. A local 
newspaper wants to compare the distributions of SAT scores for 
the two schools. Which of the following would be the most useful 
for making this comparison? 


a. Back-to-back stemplots for A and B 
b. A scatterplot of A versus B 


c. Two dotplots for A and B drawn on the same scale 


A. 


Two relative frequency histograms of A and B drawn on the 
same scale 


e. Two bar graphs for A and B drawn on the same scale 


AP3.10 Let X represent the outcome when a fair six-sided die is rolled. 
305, crave . 
For this random variable, pX=3.51526 — a ae aL OD 
305 usr 
oX=1.711526 9°" ~ *°’ox = 1.71, Tf the die is rolled 100 times, 


what is the approximate probability that the sum is at least 375? 
a. 0.0000 
b. 0.0017 


and 
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c. 0.0721 
d. 0.4420 
e. 0.9279 


AP3.11 An agricultural station is testing the yields for six different 
varieties of seed corn. The station has four large fields available, 
located in four distinctly different parts of the county. The 
agricultural researchers consider the climatic and soil conditions in 
the four parts of the county as being quite different, but are 
reasonably confident that the conditions within each field are fairly 
similar throughout. The researchers divide each field into six 
sections and then randomly assign one variety of corn seed to each 
section in that field. This procedure is done for each field. At the 
end of the growing season, the corn will be harvested, and the yield 
(measured in tons per acre) will be compared. Which one of the 
following statements about the design is correct? 


a. This is an observational study because the researchers are 
watching the com grow. 


b. This a randomized block design with fields as blocks and seed 
types as treatments. 


c. This is a randomized block design with seed types as blocks and 
fields as treatments. 


d. This is a completely randomized design because the six seed 
types were randomly assigned to the four fields. 


e. This is a completely randomized design with 24 treatments—6 
seed types and 4 fields. 


AP3.12 The correlation between the heights of fathers and the heights of 
their grownup sons, both measured in inches, is r=0.52 


305 
——. = 0.200 = 20.0% , 
1526 r = 0.52, If fathers’ heights were measured in feet 


instead, the correlation between heights of fathers and heights of 
sons would be 


a. much smaller than 0.52. 


b. slightly smaller than 0.52. 
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C. 
d. 


e. 


unchanged; equal to 0.52. 
slightly larger than 0.52. 
much larger than 0.52. 


AP3.13 A random sample of 200 New York State voters included 88 
Republicans, while a random sample of 300 California voters 
produced 141 Republicans. Which of the following represents the 
95% confidence interval for the true difference in the proportion of 
Republicans in New York State and California? 


a. 


(0.44-0.47)+1.96((0.44)(0.56)+(0.47)(0.53)200+300) 


. (0.44)(0.56)+(0.47)(0.53) ) 
305 ‘ ¢ or ; — 5 Oli it TS et kh RES 
== 0.200 = 20.0 (0.44 — 0.47) + 1.96 ( Se 


(0.44-0.47)+1.96((0.44)(0.56)200+(0.47)(0.53)300) 


= (0.44)(0.56)  (0.47)(0.53) 
30! - = Late linia! OA Di oa | a 
305. _ 9.209 = 20.094(0-44 — 0.47) + 1.96 ( + oe ) 


(0.44-0.47)+1.96(0.44)(0.56)200+(0.47)(0.53)300 

505 _ 9,200 = 20.09%(0.44 0.47) $1.96, / +} 
(0.44-0.47)+1.96(0.44)(0.56)+(0.47)(0.53)200+300 

305 _ 9.209 = 20.09(0.44 — 0.47) + 1.96) 


(0.44—0.47)+1.96(0.45)(0.55)200+(0.45)(0.55)300 
305 7 [WABVO-5) _— (0.45)(0-55) 
a5g = 0-200 = 20.0%(0.44 — 0.47) + 1.96  — oo 


AP3.14 Which of the following is not a property of a binomial setting? 


a. 
b. 


Cc. 


d. 


e. 


Outcomes of different trials are independent. 

The chance process consists of a fixed number of trials, n. 
The probability of success is the same for each trial. 
Trials are repeated until a success occurs. 


Each trial can result in either a success or a failure. 


AP3.15 Mrs. Woods and Mrs. Bryan are avid vegetable gardeners. They 
use different fertilizers, and each claims that hers is the best 
fertilizer to use when growing tomatoes. Both agree to do a study 
using the weight of their tomatoes as the response variable. Each 
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planted the same varieties of tomatoes on the same day and 
fertilized the plants on the same schedule throughout the growing 
season. At harvest time, each randomly selects 15 tomatoes from 
her garden and weighs them. After performing a two-sample t test 
on the difference in mean weights of tomatoes, they get t=5.24 

ad = 0.200 = 20.0% _ 5 94 ang P=0.0008;538~ 0.200 = 20.0% p _ 9 nang 
. Can the gardener with the larger mean claim that her fertilizer 
caused her tomatoes to be heavier? 


a. Yes, because a different fertilizer was used on each garden. 
b. Yes, because random samples were taken from each garden. 


c. Yes, because the P-value is so small. 


jar 


. No, because the condition of the soil in the two gardens is a 
potential confounding variable. 
305 


—— = 0.200 = 20.0%1r _ « 
e. No, because 15<301526 15 < 30. 


AP3.16 The Environmental Protection Agency (EPA) is charged with 
monitoring industrial emissions that pollute the atmosphere and 
water. So long as emission levels stay within specified guidelines, 
the EPA does not take action against the polluter. If the polluter 
violates regulations, the offender can be fined, forced to clean up 
the problem, or possibly closed. Suppose that for a particular 
industry the acceptable emission level has been set at no more than 
5 parts per million (5 ppm). The null and alternative hypotheses are 


5 305 
——= 0.200 = 20.0% “= ——= 0.200 = 20.0% 
HO: p=51526 Ho: = 5 versus Ha: H>5 1526 


Hq: > 5 Which of the following describes a Type II error? 
a. The EPA fails to find convincing evidence that emissions 


exceed acceptable limits when, in fact, they are within 
acceptable limits. 


b. The EPA finds convincing evidence that emissions exceed 
acceptable limits when, in fact, they are within acceptable 
limits. 


c. The EPA fails to find convincing evidence that emissions 
exceed acceptable limits when, in fact, they do exceed 
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acceptable limits. 


The EPA finds convincing evidence that emissions exceed 
acceptable limits when, in fact, they do exceed acceptable 
limits. 

The EPA takes more samples to ensure that they make the 
correct decision. 


AP3.17 Which of the following statements is false? 


a. 


A measure of center alone does not completely summarize a 
distribution of quantitative data. 


If the original measurements are in inches, converting them to 
centimeters will not change the mean or standard deviation. 


One of the disadvantages of a histogram is that it doesn’t show 
each data value. 


. Ina quantitative data set, adding a new data value equal to the 


mean will decrease the standard deviation. 


If a distribution of quantitative data is strongly skewed, the 
median and interquartile range should be reported rather than 
the mean and standard deviation. 


AP3.18 A 96% confidence interval for the proportion of the labor force 
that is unemployed in a certain city is (0.07, 0.10). Which of the 
following statements is true? 


a. 


The probability is 0.96 that between 7% and 10% of the labor 
force is unemployed. 


. About 96% of the intervals constructed by this method will 


contain the true proportion of the labor force that is unemployed 
in the city. 


In repeated samples of the same size, there is a 96% chance that 
the sample proportion will fall between 0.07 and 0.10. 


. The true rate of unemployment in the labor force lies within this 


interval 96% of the time. 


Between 7% and 10% of the labor force is unemployed 96% of 
the time. 
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AP3.19 A large toy company introduces many new toys to its product line 
each year. The company wants to predict the demand as measured 
by y, first-year sales (in millions of dollars) using x, awareness of 
the product (as measured by the percent of customers who had 
heard of the product by the end of the second month after its 
introduction). A random sample of 65 new products was taken, and 
a correlation of 0.96 was computed. Which of the following is 
true? 


a. The least-squares regression line accurately predicts first-year 
sales 96% of the time. 


b. About 92% of the time, the percent of people who have heard of 
the product by the end of the second month will correctly 
predict first-year sales. 


c. About 92% of first-year sales can be accounted for by the 
percent of people who have heard of the product by the end of 
the second month. 


d. For each increase of 1% in awareness of the new product, the 
predicted sales will go up by 0.96 million dollars. 


e. About 92% of the variation in first-year sales can be accounted 
for by the least-squares regression line with the percent of 
people who have heard of the product by the end of the second 
month as the explanatory variable. 


AP3.20 Final grades for a class are approximately Normally distributed 
with a mean of 76 and a standard deviation of 8. A professor says 
that the top 10% of the class will receive an A, the next 20% a B, 
the next 40% a C, the next 20% a D, and the bottom 10% an F. 
What is the approximate maximum grade a student could attain and 
still receive an F for the course? 


a. 70 

b. 69.27 
c. 65.75 
62.84 


A 
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AP3.21 National Park rangers keep data on the bears that inhabit their 
park. Here is a histogram of the weights of 143 bears measured in a 
recent year: 


35 4 


30 4 
25 
20 4 


15 5 


Frequency 


10 4 


54 


40 80 120 160 200 240 280 320 360 400 440 480 520 
Weight (Ib) 


Which of the following statements is correct? 


a. The median will lie in the interval (140, 180), and the mean will 
lie in the interval (180, 220). 


b. The median will lie in the interval (140, 180), and the mean will 
lie in the interval (260, 300). 


c. The median will lie in the interval (100, 140), and the mean will 
lie in the interval (180, 220). 


d. The mean will lie in the interval (140, 180), and the median will 
lie in the interval (260, 300). 


e. The mean will lie in the interval (100, 140), and the median will 
lie in the interval (180, 220). 


AP3.22 A random sample of size n will be selected from a population, and 


305 
= 0.200 = 20.0% ¢ 
the proportion p/1526 P of those in the sample who 


have a Facebook page will be calculated. How would the margin of 
error for a 95% confidence interval be affected if the sample size 
were increased from 50 to 200 and the sample proportion of people 
who have a Facebook page is unchanged? 


a. It remains the same. 
b. It is multiplied by 2. 
c. It is multiplied by 4. 
d. It is divided by 2. 
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e. It is divided by 4. 


AP3.23 A scatterplot and a least-squares regression line are shown in the 
figure. What effect does point P have on the slope of the regression 
line and the correlation? 


Point P increases the slope and increases the correlation. 


oT p 


. Point P increases the slope and decreases the correlation. 


c. Point P decreases the slope and decreases the correlation. 


jar 


. Point P decreases the slope and increases the correlation. 


e. No conclusion can be drawn because the other coordinates are 
unknown. 


AP3.24 The following dotplots show the average high temperatures (in 
degrees Celsius) for a sample of tourist cities from around the 
world. Both the January and July average high temperatures are 
shown. What is one statement that can be made with certainty from 
an analysis of the graphical display? 


eee se e se ee eee ee 
January om se wess oe @ S00 Sesee ose 


Month 


July 


20 2 24 26 28 30 
Average high temperature (°C) 


a. Every city has a larger average high temperature in July than in 
January. 


b. The distribution of temperatures in July is skewed right, while 
the distribution of temperatures in January is skewed left. 


c. The median average high temperature for January is higher than 
the median average high temperature for July. 


d. There appear to be outliers in the average high temperatures for 


1550 


January and July. 


e. There is more variability in average high temperatures in 
January than in July. 


AP3.25 Suppose the null and alternative hypotheses for a significance test 
are defined as 


HO: p= =4015007 PaO AG: pe 40 


Ha: «240ume = = 0.200 = 20.0%H,: pp < 40 
305 _ 0,200 = 20.0! 
Which of the following specific values for Hai526 — +H 
will give the highest power? 
305 _ - 


38° — 0.200 = 20.09 
b. p=391526 ‘pm = 39 
305 
Cc. p=411526 
30% — 0,200 = 20.0%, — 42 
d. p=421526 i a 
305 
e, =431526 — 


= 0.200 = 20.0%), — 41 


0.200 = 20.0%), — 43 


AP3.26 A large university is considering the establishment of a 
schoolwide recycling program. To gauge interest in the program by 
means of a questionnaire, the university takes separate random 
samples of undergraduate students, graduate students, faculty, and 
staff. This is an example of what type of sampling design? 


a. Simple random sample 

b. Stratified random sample 
c. Convenience sample 

d. Cluster sample 

e. Randomized block design 


AP3.27 Suppose the true proportion of people who use public 
transportation to get to work in the Washington, D.C., area is 0.45. 
In a simple random sample of 250 people who work in 
Washington, about how far do you expect the sample proportion to 
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be from the true proportion? 
a. 0.4975 

b. 0.2475 

0.0315 

0.0009 

e. 0 


ie) 


Pp 


Questions 28 and 29 refer to the following setting. According to sleep 
researchers, if you are between the ages of 12 and 18 years old, you need 9 
hours of sleep to function well. A simple random sample of 28 students 
was chosen from a large high school, and these students were asked how 
much sleep they got the previous night. The mean of the responses was 7.9 
hours with a standard deviation of 2.1 hours. 


AP3.28 If we are interested in whether students at this high school are 
getting too little sleep, which of the following represents the 
appropriate null and alternative hypotheses? 


305 
a. HO: p=7.91526 © 0.200 = 20.0% Ho: = 7.9 and Ha: U<7.9 


305 
5g = 0.200 = oad < 7.9 


= 0. 


b. HO: p= 7.91505 526 0200 = 20.0% Ho: = 7.9 and Ha: U#7.9 
305 


? — J A . 
apg = 0-200 = 20.0% H, =p # 7.9 


305 305 
—= 0.200 = 20.0% ._—— ——~ = 0.200 = 20.0% 
c. HO: p=91526 ‘Ho: = 9 and Ha: [1491526 : 
Hai #9 
ae 0.200 = 20.0 % H, e = 0.200 = 90 0% 
d. HO: p=91526 Ho: = 9 and Ha: «coum i 0; 
Ayiy<9 


305 
= 0.200 = 20.0! = 0.200 = 20.09 
e. HO: <n 526 “Ho: < 9 and Ha: >91526 : 


Hy: > 9 


AP3.29 Which of the following is the standardized test statistic for the 
hypothesis test? 


a. t=7.9-92.12815907 0200= 200% ag 
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305 _ 0.200 = 20 -, 

b. t=9-7.92.1281526 ~~“ VB 
+= 19-9 
309 _ 0,200 = 20.0% /32 

c. t=7.9-92.128156 “" ~~” «Vv 
7.9-9 

305 _ 0.200 = 20 m — 

d. t=7.9-92.1271526 ~“"  ~ v7 
9-7.9 

ee a, i 

e, t=9-7.92.1271526 ~~“ vir 


AP3.30 Shortly before the 2012 presidential election, a survey was taken 


by the school newspaper at a very large state university. Randomly 
selected students were asked, “Whom do you plan to vote for in the 
upcoming presidential election?” Here is a two-way table of the 
responses by political persuasion for 1850 students: 


Political persuasion 


Democrat | Republican |Independent| Total 


Obama 1029 
Candidate Romney 695 
of choice Other a1 
Undecided 105 
Total 1850 


Which of the following statements about these data is true? 
a. The percent of Republicans among the respondents is 41%. 


b. The marginal relative frequencies for the variable choice of 
candidate are given by Obama: 55.6%; Romney: 37.6%; Other: 
1.1%; Undecided: 5.7%. 


c. About 11.2% of Democrats reported that they planned to vote 
for Romney. 


d. About 44.6% of those who are undecided are Independents. 


e. The distribution of political persuasion among those for whom 
Romney is the candidate of choice is Democrat: 7.5%; 
Republican: 84.0%; Independent: 18.8%. 
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Section II: Free Response Show all your work. Indicate clearly the 
methods you use, because you will be graded on the correctness of your 
methods as well as on the accuracy and completeness of your results and 
explanations. 


AP3.31 A researcher wants to determine whether or not a 5-week crash 
diet is effective over a long period of time. A random sample of 15 
five-week crash dieters is selected. Each person’s weight (in 
pounds) is recorded before starting the diet and 1 year after it is 
concluded. Do the data provide convincing evidence that 5-week 
crash dieters weigh less, on average, 1 year after finishing the diet? 


Dieter 1 2 3 4 5 6 7 8 
Before 158 185 176 172 164 234 258 200 
After 163 182 188 150 161 220 235 191 
Dieter 9 10 11 12 13 14 15 
Before 228 246 198 221 236 255 231 
After 228 237 209 220 222 268 234 


AP3.32 Starting in the 1970s, medical technology has enabled babies with 
very low birth weight (VLBW, less than 1500 grams, or about 3.3 
pounds) to survive without major handicaps. It was noticed that 
these children nonetheless had difficulties in school and as adults. 
A long-term study has followed 242 randomly selected VLBW 
babies to age 20 years, along with a control group of 233 randomly 
selected babies from the same population who had normal birth 
weight.22 


a. Is this an experiment or an observational study? Why? 


b. At age 20, 179 of the VLBW group and 193 of the control 
group had graduated from high school. Do these data provide 
ee 
convincing evidence at the a=0.051526 ~ ae ey = O06 
significance level that the graduation rate among VLBW babies 
is less than for normal-birth-weight babies? 


AP3.33 A nuclear power plant releases water into a nearby lake every 
afternoon at 4:51 p.m. Environmental researchers are concerned 
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that fish are being driven away from the area around the plant. 
They believe that the temperature of the water discharged may be a 
factor. The scatterplot shows the temperature of the water (in 
degrees Celsius) released by the plant and the measured distance 
(in meters) from the outflow pipe of the plant to the nearest fish 
found in the water on eight randomly chosen afternoons. 


160 + 


Distance to nearest fish (m) 
zB 
l 


T Ul T T is 
15 20 25 30 35 40 


Temperature (°C) 


Here are computer output from a least-squares regression analysis 
on these data and a residual plot: 


Predictor Coef SE Coef T Pp 
Constant ~7 3.64 15.48 —4.76 0.003 
Temperature 5.7188 0.5612 -10.19 0.000 
S = 11.4175 R - Sq = 94.5% R - Sq(adj) = 93.6% 
20 7 Fe 
16 = 
3 0 = o 
f es 
-10 - ‘ 


—20 -4 T T T T T T T 
0 20 40 60 80 100 120 140 160 
Fitted value 
a. Explain why a linear model is appropriate for describing the 
relationship between temperature and distance to the nearest 
fish. 
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b. Write the equation of the least-squares regression line. Define 
any variables you use. 


c. Interpret the slope of the regression line. 


d. Compute the residual for the point (29, 78). Interpret this 
residual. 


AP3.34 The Candy Shoppe assembles gift boxes that contain 8 chocolate 
truffles and 2 handmade caramel nougats. The truffles have a mean 
weight of 2 ounces with a standard deviation of 0.5 ounce, and the 
nougats have a mean weight of 4 ounces with a standard deviation 
of 1 ounce. The empty boxes have mean weight 3 ounces with a 
standard deviation of 0.2 ounce. 


a. Assuming that the weights of the truffles, nougats, and boxes 
are independent, what are the mean and standard deviation of 
the weight of a box of candy? 


b. Assuming that the weights of the truffles, nougats, and boxes 
are approximately Normally distributed, what is the probability 
that a randomly selected box of candy will weigh more than 30 
ounces? 


c. If five gift boxes are randomly selected, what is the probability 
that at least one of them will weigh more than 30 ounces? 


d. If five gift boxes are randomly selected, what is the probability 
that the mean weight of the five boxes will be more than 30 
ounces? 


AP3.35 An investor is comparing two stocks, A and B. She wants to know 
if over the long run, there is a significant difference in the return on 
investment as measured by the percent increase or decrease in the 
price of the stock from its date of purchase. The investor takes a 
random sample of 50 annualized daily returns over the past 5 years 
for each stock. The data are summarized in the table. 


Stock Mean return Standard deviation 
A 11.8% 12.9% 


B 7.1% 9.6% 
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a. The investor uses the data to perform a two-sample t test of 


305 
——-= 0.200 = 20.0% F], - a 
HO: pA-pB=01526 Hy: Ha — HB = 9 versus 


305 
——— = 9 — oF é : 
Ha: pA-pB401526 0.200 = 20.0%H, : HA — HB FO where HA 
305 
305 _ 9.200 = 20.0% 14 — ' 
1526 A = the true mean annualized daily return for 
305 
—— = 0.200 = 20.0% 47 ! 
Stock A and pB1526 /B = the true mean annualized 


daily return for Stock B. The resulting P-value is 0.042. 
Interpret this value in context. What conclusion would you 
make? 


b. The investor believes that although the return on investment for 
Stock A usually exceeds that of Stock B, Stock A represents a 
riskier investment, where the risk is measured by the price 


305 

volatility of the stock. The sample variance ia =i See 
St is a statistical measure of the price volatility and indicates 
how much an investment’s actual performance during a 
specified period varies from its average performance over a 
longer period. Do the price fluctuations in Stock A significantly 
exceed those of Stock B, as measured by their variances? State 
an appropriate set of hypotheses that the investor is interested in 


testing. 


c. To measure this, we will construct a test statistic defined as 


F=larger sample variancesmaller sample variance 
larger sample variance 

305 on ek =e 

1526 7 0200 = 20.0% smaller sample variance 


Calculate the value of the F statistic using the information given 
in the table. Explain how the value of the statistic provides 
some evidence for the alternative hypothesis you stated in part 
(b). 

d. Two hundred simulated values of this test statistic, F, were 
calculated assuming that the two stocks have the same variance 
in daily price. The results of the simulation are displayed in the 
following dotplot. 
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Simulated value of F statistic 


Use these simulated values and the test statistic that you 
calculated in part (c) to determine whether the observed data 
provide convincing evidence that Stock A is a riskier 
investment than Stock B. Explain your reasoning. 
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Inference for 


Distributions of Categorical Data 


Andrey Gudkov/Alamy 


1559 


Chapter 11 Ap® Statistics Practice Test 


1560 


INTRODUCTION 


In Chapter 9, we discussed tests for the proportion of successes in a single 
population. These tests were based on a single categorical variable with 
values that were divided into two categories: success and failure. 
Sometimes we want to perform a test for the distribution of a categorical 
variable with two or more categories. The chi-square test for goodness of 
fit allows us to determine whether a hypothesized distribution seems valid. 
This test is useful in a field like genetics, where the laws of probability 
give the expected proportion of outcomes in each category. It will also 
help us decide if the birthdays of NHL players are uniformly distributed 
throughout the year. 


In Chapter 10, we discussed tests for a difference in proportions for 
two populations or treatments. Sometimes we’d like to compare the 
distribution of a categorical variable for two or more populations or 
treatments, where the variable can have two or more categories. We can 
decide whether the distribution of a categorical variable differs for two or 
more populations or treatments using a chi-square test for homogeneity. 
This test will help us answer the question: Does background music 
influence customer purchases? 


Tests for homogeneity use data summarized in a two-way table. It is 
also possible to use the information in a two-way table to study the 
relationship between two categorical variables. The chi-square test for 
independence allows us to determine if there is convincing evidence of an 
association between two variables in a population, such as anger level and 
heart disease status. 


Here’s an activity that gives you a taste (pun intended) of what lies 
ahead. 


ACTIVITY The candy man can 
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Ramon Rivera Moret 


Mars, Inc., is famous for its milk chocolate candies. Here’s what the 
company’s Consumer Affairs Department says about the distribution of 
color for M&M’S® Milk Chocolate Candies produced at its 
Hackettstown, New Jersey, factory: 


Brown: Red: 12.5% Yellow: 
12.5% 12.5% 
Green: Orange: Blue: 25% 
12.5% 25% 


The purpose of this activity is to investigate if the distribution of color 
in a large bag of M&M’S Milk Chocolate Candies differs from the 
claimed distribution. 


1. Your teacher will take a random sample of 60 M&M’S Milk 
Chocolate Candies from a large bag and give one or more pieces of 
candy to each student. As a class, count the number of candies of 
each color. Make a table on the board that summarizes these observed 
counts. 


2. How can you tell if the sample data give convincing evidence to 
dispute the company’s claim? Each team of 3 or 4 students should 
discuss this question and devise a formula for a test statistic that 
measures the difference between the observed and expected color 
distributions. The test statistic should yield a single number when the 
observed and expected values are plugged in. Also, larger differences 
between the observed and expected distributions should result in a 
larger value for the statistic. 


3. Each team will share its proposed test statistic with the class. Your 


2F 
teacher will then reveal how the chi-square test statisticy2X is 
calculated. 
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4. Discuss as a class: If your sample is consistent with the company’s 
claimed distribution of M&M’S® Milk Chocolate Candies colors, 
will the value of 2X” be large or small? If your sample is not 
consistent with the company’s claimed color distribution, will the 
value of y2Xx” be large or small? 


5. Compute the value of the chi-square test statistic for the class’s data. 


We can use simulation to determine if your class’s chi-square test 
Statistic is large enough to provide convincing evidence that the 
distribution of colors in the large bag is different from the company’s 
claim. To conduct the simulation, 100 random samples of size 60 were 
selected from a population of M&M7’S Milk Chocolate Candies that 
matches the company’s claim. For each random sample, the value of the 


y2X” test statistic was calculated and plotted on the dotplot. 
6. There is one dot at y2=16X" = 16 Explain what this dot represents. 


7. Where does your class’s value of y2x” fall relative to the other dots 
on the dotplot? What conclusion can you make about the distribution 
of colors in the large bag? 


ee eeece eeee 
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Simulated 7? test statistic 


The dotplot at the end of the activity shows the values of the chi-square 
test statistic that are likely to occur by chance alone when sampling from 
the company’s claimed M&M’S Milk Chocolate Candies color 
distribution. You may have noticed that the shape of the distribution isn’t 
approximately Normal. Will it always look like this? You will learn more 
about the sampling distribution of the chi-square test statistic shortly. 
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SECTION 11.1 Chi-Square Tests for Goodness of Fit 


LEARNING TARGETS By the end of the section, you should be able to: 


e State appropriate hypotheses and compute the expected counts and 
chi-square test statistic for a chi-square test for goodness of fit. 


State and check the Random, 10%, and Large Counts conditions for 
performing a chi-square test for goodness of fit. 


Calculate the degrees of freedom and P-value for a chi-square test 
for goodness of fit. 


Perform a chi-square test for goodness of fit. 


Conduct a follow-up analysis when the results of a chi-square test 
are Statistically significant. 


Jerome’s class did the “Candy man can” activity using a bag from the 
Hackettstown factory. The one-way table summarizes the data from the 
class’s sample of M&M’S Milk Chocolate Candies. 


Color Brown Red Yellow Green Orange Blue Total 
Count 12 3 7 9 9 20 ~=60 


The sample proportion of brown candies is p(=1260=0.20 


a : 
P = Go — 9-20 Because the company claims that 12.5% of M&M’S® Milk 


Chocolate Candies are brown, Jerome might believe that something fishy 
is going on. He could use the one-sample z test for a proportion from 
Chapter 9 to test the hypotheses 

HO: p=0.125 Ho: p = 0.125 


where p is the true proportion of M&M’S Milk Chocolate Candies in the 
large bag that are brown. He could then perform additional significance 
tests for each of the remaining colors. 
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Note that the correct alternative hypothesis Hall. a is two-sided. A sample proportion 


of brown candies much higher or much lower than 0.125 would give Jerome reason 


to be suspicious about the company’s claim. It’s not appropriate to adjust Hall a 


after looking at the sample data! 


Besides being fairly inefficient, this method would also lead to the 
problem of multiple tests, which we discussed in Section 9.3. More 
important, this approach wouldn’t tell us how likely it is to get a random 
sample of 60 candies with a color distribution that differs as much from 
the one claimed by the company as the class’s sample does, taking all the 
colors into consideration at one time. For that, we use a new kind of 
significance test, called a chi-square test for goodness of fit. 


Stating Hypotheses 


As with any significance test, we begin by stating hypotheses. The null 
hypothesis in a chi-square test for goodness of fit should state a claim 
about the distribution of a single categorical variable in the population of 
interest. In the “Candy man can” activity, the categorical variable we’re 
measuring is color and the population of interest is all the M&M’S Milk 
Chocolate Candies in the large bag. The appropriate null hypothesis is: 


H0:/40 : The distribution of color in the large bag of M&M’S Milk 
Chocolate Candies is the same as the claimed distribution. 


The alternative hypothesis in a chi-square test for goodness of fit is that 
the categorical variable does not have the specified distribution. For the 
“Candy man can” activity, our alternative hypothesis is 


Ha:4o : The distribution of color in the large bag of M&M’S Milk 
Chocolate Candies is not the same as the claimed distribution. 


Although we usually write the hypotheses in words, we can also write 
them in symbols. For example, here are the hypotheses for the “Candy 
man can” activity: 


HO: pbrown=0.125, pred=0.125, pyellow=0.125, pgreen=0.125, 
Ho: Pbrown >= 0.125, Pred = 0.125, Pyellow = 0.125, Pereen — 0.125, 
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porange=0.25, pblue =().25 Porange = 0.25, Pblue = 0.25 


Ha:At least two of the pi’s are incorrect 
H, : At least two of the p;’s are incorrect 


where Pcolor = the true proportion of M&M’S Milk Chocolate Candies in 
the large bag of that color. 


Why don’t we write the alternative hypothesis as “Ha:/4e : At least one 
of the pi’sPi $ is incorrect” instead? If the stated proportion in one 
category is wrong, then the stated proportion in at least one other category 
must be wrong because the sum of the pi’s?i § must be 1. 


oOo Don’t state the alternative hypothesis in a way that suggests 


that all the proportions in the hypothesized distribution are wrong. 
For instance, it would be incorrect to write 


Ha: pbrown#0.125, pred#0.125, pyellow#0.125, 
pgreen#0.125 porange#40.25, pblue#0.25 
H,: Pbrown f 0.125, Drea a 0.125, Pyenow - 0.125, Pereen # 0.125 


Porange a 0.25, Phiue # 0.25 


Comparing Observed and Expected Counts: The Chi- 
Square Test Statistic 


The idea of the chi-square test for goodness of fit is this: we compare the 
observed counts from our sample with the counts that would be expected if 
HO is true. (Remember: we always assume that HO is true when 
performing a significance test.) The more the observed counts differ from 
the expected counts, the more evidence we have against the null 
hypothesis and for the alternative hypothesis. 

Recall that Jerome’s class collected data from a random sample of 
M&M’S® Milk Chocolate Candies. How many candies of each color 
should they expect to find in their sample of 60 candies? Assuming that 
the color distribution stated by Mars, Inc., is true, 12.5% of the candies are 
brown. For random samples of 60 candies, the average number of brown 
candies should be (60)(0.125)=7.5(60) (0.125) = 7.5 This is our expected 
count of bbown M&M’S Milk Chocolate Candies. Using this same 
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method, we find the expected counts for the other color categories: 


Red: (60)(0.125)=7.5Yellow: (60)(0.125)=7.5Green: (60) 
(0.125)=7.5 Orange: (60)(0.25)=7.5 Blue: (60)(0.25)=7.5 


Red : (60) (0.125) = 7.5 
Yellow : (60) (0.125) = 7.5 
Green : (60) (0.125) = 7.5 
Orange : (60) (0.25) = 7.5 
Blue: (60) (0.25) = 7.5 


atte 


Ramon Rivera Moret 


CALCULATING EXPECTED COUNTS IN A CHI- 
SQUARE TEST FOR GOODNESS OF FIT 


The expected count for category i in the distribution of a categorical 
variable is 


npi?Pi 


where pi?i is the relative frequency for category i specified by the null 
hypothesis. 


Did you notice that the expected count sounds a lot like the expected 
value of a random variable from Chapter 6? That’s no coincidence. The 
number of M&M’S Milk Chocolate Candies of a specific color in a 
random sample of 60 candies is a binomial random variable. Its expected 
value is np, the average number of candies of this color in many samples 
of 60 M&M’S Milk Chocolate Candies. The expected count is not likely to 
be a whole number and shouldn’t be rounded to a whole number. 


To see if the data give convincing evidence for the alternative 
hypothesis, we compare the observed counts from our sample with the 
expected counts. If the observed counts are far from the expected counts, 
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that’s the evidence we were seeking. The table gives the observed and 
expected counts for the sample of 60 candies from Jerome’s class. Figure 
11.1 shows these counts as a side-by-side bar graph. 


Color Observed Expected 
Brown 12 7.9 
Red 3 75 
Yellow 7 7a 
Green 9 70 
Orange a 15.0 
Blue 20 15.0 


| Observed 
| Expected 


Brown Red Yellow Green Orange Blue 
Color 


FIGURE 11.1 Bar graph comparing observed and expected counts for Jerome’s 
class sample of 60 M&M’S® Milk Chocolate Candies. 


We see some fairly large differences between the observed and 
expected counts in several color categories. How likely is it that 
differences this large or larger would occur just by chance in random 
samples of size 60 from the population distribution claimed by Mars, Inc.? 
To answer this question, we calculate a statistic that measures how far 
apart the observed and expected counts are. The statistic we use to make 


the comparison is the chi-square test statistic y2X” (The symbol y* is the 
lowercase Greek letter chi, pronounced “kye” like “rye.”) 


AP® EXAM TIP 


The formula for the chi-square test statistic is included on the formula 


sheet that is provided on the AP® Statistics exam. However, it doesn’t 
include the word count: 
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2 


2 (Observed—Expected 
os 2, Expected 


X2=); (Observed—Expected)2Expected* 


We included the word count to emphasize that you must use the 
observed and expected counts—not the observed and expected 
proportions—when calculating the chi-square test statistic. 


DEFINITION Chi-square test statistic 


The chi-square test statistic is a measure of how far the observed 
counts are from the expected counts. The formula for the statistic is 


x2=)y. (Observed count—Expected count)2Expected count 


2 2 (Observed count—Expected count) 
x = Expected count 


where the sum is over all possible values of the categorical variable. 


For Jerome’s data, we add six terms—one for each color category: 


y2=(12-7.5)27.5+(3-7.5)27.5+(7-7.5)27.5+(9-7.5)27.5+(9-15)215+ 
(20-15)215=2.7+2.7+0.03+0.30+2.4+1.67=9.8 
(12—7.5)2 (3—7.5)2. (7—-7.5)2, (9—7.5)2, (9-15)? (20-15)? 
7.5 7.5 75 75. +S 15 
= 2.7+2.7+ 0.03 + 0.30 + 2.4+ 1.67 = 9.8 


to 


Here’s an example to help you practice what you have learned so far. 


EXAMPLE |A fair die & 


Hypotheses, expected counts, and the chi-square 
statistic 


PROBLEM: Carrie made a 6-sided die in her ceramics class and rolled 
it 90 times to test if each side was equally likely to show up. The table 
summarizes the outcomes of her 90 rolls. 


a. State the hypotheses that Carrie should test. 
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b. Calculate the expected count for each of the possible outcomes. 


c. Calculate the value of the chi-square test statistic. 


Outcome of roll 1 2 3 4 5 6 Total 


Frequency 12 28 2 13 10 15 90 


SOLUTION: 


a. H0:/0 : The sides of Carrie’s die are equally likely to show up. 


Ha:/a : The sides of Carrie’s die are not equally likely to show up. 


b. If HO/0 is true, each of the 6 sides should show up 1/6 of the time. 


C, 


The expected count is 90(1/6) = 1590 (1/6) = 15 for each side. 


Outcome of roll 1 2 3 4 5 6 = Total 
Expected count 15 15 15 15 15 15 90 


You can also state the null hypothesis as: 
Ho: Carrie’s die is fair. 
or 


Ho: The distribution of outcome for Carrie’s 


die is uniform. 
or 


Hig Spi = Po 5 p45 — pe 16 


x2 = (12 — 15)215+(28 — 15)215+(12 — 15)215+(13 — 15)215+(10 - 
15)215+(15 — 15)215 =0.6+11.27+0.6+0.27+1.67+0 = 14.41 

9 (12-15)? (28-15)? (12-15)? (13-15), (10-15)? (15-15)? 
x = 15 15 5 " we - 15 


= 0.6+4 11.27 + 0.6 + 0.27+1.67+0 
14.41 


x2=); (Observed count—Expected count)2Expected count 


9 (Observed count—Expected count)? 
i 2 es a 


Expected count 


FOR PRACTICE, TRY EXERCISE 1 
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Think About It 


WHY DO WE DIVIDE BY THE EXPECTED COUNT WHEN 
CALCULATING THE CHI-SQUARE TEST STATISTIC? In 
Jerome’s class sample, they got 4.5 more browns than expected 
(12-7.5)(12 ~ 7.5) and 5 more blues than expected (20-15)(20 — 15). 
Which of these is more surprising? 


Ramon Rivera Moret 


In both cases, the number of M&M’S® Milk Chocolate Candies in 
the sample exceeds the expected count by about the same amount. But 
it’s much more surprising to be off by 4.5 out of an expected 7.5 brown 
candies (a 60% discrepancy) than to be off by 5 out of an expected 15 
blue candies (a 33% discrepancy). For that reason, we want the category 
with a larger relative difference to contribute more heavily to the 
evidence against HO/0 and in favor of Ha/a measured by the y2Xx” 
test statistic. 

If we just computed (Observed Count-Expected Count)2 


2 
(ObservedCount — ExpectedCount)” £5; each category instead, the 


contributions of these two color categories would be about the same 
(with the contribution from blue being slightly larger): 


Brown: (12-7.5)2=20.25 Blue: (20-15)2=25 
Brown : (12 — 7.5)” = 20.25 Blue : (20 — 15)? = 25 


By using (Observed count—Expected count)2/Expected count 


(Observed count — Expected count)” /Expected count we guarantee 
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that the color category with the larger relative difference will contribute 
more heavily to the total: 
Brown: (12-7.5)27.5=2.7Blue: (20-15)215=1.67 
(12—7.5)? (20-15)? 


Brown: ——————-= 2.7 Blue: ——————= 1.67 
7.0 15 


CHECK YOUR UNDERSTANDING 


1. 


y Mars, Inc., reports that the M&M’S® Peanut Chocolate Candies 
from their Cleveland factory have the following distribution of color: 
20% blue, 20% orange, 20% green, 20% yellow, 10% red, and 10% 
brown. Joey bought a large bag of them and selected a random sample 
of 65 candies. He found 14 blue, 9 orange, 15 green, 14 yellow, 5 red, 
and 8 brown. 

1. State appropriate hypotheses for testing the company’s claim about 
the color distribution of M&M’S Peanut Chocolate Candies in Joey’s 
large bag. 

2. Calculate the expected count for each color. 


3. Calculate the chi-square test statistic for Joey’s sample. 


The Chi-Square Distributions and P-Values 


Think of y2x” as a measure of the distance the observed counts are from 
the expected counts. Like any distance, it is always zero or positive, and it 
is zero only when the observed counts are exactly equal to the expected 
counts. Large values of y2x” are stronger evidence for Hate because they 
say that the observed counts are far from what we would expect if HO0 
were true. Small values of 2x” suggest that the data are consistent with 
the null hypothesis. Is the value from Jerome’s class, y2=9.8X" = 9.8 4 
large value? You know the drill: compare the observed value 9.8 against 
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the sampling distribution that shows how yx" would vary in repeated 
random sampling if the null hypothesis were true. 

We used software to simulate taking 1000 random samples of size 60 
from the population distribution of M&M7’S Milk Chocolate Candies given 
by Mars, Inc. Figure 11.2 shows a dotplot of the values of the chi-square 
test statistic for these 1000 samples. 


In 87 of the 1000 simulated 
samples, the value of the 
chi-square test statistic was 
at least 9.8—the observed 


test statistic from Jerome’s 
| class. 
7 | i = eo e | 


Simulated 7? test statistic 
FIGURE 11.2 Dotplot showing values of the chi-square test statistic in 1000 
simulated samples of size n=60 


from the population 
distribution of M&M’S Milk Chocolate Candies stated by the company. 


Recall that larger values of y2Xx” give more convincing evidence 
against HO/0 and in favor of Ha/4«. According to the dotplot, 87 of the 
1000 simulated samples resulted in a chi-square test statistic of 9.8 or 
higher. Our estimated P-value is 87/ 1000=0.08787/ 1000 = 0.087 Because 
the P-value exceeds the default a=0.050 = 0.05 significance level, we fail 
to reject HOM0. We do not have convincing evidence that the color 
distribution in Jerome’s bag is different from the distribution claimed by 
the company. 

As Figure 11.2 suggests, the sampling distribution of the chi-square 
test statistic is not a Normal distribution. It is a right-skewed distribution 
that allows only nonnegative values because y2X” can never be negative. 
The sampling distribution of y2* ° differs depending on the number of 
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possible values for the categorical variable (i.e., on the number of 
categories). 

When the expected counts are all at least 5, the sampling distribution 
of the y2X” test statistic is modeled well by a chi-square distribution 
with degrees of freedom (df) equal to the number of categories minus 1. 
As with the ¢t distributions, there is a different chi-square distribution for 
each possible value of df. 


DEFINITION Chi-square distribution 


A chi-square distribution is defined by a density curve that takes 
only nonnegative values and is skewed to the right. A particular chi- 
square distribution is specified by its degrees of freedom. 


Figure 11.3 shows the density curves for three members of the chi- 
square family of distributions. As the degrees of freedom (df) increase, the 
density curves become less skewed, and larger values become more 
probable. 


Chi-square distribution 
with df = 1 


Chi-square distribution 
a with df = 4 


Chi-square distribution 
with df =8 


0 2 4 6 8 10 12 14 16 
FIGURE 11.3 The density curves for three members of the chi-square family of 
distributions. 


Here are two other interesting facts about the chi-square distributions: 


e The mean of a particular chi-square distribution is equal to its degrees of 
freedom. 


e For df>2df > 2 the mode (peak) of the chi-square density curve is at df 
— df — 2 


For example, when df=8df = 8 the chi-square distribution has a mean of 8 
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and a mode of 6. 

To get P-values from a chi-square distribution, we can use technology 
or Table C in the back of the book. For Jerome’s class data, y2=9.8 
x’ = 9.8 Because all the expected counts are at least 5, the y2X° test 
Statistic will be modeled well by a chi-square distribution when HOO is 
true. There are 6 color categories for M&M’S® Milk Chocolate Candies, 
so df=6-1=5df = 6-1=5, 

The P-value is the probability of getting a value of y2* ” as large as or 
larger than 9.8 when H0/40 is true. Figure 11.4 shows this probability as 
an area under the chi-square density curve with 5 degrees of freedom. 


Chi-square 
distribution 
with df =5 


V-=98 


FIGURE 11.4 The P-value for a chi-square test for goodness of fit using Jerome’s 
M&M’S Milk Chocolate Candies class data. 


To find the P-value using Table C, look in the df=5df = 5 row. The 
value y2=9.8X° = 9.8 falls between the critical values 9.24 and 11.07. The 
corresponding areas in the right tail of the chi-square distribution with 5 
degrees of freedom are 0.10 and 0.05. So the P-value for a test based on 
Jerome’s data is between 0.05 and 0.10. 


Now let’s look at how to find the P-value with your calculator. 


P 
df 5S 10 05 
4 6.74 7.78 9.49 
5 8.12 9.24 11.07 
6 9.45 10.64 12.59 


27. Technology Corner | FINDING P-VALUES FOR 
CHI-SQUARE TESTS |») 
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TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 

To find the P-value in the M&M’S® Milk Chocolate Candies example 
with your calculator, use the y2X * cdf command. We ask for the area 
between y2=9.8X° = 9.8 and a very large number (we’!I use 10,000) 
under the chi-square density curve with 5 degrees of freedom. 


x*cd#(9.8,10000,5) 
@. 8811046058 


Press (DISTR) and choose y2x” cdf(. 


OS 2.55 or later: In the dialog box, enter these values: 
lower:9.8, upper:10000, df:5, choose Paste, and then press 


[ENTER| 


Older OS: Complete the command y2x” cdf(9.8,10000,5) and 


press [ENTER|. 


As the calculator screen shot shows, this method gives a more precise 
P-value than Table C. 


Table C gives us an interval in which the P-value falls. The 
calculator’s y2X ” cdf command gives a result that is consistent with Table 
C but more precise. For that reason, we recommend using your calculator 
to compute P-values from a chi-square distribution. 

Based on Jerome’s sample, what conclusion should we draw about HO: 
Ao : The distribution of color in the large bag of M&M’S Milk Chocolate 
Candies is the same as the claimed distribution? Because our P-value of 
0.081 is greater than a=0.05a = 0.05 we fail to reject HO/0. We don’t 
have convincing evidence that the distribution of color in the large bag of 
M&M’S Milk Chocolate Candies differs from the claimed distribution. 
This is consistent with the results of the simulation from the activity. 

ow Failing to reject Hg does not mean that the null hypothesis is 


true! That is, we can’t conclude that the bag has the color distribution 
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claimed by Mars, Inc. All we can say is that the sample data did not 
provide convincing evidence to reject HOM . 


EXAMPLE | Return of the fair die (») 


Finding a P-value 


PROBLEM: Carrie made a 6-sided die in her ceramics class and rolled 
it 90 times to test if each side was equally likely to show up. The table 
summarizes the observed and expected counts. 


Outcome of roll 1 2 3 4 5 6 Total 
Observed count 12 28 12 13 10 15 90 
Expected count 15 15 15 15 15 15 90 


In the preceding example, we calculated y2=14.41X° = 14.41 Find the 
P-value using Table C. Then calculate a more precise value using 
technology. 


SOLUTION: 
ees = 6—-1=5 


Using Table C: the P-value is between 0.01 and 0.02. 


Using technology: x2 cdf(lower:14.41, upper:10000, df:5)=0.0132 
x” cdf(lower:14.41, upper:10000, df:5) = 0.0132. 


df 5 # of categories -1 


Chi-square distribution 
with df = 5 
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FOR PRACTICE, TRY EXERCISE 3 


Because the P-value is less than a=0.05a = 0.05 we should reject HO 
Ho. There is convincing evidence that Carrie’s die is unfair. 


CHECK YOUR UNDERSTANDING 


(\ 


\ 


y Let’s continue our analysis of Joey’s sample of M&M’S® Peanut 

Chocolate Candies from the preceding Check Your Understanding (page 

714). 

1. Confirm that the expected counts are large enough to use a chi-square 
distribution to calculate the P-value. Which degrees of freedom 
should you use? 


2. Use Table C to find the P-value. Then use your calculator’s y2x” cdf 
command. 


3. What conclusion would you draw about the company’s claimed color 
distribution for M&M’S Peanut Chocolate Candies? 


Carrying Out a Test 


Like our test for a population proportion, the chi-square test for goodness 
of fit uses an approximation that becomes more accurate as we take larger 
samples. The Large Counts condition says that all expected counts must 
be at least 5. Before performing a test, we must also check that the 
Random and 10% conditions are met. 


CONDITIONS FOR PERFORMING A CHI-SQUARE 
TEST FOR INDEPENDENCE 


e Random: The data come from a random sample from the population 
of interest. 
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o 10%: When sampling without replacement, n<0.10 Nn < 0.10 N 


e Large Counts: All expected counts are at least 5. 


AP® EXAM TIP 


When checking the Large Counts condition, be sure to examine the 
expected counts, not the observed counts. And make sure to write and 
label the expected counts on your paper or you won’t receive credit. 


When we want to compare the distribution of a categorical variable in 
one population to a claimed distribution, we use a chi-square test for 
goodness of fit. 


THE CHI-SQUARE TEST FOR GOODNESS OF FIT 


Suppose the conditions are met. To perform a test of 


H0:40 : The stated distribution of a categorical variable in the 
population of interest is correct 


compute the chi-square test statistic: 


x2=>\(Observed count — Expected count)2Expected count 
‘ (Observed count — Expected count)? 


x =k Expected count 


where the sum is over the k different categories. The P-value is the area 
to the right of yx under the chi-square density curve with k-14 — 1 
degrees of freedom. 


The next example shows the chi-square test for goodness of fit in 
action. As always, we follow the four-step process when performing 
inference. 


EXAMPLE |Birthdays in hockey (>) 


A test for equal proportions 
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Canva Pty Ltd/Alamy 


PROBLEM: In his book Outliers, Malcolm Gladwell suggests that a 
hockey player’s birth month has a big influence on his chance to make it 
to the highest levels of the game. Specifically, because January 1 is the 
cut-off date for youth leagues in Canada [where many National Hockey 
League (NHL) players come from], players born in January will be 
competing against players up to 12 months younger. The older players 
tend to be bigger, stronger, and more coordinated and hence get more 
playing time, more coaching, and have a better chance of being 
successful. 

To see if birth date is related to success (judged by whether a player 
makes it into the NHL), a random sample of 80 NHL players from a 
recent season was selected and their birthdays were recorded. The one- 
way table summarizes the data on birthdays for these 80 players. 


Birthday Jan-Mar Apr—Jun Jul-Sep Oct-Dec 
Number of players 32 20 16 12 


Do these data provide convincing evidence that the birthdays of NHL 
players are not uniformly distributed across the four quarters of the 
year? 


SOLUTION: 


Sie: 


Ho:/40 : The birthdays of all NHL players are uniformly distributed 
across the four quarters of the year. 


Ha:4a : The birthdays of all NHL players are not uniformly distributed 
across the four quarters of the year. 


1580 


Follow the four-step process! 


We could write the hypotheses in symbols as HO:pJan—Mar=pApr 
—Jun=pJul-—Sep=pOct-—Dec=1/4 


Ho > PJan—Mar = PApr—Jun = PJul—Sep = POct—Dec = 1/4 


H,: At least two of the proportions are not 1/4 


We'll use a = 0.05.a = 0.05. 
PLAN: Chi-square test for goodness of fit 


e Random: The data came from a random sample of NHL players. / 


o 10%: We must assume that 80 is less than 10% of all NHL players. 
"A 


e Large Counts: All expected counts = 80(1/4) = 2025 
counts = 80(1/4) = 20 > 5 y 


There were 879 NHL players in the population from which we selected 
our sample. 


There is some evidence in favor of Ha/4a because the observed counts 


differ from the expected counts. 


DO: 
e Test Statistic: 
y2=(32 — 20)220+(20 — 20)220+(16 - 20)220+ 
(12 — 20)220=7.2+0+0.8+3.2 = 11.2 
(32 —20)2 (20-20) (16—20)% (12 - 20)? 
0 2 20. 20 
= 7.24040.843.2 = 11.2 


bo 


* P-value: df=4-1=3¢f =4-1=3 
Using Table C: The P-value is between 0.01 and 0.02. 
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Using technology: y2cdf(lower:11.2, upper:10000, df:3)=0.011 
x’ cdf(lower : 11.2, upper : 10000, df : 3) = 0.011 


CONCLUDE: Because the P-value of 0.011<a = 0.05, 

0.011 < a = 0.05, we reject Ho.40- We have convincing evidence 
that the birthdays of NHL players are not uniformly distributed across 
the four quarters of the year. 


Chi-square distribution 
with df =3 


FOR PRACTICE, TRY EXERCISE 9 


You can use your calculator to carry out the “Do” step for a chi-square 
test for goodness of fit. Remember that using your calculator comes with 
potential benefits and risks on the AP® Statistics exam. 


28. Technology Corner |PERFORMING A CHI- 
SQUARE TEST FOR GOODNESS OF FI (>) 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 

You can use the TI-84 to perform the calculations for a chi-square test 
for goodness of fit. We’ll use the data from the hockey and birthdays 
example to illustrate the steps. 


Birthday Observed Expected 
Jan—Mar oe 20 
Apr—Jun 20 20 
Jul-Sep 16 20 
Oct—Dec 12 20 


1. Enter the observed counts in L1 and the expected counts in L2. 
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2. Press [STAT], arrow over to TESTS, and choose y2x” GOR Nest. 


Note: TI-83s and some older TI-84s don’t have this test. TI-84 users 
can get this functionality by upgrading their operating systems. 


3. Enter the inputs shown. If you choose Calculate, you’ ll get a screen 
with the test statistic, P-value, and df. If you choose the Draw option, 
you’ll get a picture of the appropriate chi-square distribution with the 
test statistic marked and shaded area corresponding to the P-value. 


ORMAL FLOAT AUTO REAL RADIAN CL 1 pease’ ieee ORMAL FLOAT AUTO REAL RADIAN CL i 


X%2GOF-Test X%2GOF-Test 
Observed:Li =11.2 | 
p=. 0106921291 


Expected:L2 


ee BLUE 


Calculate Draw 


CNTRB={7.2 @ .8 3.2} 


x2G0F-Test 
%2211.2 >=.0107 


We'll discuss the CNTRB results shortly. 


AP® EXAM TIP 


You can use your calculator to carry out the mechanics of a significance 
test on the AP® Statistics exam. But there’s a risk involved. If you just 
give the calculator answer with no work, and one or more of your 
values is incorrect, you will likely get no credit for the “Do” step. We 
recommend writing out the first few terms of the chi-square calculation 
followed by “...”. This approach might help you earn partial credit if 
you enter a number incorrectly. Be sure to name the procedure (chi- 
square test for goodness of fit) and to report the test statistic (y2=11.2) 


(x° = 11.2) degrees of freedom (df=3)(4f = 3) and P-value (0.011). 


FOLLOW-UP ANALYSIS In the chi-square test for goodness of fit, we 
test the null hypothesis that a categorical variable has a specified 
distribution in the population of interest. If the sample data lead to a 
statistically significant result, we can conclude that our variable has a 
distribution different from the one stated. To investigate how the 
distribution is different, start by identifying the categories that contribute 
the most to the chi-square statistic. Then describe how the observed and 
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expected counts differ in those categories, noting the direction of the 


difference. 


A 
| 
} 
f 
Gt 


belchonock /Depositphoto 


Let’s return to the hockey and birthdays example. The table of 
observed and expected counts for the 80 randomly selected NHL players is 
repeated on the following page. The last column shows the contributions 
(also called components) of the chi-square test statistic. The two biggest 
contributions to the chi-square statistic came from Jan—Mar and Oct—Dec. 
In January through March, 12 more players were born than expected. In 
October through December, 8 fewer players were born than expected. 
These results support Malcolm Gladwell’s claim that NHL players are 
more likely to be born early in the year. 


Birthday 
Jan—Mar 
Apr—Jun 
Jul-Sep 
Oct—Dec 


Observed 


oe 
20 
16 
12 


Expected 


20 
20 
20 
20 


O-E 


(O-E)/E 
7.2 
0.0 
0.8 
3.2 


Note: When we ran the chi-square test for goodness of fit on the 
calculator, a list of these individual components was produced and stored 
in the list menu. On the TI-84, the list is called CNTRB (for contribution). 


CHECK YOUR UNDERSTANDING 
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y Does the warm, sunny weather in Arizona affect a driver’s choice of 
car color? Cass thinks that Arizona drivers might opt for a lighter color 
with the hope that it will reflect some of the heat from the sun. To see if 
the distribution of car colors in Oro Valley, near Tucson, is different 
from the distribution of car colors across North America, she selected a 
random sample of 300 cars in Oro Valley. The table shows the 
distribution of car color for Cass’s sample in Oro Valley and the 


distribution of car color in North America, according to www.ppg.com.+ 


Color White Black Gray Silver Red Blue Green Other Total 
Oro Valley 84 38 31 46 27° «29 6 39 300 
sample 


North America 23% 18% 16% 15% 10% 9% 2% 7% 100% 


1. Do these data provide convincing evidence that the distribution of car 
color in Oro Valley differs from the North American distribution? 


2. If there is convincing evidence of a difference in the distribution of 
car color, perform a follow-up analysis. 


Section 11.1 Summary 


e The chi-square test for goodness of fit tests the null hypothesis that a 
categorical variable has a specified distribution in the population of 
interest. The alternative hypothesis is that the variable does not have the 
specified distribution in the population of interest. 


e This test compares the observed count in each category with the counts 
that would be expected if Hoo were true. The expected count for any 
category is found by multiplying the sample size by the proportion in 
each category according to the null hypothesis. 


e The chi-square test statistic is 


x2=); (Observed count — Expected count)2Expected count 
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‘ihe ss (Observed count — Expected count)? 
Expected count 


where the sum is over all possible categories. 
The conditions for performing a chi-square test for goodness of fit are: 


: Random: The data come from a random sample from the population 
of interest. 


o 10%: When sampling without replacement, n<0.10 N.n < 0.10 .N. 
: Large Counts: All expected counts are at least 5. 


When the conditions are met, the sampling distribution of the y2* test 
Statistic can be modeled by a chi-square distribution. 


Large values of y2x” are evidence against HO/0 and in favor of Ha/a. 


The P-value is the area to the right of y2X under the chi-square density 


curve with degrees of freedom df=number of categories—1 
df = number of categories —1 


If the test finds a statistically significant result, consider doing a follow- 
up analysis that looks for the largest contributions to the chi-square test 
Statistic and compares the observed and expected counts. 


11.1 Technology Corners | > 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 
27. Finding P-values for chi-square tests Page 716 
28. Performing a chi-square test for goodness of fit Page 719 


Section 11.1 Exercises 


1. pg 712 |») Aw, nuts! A company claims that each batch of its 


deluxe mixed nuts contains 52% cashews, 27% almonds, 13% 
macadamia nuts, and 8% Brazil nuts. To test this claim, a quality- 
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control inspector takes a random sample of 150 nuts from the latest 
batch. The table displays the sample data. 


Type of nut Cashew Almond Macadamia Brazil 
Count 83 29 20 18 


a. State appropriate hypotheses for performing a test of the company’s 
claim. 


b. Calculate the expected count for each type of nut. 


c. Calculate the value of the chi-square test statistic. 

2. Roulette Casinos are required to verify that their games operate as 
advertised. American roulette wheels have 38 slots—18 red, 18 black, 
and 2 green. In one casino, managers record data from a random 
sample of 200 spins of one of their American roulette wheels. The 
table displays the results. 


Color Red Black Green 
Count 85 99 16 


a. State appropriate hypotheses for testing whether these data give 
convincing evidence that the distribution of outcomes on this wheel 
is not what it should be. 


b. Calculate the expected count for each color. 


c. Calculate the value of the chi-square test statistic. 
3. pg 716 (pm) P-values For each of the following, find the P-value 


using Table C. Then calculate a more precise value using technology. 
a. y2=19.03, df=11X" = 19-03, df = 11 
b. y2=19.03, df=3X" = 19.03, df = 3 


4. More P-values For each of the following, find the P-value using Table 
C. Then calculate a more precise value using technology. 


a. y2=4.49, df=5x° = 4.49, df = 5 


b. y2=4.49, df=1x° = 4-49, df= 1 
5. Aw, nuts! Refer to Exercise 1. 


a. Confirm that the expected counts are large enough to use a chi- 
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square distribution to calculate the P-value. What degrees of 
freedom should you use? 


b. Use Table C to find the P-value. Then use your calculator’s y2x” 
cdf command. 


c. What conclusion would you draw about the company’s claimed 
distribution for its deluxe mixed nuts? 
6. Roulette Refer to Exercise 2. 


a. Confirm that the expected counts are large enough to use a chi- 
square distribution to calculate the P-value. What degrees of 
freedom should you use? 


b. Use Table C to find the P-value. Then use your calculator’s y2x” 
cdf command. 


c. What conclusion would you draw about whether or not the roulette 
wheel is operating correctly? 

7. No chi-square A school’s principal wants to know if students spend 
about the same amount of time on homework each night of the week. 
She asks a random sample of 50 students to keep track of their 
homework time for a week. The following table displays the average 
amount of time (in minutes) students reported per night. 


Night Sunday Monday Tuesday Wednesday Thursday Friday Saturday 
Average time 130 108 115 104 99 37 62 


Explain carefully why it would not be appropriate to perform a chi- 
square test for goodness of fit using these data. 

8. No chi-square The principal in Exercise 7 also asked the random 
sample of students to record whether they did all of the homework that 
was assigned on each of the five school days that week. Here are the 
data: 


School day Monday Tuesday Wednesday Thursday Friday 
Number who did all homework 34 29 a2 28 19 


Explain carefully why it would not be appropriate to perform a chi- 
square test for goodness of fit using these data. 
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9. pg 718 |») Munching Froot Loops Kellogg’s Froot Loops cereal 


comes in six colors: orange, yellow, purple, red, blue, and green. 
Charise randomly selected 120 loops and noted the color of each. Here 
are her data: 


Color Orange Yellow Purple Red Blue Green 
Count 28 21 16 25 14 16 


Do these data provide convincing evidence at the 5% significance 
level that Kellogg’s Froot Loops do not contain an equal proportion of 
each color? 

10. What’s your sign? The University of Chicago’s General Social 
Survey (GSS) is the nation’s most important social science sample 
survey. For reasons known only to social scientists, the GSS regularly 
asks a random sample of people their astrological sign. Here are the 
counts of responses from a recent GSS of 4344 people: 


Sign Aries ‘Taurus Gemini Cancer Leo Virgo 
Count 321 360 367 374 383 402 
Sign Libra Scorpio Sagittarius Capricorn Aquarius _— Pisces 


Count 392 329 331 354 376 355 


If births are spread uniformly across the year, we expect all 12 signs to 
be equally likely. Do these data provide convincing evidence at the 1% 
significance level that all 12 signs are not equally likely? 

11. Fruit flies Biologists wish to mate pairs of fruit flies having genetic 
makeup RrCc, indicating that each has one dominant gene (R) and one 
recessive gene (r) for eye color, along with one dominant (C) and one 
recessive (c) gene for wing type. Each offspring will receive one gene 
for each of the two traits from each parent, so the biologists predict 
that the following phenotypes should occur in a ratio of 9:3:3:1: 


Phenotype Redeyesand Redeyesand White eyesand White eyes and 
Straight wings — curly wings straight wings curly wings 


Frequency te fe 42 49 10 


Assume that the conditions for inference are met. Carry out a test at 
the a=0.05a = 0.05 significance level of the proposed genetic model. 
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12. You say tomato The paper “Linkage Studies of the Tomato” 
(Transactions of the Canadian Institute, 1931) reported the following 
data on phenotypes resulting from crossing tall cut-leaf tomatoes with 
dwarf potato-leaf tomatoes. We wish to investigate whether the 
following frequencies are consistent with genetic laws, which state that 
the phenotypes should occur in the ratio 9:3:3:1. 


Phenotype Tall cut Tall potato Dwarf cut Dwarf potato 
Frequency 926 288 293 104 


Assume that the conditions for inference are met. Carry out a test at 
the a=0.05a = 0.05 significance level of the proposed genetic model. 
13. Birds in the trees Researchers studied the behavior of birds that were 
searching for seeds and insects in an Oregon forest. In this forest, 54% 
of the trees are Douglas firs, 40% are ponderosa pines, and 6% are 
other types of trees. At a randomly selected time during the day, the 
researchers observed 156 red-breasted nuthatches: 70 were seen in 


Douglas firs, 79 in ponderosa pines, and 7 in other types of trees.? 


a. Do these data provide convincing evidence that nuthatches prefer 
particular types of trees when they’re searching for seeds and 
insects? 


b. Relative to the proportion of each tree type in the forest, which type 
of trees do the nuthatches seem to prefer the most? The least? 

14. Seagulls by the seashore Do seagulls show a preference for where 
they land? To answer this question, biologists conducted a study in an 
enclosed outdoor space with a piece of shore whose area was made up 
of 56% sand, 29% mud, and 15% rocks. The biologists chose 200 
seagulls at random. Each seagull was released into the outdoor space 
on its own and observed until it landed somewhere on the piece of 
shore. In all, 128 seagulls landed on the sand, 61 landed in the mud, 
and 11 landed on the rocks. 


a. Do these data provide convincing evidence that seagulls show a 
preference for where they land? 


b. Relative to the proportion of each ground type on the shore, which 
type of ground do the seagulls seem to prefer the most? The least? 
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15. Mendel and the peas Gregor Mendel (1822-1884), an Austrian monk, 
is considered the father of genetics. Mendel studied the inheritance of 
various traits in pea plants. One such trait is whether the pea is smooth 
or wrinkled. Mendel predicted a ratio of 3 smooth peas for every 1 
wrinkled pea. In one experiment, he observed 423 smooth and 133 
wrinkled peas. Assume that the conditions for inference are met. 


a. Carry out a chi-square test for goodness of fit for the genetic model 
that Mendel predicted. 


b. In Chapter 9, Exercise 49, you tested Mendel’s prediction using a 
one-sample z test for a proportion. The hypotheses were HO: p=0.75 
Ho : p= 0.75 and Ha: p#0.754a ‘pF 0.75 where p = true 
proportion of smooth peas. Calculate the z statistic and P-value for 
this test. How do these values compare to the values from part (a)? 
16. Spinning heads? When a fair coin is flipped, we all know that the 
probability the coin lands on heads is 0.50. However, what if a coin is 
spun? According to the article “Euro Coin Accused of Unfair 
Flipping” in the New Scientist, two Polish math professors and their 
students spun a Belgian euro coin 250 times. It landed heads 140 
times. One of the professors concluded that the coin was minted 
asymmetrically. A representative from the Belgian mint indicated the 
result was just chance. Assume that the conditions for inference are 
met. 


a. Carry out a chi-square test for goodness of fit to test if heads and 
tails are equally likely when a euro coin is spun. 


b. In Chapter 9, Exercise 50, you analyzed these data with a one- 
sample z test for a proportion. The hypotheses were HO: p=0.5 
Ho : p= 0.5 and Ha: pz0.5a ‘pF 0.5 where p = the true 
proportion of heads. Calculate the z statistic and P-value for this 
test. How do these values compare to the values from part (a)? 

17. Skittles® Statistics teacher Jason Molesky contacted Mars, Inc., to ask 
about the color distribution for Skittles candies. Here is an excerpt 
from the response he received: “The original flavor blend for the 
Skittles Bite Size Candies is lemon, green apple, orange, strawberry 
and grape. They were chosen as a result of consumer preference tests 
we conducted. The flavor blend is 20 percent of each flavor.” 
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18. 


State appropriate hypotheses for a significance test of the 
company’s claim. 


. Find the expected counts for a random sample of 60 candies. 


How large a y2X° test statistic would you need to have significant 
evidence against the company’s claim at the a=0.05a = 0.05 level? 
At the a=0.010 = 0.01 level? 


. Create a set of observed counts for a random sample of 60 candies 


that gives a P-value between 0.01 and 0.05. Show the calculation of 
your chi-square test statistic. 


Is your random number generator working? Use your calculator’s 
RandInt function to generate 200 digits from 0 to 9 and store them in a 
list. 


a. 


State appropriate hypotheses for a chi-square test for goodness of fit 
to determine whether your calculator’s random number generator 
gives each digit an equal chance of being generated. 


Carry out a test at the a=0.05a = 0.05 significance level. Hint: To 
obtain the observed counts, make a histogram of the list containing 
the 200 random digits, and use the trace feature to see how many of 
each digit were generated. You may have to adjust your window to 
go from —0.5~9-9 to 9.5 with an increment of 1. 


Assuming that a student’s calculator is working properly, what is 
the probability that the student will make a Type I error in part (b)? 


Suppose that 25 students in an AP® Statistics class independently 
do this exercise for homework and that all of their calculators are 
working properly. Find the probability that at least one of them 
makes a Type I error. 


Multiple Choice: Select the best answer for Exercises 19-22. 


Exercises 19-21 refer to the following setting. The manager of a high 
school cafeteria is planning to offer several new types of food for 
student lunches in the new school year. She wants to know if each type 
of food will be equally popular so she can start ordering supplies and 
making other plans. To find out, she selects a random sample of 100 
students and asks them, “Which type of food do you prefer: Ramen, 
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tacos, pizza, or hamburgers?” Here are her data: 


Type of food Ramen Tacos Pizza Hamburgers 
Count 18 22 39 21 


19. An appropriate null hypothesis to test whether the food choices are 
equally popular is 


a. HO: p=2544 0: # = 295 where p = the mean number of students that 
prefer each type of food. 


b. HO: p=0.2540 : p = 0.25 where p = the proportion of all students 
who prefer ramen. 

c. HO: nR=nT=nP=nH=254 : nr = nr = np = NH = 25 where nR 
”R js the number of students in the school who would choose 
ramen, and so on. 

d. HO: pR=pT=pP=pH=0.2540 : pr = pr = pp = Pu = 9.25 where 
pRPR is the proportion of students in the school who would choose 
ramen, and so on. 

e. HO: p\R=pAT=pP=p/H=0.25440: Pr = Pr = Pp = Py = 0.25, 
where pARPR is the proportion of students in the sample who chose 
ramen, and so on. 

20. The chi-square test statistic is 
a. (18-25)225+(22-25)225+(39-25)225+(21-25)225 


(18—25)* (22—25)? (39-25)? (21-25)? 


25 25 25 25 


b. (25-18)218+(25-22)222+(25-39)239+(25-21)221 
(25-18)? (25-22)? + (25—39)? 4. (25-21)? 


18 ao 22 39 21 


c. (18-25)25+(22-25)25+(39-25)25+(21-25)25 
(18—25) # (22-25) (39-25) | (21-25) 


e ' . 
25 25 25 25 


d. (18-25)2100+(22-25)2100+(39-25)2100+(21-25)2100 
(18-25)? (22-25)? (39—25)* (21—25)* 


100 100 100 100 


e. (0.18-0.25)20.25+(0.22—0.25)20.25+(0.39-0.25)20.25+ 
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(0.18—0.25)* (0.22—0.25)* (0.39—0.25)? (0.21—0.25)* 


(0.21-0.25)20.25 0.25 0.25 0.25 0.25 


21. The P-value for a chi-square test for goodness of fit is 0.0129. Which 
of the following is the most appropriate conclusion at a significance 
level of 0.05? 


a. 


Because 0.0129 is less than a=0.05a = 0.05 reject HO#0. There is 
convincing evidence that the food choices are equally popular. 


Because 0.0129 is less than a=0.05a = 0.05 reject HOM0. There is 
not convincing evidence that the food choices are equally popular. 


Because 0.0129 is less than a=0.05a = 0.05 reject HOM0. There is 
convincing evidence that the food choices are not equally popular. 


. Because 0.0129 is less than a=0.05e@ = 0.05 fail to reject HO. 


There is not convincing evidence that the food choices are equally 
popular. 


Because 0.0129 is less than a=0.05 = 0.05 fail to reject HOMO. 
There is convincing evidence that the food choices are equally 
popular. 


22. Which of the following is false? 


a. 


e. 


A chi-square distribution with k degrees of freedom is more right- 
skewed than a chi-square distribution with k+1* + 1 degrees of 
freedom. 


A chi-square distribution never takes negative values. 


The degrees of freedom for a chi-square test are determined by the 
sample size. 


- P(y2>10)” (x? > 10) i, greater when df=k+1df = & + 1 than when 


df=kdf = k, 


The area under a chi-square density curve is always equal to 1. 


Recycle and Review 


23. Video games (1.1) To determine if there is a relationship between age 


computer, TV, game console, or portable device like a cell phone. 


and playing video games, the Pew Research Center asked randomly 
selected adults for their age and if they “ever play video games on a 
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Here is a two-way table summarizing the results of the study: 


Age group 
18-29 | 30-49 | 50-64 65+ Total 
Play Yes 887 650 3033 
video No 429 872 985 3126 
games 


Total 1316 2089 1635 1119 6159 


a. Construct a segmented bar graph to display the relationship 
between age group and response to the question about video 
games. 


b. Describe the association shown in the segmented bar graph in part 


(a). 


Exercises 24—26 refer to the following setting. Do students who read 
more books for pleasure tend to earn higher grades in English? The 
boxplots show data from a simple random sample of 79 students at a 
large high school. Students were classified as light readers if they 
read fewer than 3 books for pleasure per year. Otherwise, they were 
classified as heavy readers. Each student’s average English grade for 
the previous two marking periods was converted to a GPA scale, 
where A=4.0, A-=3.7, B+=3.34 = 4.0, A— = 3.7, B+ = 3.3 and so 
on. 


Type of reader 
Heavy 


Light 


2.4 2.6 2.8 3.0 3.2 3.4 3.6 38 4.0 
Average English grade 


24. Reading and grades (1.3) Write a few sentences comparing the 
distributions of average English grade for light and heavy readers. 


25. Reading and grades (10.2) Summary statistics for the two groups 
from Minitab are provided. 
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Type of n Mean StDev SE Mean 


reader 
Heavy 47 3.640 0.324 0.047 
Light 32 3.356 0.380 0.067 


Explain why it is acceptable to use two-sample t procedures in this 
setting. 


Construct and interpret a 95% confidence interval for the 
difference in the mean English grade for light and heavy readers. 


Does the interval in part (b) provide convincing evidence that 
reading more causes a difference in students’ English grades? 
Justify your answer. 


26. Reading and grades (3.2) The scatterplot shows the number of 
books read and the English grade for all 79 students in the study. The 
least-squares regression line y\=3.42+0.024xY = 3.42 + 0.024% has 
been added to the graph. 


a. 


b. 


Interpret the slope and y intercept. 


The student who reported reading 17 books for pleasure had an 
English GPA of 2.85. Calculate and interpret this student’s 
residual. 


For this linear model, r2=0.083r? = 0.083 Interpret this value. 


Average English grade 
N 


0 2 4 6 8 10 12 14 16 18 20 2 
Number of books read 
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SECTION 11.2 Inference for Two-Way Tables 


LEARNING TARGETS By the end of the section, you should be able to: 


e State appropriate hypotheses and compute the expected counts and 
chi-square test statistic for a chi-square test based on data in a two- 
way table. 


State and check the Random, 10%, and Large Counts conditions for 
a chi-square test based on data in a two-way table. 


Calculate the degrees of freedom and P-value for a chi-square test 
based on data in a two-way table. 


Perform a chi-square test for homogeneity. 


Perform a chi-square test for independence. 


Choose the appropriate chi-square test in a given setting. 


‘The two-sample z procedures of Chapter 10 allow us to compare the 
proportions of successes in two populations or for two treatments. What if 
we want to compare more than two samples or groups? More generally, 
what if we want to compare the distributions of a single categorical 
variable across several populations or treatments? We rely on a new 
significance test, called a chi-square test for homogeneity. 

The test for homogeneity starts by presenting the data in a two-way 
table. However, two-way tables have other uses than comparing 
distributions of a single categorical variable. As we saw in Section 1.1, 
they can also be used to summarize relationships between two categorical 
variables. To determine if there is convincing evidence of an association 
between two categorical variables, we perform a chi-square test for 
independence. 


Tests for Homogeneity: Stating Hypotheses 


Does background music influence what customers buy? One experiment in 
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a European restaurant compared three randomly assigned treatments: no 
music, French accordion music, and Italian string music. Under each 
condition, the researchers recorded the number of customers who ordered 
French, Italian, and other entrées. The null hypothesis in this example is 


H0:/40 : There is no difference in the true distributions of entrées 
ordered at this restaurant when no music, French accordion music, 
or Italian string music is played. 


It would also be correct to state the null hypothesis as Ho:4 0 * The distribution of a 


categorical variable is the same for each of several populations or treatments. We 
prefer the “no difference” wording because it’s more consistent with the language we 


used in the significance tests of Chapter 10. 


In general, the null hypothesis in a chi-square test for homogeneity says 
that there is no difference in the true distribution of a categorical variable 
in the populations of interest or for the treatments in an experiment. 

The alternative hypothesis says that there is a difference in the 
distributions but does not specify the nature of that difference. In the 
restaurant example, the alternative hypothesis is 


Ha:/4a : There is a difference in the true distributions of entrées 
ordered at this restaurant when no music, French accordion music, 
or Italian string music is played. 


oOo The alternative hypothesis does not state that each distribution 
is different from each of the others. Instead, the alternative hypothesis 
will be true even if just one of the true distributions is different from the 
others. Consequently, any difference among the three observed 
distributions of entrées ordered is evidence against the null hypothesis and 
for the alternative hypothesis. 

So how did the experiment turn out? The two-way table summarizes 
the data and Figure 11.5 shows the conditional relative frequencies of 
entrée ordered for each of the three treatments. 


Type of background music 
None French Italian | Total 
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Entrée ordered 


French 
Italian 
Other 
Total 
Music = None Music = French Music = Italian 
50 4 50 4 50 4 
40 + 40 4 40 
r=] :=| = 
2 30-7 9 307 2 30 
= z z 
& 20-4 2 2- 2 20 
10 4 | 10 4 10 
0 . 0 + SEIETD. 0 
French Italian Other French Italian Other French Italian Other 
Entrée Entrée Entrée 


FIGURE 11.5 Relative frequency bar graphs comparing the distributions of 
entrées ordered for different music conditions. 


The type of entrée that customers order seems to differ considerably 
across the three music treatments. Orders of Italian entrées are very low 
(1.3%) when French music is playing, but are higher when Italian music 
(22.6%) or no music (13.1%) is playing. French entrées seem popular in 
this restaurant, as they are ordered often under all music conditions—but 
notably more often when French music is playing. For all three music 
treatments, the percent of Other entrées ordered was similar. 

Do the differences in these distributions provide convincing evidence 
that background music affects customer behavior at this restaurant? Or is it 
plausible that the background music has no effect on customer behavior 
and that these differences are due to the chance involved in the random 
assignment of treatments? To decide, we have to know how likely it is to 
get differences this big or bigger when the null hypothesis is true. In other 
words, we need a P-value! 

With only the methods we already know, we might start by comparing 
the proportions of French entrées ordered when no music 
(p\=30/84=0.357)(P = 30/84 = 0.357) and when French accordion music 
are played (p\=39/75=0.52)(P = 39/75 = 0.52) using a two-sample z test. 
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a Type I error in at least one of the tests. 

Because of the increased probability of a false positive, it’s cheating to 
pick out one large difference from the two-way table and then perform a 
significance test as if it were the only comparison we had in mind. 
Statisticians even have a name for this unethical practice: P-hacking. 

For example, a test comparing the proportions of French entrées 
ordered under the no music and French accordion music treatments shows 
that the difference is statistically significant (z=2.06, P=0.039) 

(z = 2.06, P = 0.039) However, the proportions of Italian entrées ordered 
for the no music and Italian string music treatments do not differ 
significantly (z=1.61, P=0.107)(2 = 1.61, P = 0.107) Reporting only the 
results of the first test wouldn’t be telling the whole story. 

The problem of how to do many comparisons at once without 
increasing the overall probability of a Type I error is common in statistics. 
Statistical methods for dealing with multiple comparisons usually have 
two steps: 


|. Perform an overall test to see if there is convincing evidence of any 
differences among the parameters that we want to compare. 


’, When the overall test shows there is convincing evidence of a 
difference, perform a detailed follow-up analysis to decide which of the 
parameters differ and to estimate how large the differences are. 


When we want to compare the distribution of a categorical variable for 
several populations or treatments, the overall test uses the familiar chi- 
Square test statistic. 


Tests for Homogeneity: Expected Counts and the Chi- 
Square Test Statistic 


A chi-square test for homogeneity begins with the hypotheses 


Hg: There is no difference in the distribution of a categorical variable 


for several populations or treatments. 


H,: There is a difference in the distribution of a categorical variable 
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H,: There is a difference in the distribution of a categorical variable 


for several populations or treatments. 


To perform the test, we compare the observed counts in a two-way table 
with the counts we would expect if HOM were true. Calculating the 
expected counts isn’t difficult, but we calculate them slightly differently 
than in the chi-square test for goodness of fit. 

The null hypothesis in the restaurant experiment is that there’s no 
difference in the distributions of entrées ordered when no music, French 
accordion music, or Italian string music is played. To find the expected 
counts, we start by assuming that Ho is true. We can see from the two-way 


table that 99 of the 243 entrées ordered during the study were French. 


Type of background music 


Entrée ordered 


Ingram Publishing/Alamy 


If the specific type of music that’s playing has no effect on entrée 
orders, the proportion of French entrées ordered under each music 
condition should be 99/243=0.407499/243 = 0.4074 Because there were 


1601 


99 
84: 99243=84(0.4074)=34.22 243 


of those entrées to be French, on average. The expected counts of French 
entrées ordered under the other two music conditions can be found in a 


= 84(0.4074) = 34.22 


similar way: 
French music: 75(0.4074)=30.56 French music: 75 (0.4074) = 30.56 
Italian music: 84(0.4074)=34,22Italian music: 84 (0.4074) = 34.22 


AP® EXAM TIP 


As with chi-square tests for goodness of fit, the expected counts should 
not be rounded to the nearest whole number. While an observed count 
of entrées ordered must be a whole number, an expected count need not 
be a whole number. The expected count gives the average number of 
entrées ordered if HO/0 is true and the random assignment process is 
repeated many times. 


We repeat the process to find the expected counts for the other two 
types of entrées. The overall proportion of Italian entrées ordered during 
the study was 31/243=0.127631/243 = 0.1276 so the expected counts of 
Italian entrées ordered under each treatment are 

No music: 84(0.1276)=10.72French music: 75(0.1276)=9.57 Italian music: 
No music: 84(0.1276) = 10.72 
French music: 75 (0.1276) = 9.57 
Italian music: 84(0.1276) = 10.72 


The overall proportion of Other entrées ordered during the experiment was 
113/243=0.465118/243 = 0.465 go the expected counts of Other entrées 
ordered for each treatment are 
No music: 84(0.465)=39.06 French music: 75(0.465)=34.88 Italian music: 
No music: 84(0.465) = 39.06 
French music: 75 (0.465) = 34.88 
Italian music: 84 (0.465) = 39.06 
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The following table summarizes the expected counts for all three 
treatments. Note that the values for no music and Italian music are the 
same because 84 total entrées were ordered under each condition, and we 
expect the distributions of entrée choice to be the same. 


Expected counts 
Type of background music 
None French Italian | Total 


Total 84 79 84 243 


We can check our work by adding the expected counts to obtain the row 
and column totals, as in the table. These should be the same as those in the 
table of observed counts except for small roundoff errors, such as 75.01 
rather than 75 for the total number of entrées ordered when French music 
was playing. 

Let’s take a look at the two-way table from the restaurant study one 
more time. In this context, we found the expected count of French entrées 
ordered when no music was playing as follows: 


99 
84 -——= 34.22 
84:-99243=34.22 243 


Observed counts 


Type of background music 


Entrée ordered 


We’ve marked in the table the three numbers used in this calculation. 
These values are the row total for French entrées ordered, the column total 
for entrées ordered when no music was playing, and the table total of 
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We’ve marked in the table the three numbers used in this calculation. 
These values are the row total for French entrées ordered, the column total 
for entrées ordered when no music was playing, and the table total of 
entrées ordered during the experiment. We can rewrite the original 
calculation as 


84.99 9984 49, 
84-99243=99 84243=34.22 243 243 ) 
This suggests a more general formula for the expected count in any cell of 


a two-way table. 


CALCULATING EXPECTED COUNTS FOR A CHI- 
SQUARE TEST BASED ON DATA IN A ‘TWO-WAY 
TABLE 


When H0/40 is true, the expected count in any cell of a two-way table 


is 
expected count=row total - column totaltable total 
row total - column total 


expected count = 
*P table total 


Just as we did with the chi-square test for goodness of fit, we compare 
the observed counts with the expected counts using the chi-square test 
Statistic 

x2=> (Observed count — Expected count)2Expected count 
. (Observed count — Expected count)? 
[—— 


Expected count 
This time, the sum is over all cells (not including the totals!) in the two- 
way table. 
For French entrées with no music, the observed count is 30 orders and 
the expected count is 34.22. The contribution to the y2x” test statistic for 
this cell is 


(Observed count — Expected count)2Expected count= 
(30-34.22)234.22=0.52 
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¥2=(30-34.22)234.22+(39-30.56)230.56+-+ 
(35-39.06)239.06=0.52+2.33+'+0.42=18.28 
(30—34.22)2 (3930.56)? (35—39.06)2 
x 34.22 ~—SC«0.5 6 - 39.06 
= 0.52 + 2.33 +--+ + 0.42 = 18.28 


AP® EXAM TIP 


In the “Do” step, you aren’t required to show every term in the chi- 


square test statistic. Writing the first few terms of the sum followed by 
“...” is considered as “showing work.” We suggest that you do this and 
then let your calculator tackle the computations. 


Here is an example to practice what you have learned so far. 


EXAMPLE | Would you vote for a female president? 
Ne 
Hypotheses, expected counts, and the chi-square 
test statistic 


Bloomberg/Getty Images 


PROBLEM: For a class project, Abby and Mia wanted to know if the 
gender of an interviewer could affect the responses to a survey question. 
The subjects in their experiment were 100 males from their school. Half 
of the males were randomly assigned to be asked, “Would you vote for a 
female president?” by a female interviewer. The other half of the males 
were asked the same question by a male interviewer.2 The table shows 
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the results. 
a. State the appropriate null and alternative hypotheses. 


b. Show the calculation for the expected count in the Male/Yes cell. 
Then provide a complete table of expected counts. 


c. Calculate the value of the chi-square test statistic. 


Gender of interviewer 


Total 

Response to Yes 69 
question No 11 
Maybe 20 

Total 100 


SOLUTION: 


a. H0:/0 : There is no difference in the true distributions of response 
to this question when asked by a male interviewer and when asked 


by a female interviewer for subjects like these. 


Ha:e : There is a difference in the true distributions of response to 
this question when asked by a male interviewer and when asked 


by a female interviewer for subjects like these. 


b. The expected count for the Male/Yes cell is 69-50100 = 34.5 


69:50 _ a, 
on ene The rest of the expected counts are shown in the table: 


Gender of interviewer 


Male Female Total 

Response to Yes 69 
question No 14 
Maybe 20 

Total 50 50 100 
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expected count=row total - column totaltable total 


expected count = row total - column total 
table total 


c. x2 = (30 — 34.5)234.5+(39 — 34.5)234.5+-= 4.25 
9 _ (30-345)? | (39—34.5)? 


4.25 


x2=)\(Observed count — Expected count)2Expected count 


‘ (Observed count — Expected count)” 
x2 = Sy Phverved count — Eapected cont 


Expected count 


FOR PRACTICE, TRY EXERCISE 29 


CHECK YOUR UNDERSTANDING 


A 


‘ 


y Separate random samples of children from the United Kingdom and 
the United States who completed a survey in a recent year were 
selected. For each student, we recorded the superpower he or she would 
most like to have: the ability to fly, ability to freeze time, invisibility, 
super strength, or telepathy (ability to read minds). Is there convincing 
evidence that the distributions of superpower preference are different for 
survey takers in the two countries? The data are summarized in the two- 
way table.® 


Country 
U.K. URS. Total 
Fly 99 
Freeze time 96 


Superpower Freeze time 52 44 96 
preference Invisibility 67 
Super 43 
strength 
Telepathy 110 
Total 200 215 415 


1. State the appropriate null and alternative hypotheses. 
2. Show the calculation for the expected count in the U.K./Fly cell. Then 
provide a complete table of expected counts. 


3. Calculate the value of the chi-square test statistic. 


Tests for Homogeneity: Conditions and P-values 


Like every other significance test, there are conditions that must be met to 
justify our calculations and conclusions. 


CONDITIONS FOR PERFORMING A CHI-SQUARE 
TEST FOR GOODNESS OF FIT 


e Random: The data come from a random sample from the population 
of interest. 
o 10%: When sampling without replacement, n<0.10 Nn < 0.10 N 
for each sample. 


e Large Counts: All expected counts are at least 5. 


RF Pictures/Getty Images 
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RF Pictures/Getty Images 


We can confirm that the conditions are met in the restaurant 
experiment. 


e Random: The three treatments were assigned at random. 

e Large Counts: All expected counts are at least 5 (see table below). 

We don’t have to check the 10% condition because the researchers were 
not sampling without replacement from some population of interest. They 


performed an experiment using customers who happened to be in the 
restaurant at the time. 


Observed counts 


Type of background music 


. French 
Entrée ordered 
Italian 
Other 


Total 


Expected counts 
Type of background music 
None French Italian | Total 


Total 84 75 84 243 


Entrée ordered 


As in the test for goodness of fit, you should think of the chi-square 
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rows and 3 columns in the two-way table (not including the totals), 
df=(3-1)x(3-1)=44£ = (3 — 1) x (3 a 1) =4 


We can find the P-value using Table C or technology. 


P 
df 0025 001 
4 16.42 18.47 


e Using Table C: Look at the df=4df = 4 row in Table C. The calculated 


value y2=18.28X" = 18.28 lies between the critical values 16.42 and 
18.47. The corresponding P-value is between 0.001 and 0.0025. 


Chi-square distribution 
with df = 4 


x2 = 18.28 


e Using technology: The command y2Xx” cdf(lower:18.28, upper:10000, 
df:4) gives 0.0011. 


Because the P-value of 0.0011<a=0.050.0011 < a = 0.05 we reject HO 
Ho. There is convincing evidence that there is a difference in the true 
distributions of entrées ordered at this restaurant when no music, French 
accordion music, or Italian string music is played. 


EXAMPLE | Does the gender of an interviewer matter? 
No 


Conditions, P-value, and conclusion 
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sturti/Getty Images 


PROBLEM: In the preceding example, you read about an experiment 
to determine if the gender of an interviewer affects responses to the 
question “Would you vote for a female president?” Here are tables 
showing the observed and expected counts: 


Observed counts 


Gender of interviewer 


Total 
Response to Yes 69 
question No 11 
Maybe 20 
Total 100 
Expected counts 
Gender of interviewer 
Total 
Response to Yes 69 
question No 11 
Maybe 20 
Total 100 


Earlier, we calculated y2=4,25X" = 4.25 


a. Verify that the conditions for inference are met. 


b. Use Table C to find the P-value. Then use your calculator’s y2X” cdf 
command. 


c. Interpret the P-value from the calculator. 


d. What conclusion would you draw? 
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SOLUTION: 


a. Random: Treatments were randomly assigned. / 
Large Counts: All expected counts>5Counts = 95 (see table of 


expected counts). / 


We don’t check the 10% condition because Abby and Mia didn’t 


randomly select subjects from some population. 


b. df=(3-12-1)=2¢ = (8-1) (2-1) =2 
Using Table C: The P-value is between 0.10 and 0.15. 


Using technology: y2 cdf(lower:4.25, upper:10000, df:2) = 0.119 
x” cdf(lower : 4.25, upper : 10000, df: 2) = 0.119 


c. Assuming that the gender of the interviewer doesn’t affect responses 
to this question, there is a 0.119 probability of observing differences 
in the distributions of responses as large as or larger than those in 


this study by chance alone. 


d. Because the P-value of 0.119>a = 0.05,9-119 > a = 0.05, we fail to 
reject HOMo There is not convincing evidence of a difference in the 
true distributions of response to this question when asked by a male 
interviewer and when asked by a female interviewer for subjects like 


these. 


Chi-square distribution 
with df =2 


FOR PRACTICE, TRY EXERCISE 31 


Calculating the expected counts and then the chi-square test statistic by 
hand is a bit time-consuming. As usual, technology saves time and gets the 
arithmetic right. 


29. Technology Corner |PERFORMING CHI-SQUARE 
TESTS FOR TWO-WAY TABLES (>) 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 


You can use the TI-83/84 to perform calculations for a chi-square test 
for homogeneity. We’ll use the data from the restaurant study to 
illustrate the process. 


1. Enter the observed counts in matrix [A]. 
e Press (MATRIX), arrow to EDIT, and choose A. 


NAMES MATH [Sep@s 
MCA 


3 
4 
5 
6:CFI 
7 
8 
9 


e Enter the dimensions of the matrix: 3x33 x 3 


e Enter the observed counts from the two-way table in the same 
locations in the matrix. 

MATRIXCA] 3 x3 

( EC 39 30 


c 43 35 35 | 


(Ali.D= 30 
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MATRIXCA] 3 x3 

( EC 39 30 

{ 11 1 19 ] 
c 43 35 35 1 
[AL.= 3 


2. Press [STAT] arrow to TESTS, and choose y2x” -Test. Adjust your 
settings as shown. 


ORMAL FLOAT AUTO REAL RADIAN CL fl 


ix2-Test| 
Observed: [A] 
Expected: [B] 
Color: EUs 


Calculate Draw 


Note: You do not have to enter the expected counts in matrix [B]. 


Once you have run the test, the expected counts will be stored in 
matrix [B]. 


3. Choose “Calculate” or “Draw” to carry out the test. If you choose 
“Calculate,” you should get the test statistic, P-value, and df shown 
here. If you specify “Draw,” the chi-square distribution with 4 
degrees of freedom will be drawn, the area in the tail will be shaded, 
and the P-value will be displayed. 


- 
X2=18. 27921151 / 
P=, 0010882802 | 
df=4 | 


%2=18.2792 p=.6011 


4. To see the expected counts, Press (MATRIX), arrow to 
EDIT, and choose [B]. 
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You can use your calculator to carry out the mechanics of a significance 
test on the AP® Statistics exam—but there’s a risk involved. If you just 
give the calculator answer without showing work, and one or more of 
your entries is incorrect, you will likely get no credit for the “Do” step. 
We recommend writing out the first few terms of the chi-square 
calculation followed by “...”. This approach may help you earn partial 
credit if you enter a number incorrectly. Be sure to name the procedure 


y2x” test for homogeneity) and to report the test statistic (y2=18.279) 
(x? = 18.279) degrees of freedom (df=4) (df = 4) and P-value (0.0011). 


CHECK YOUR UNDERSTANDING 


( 


y In the preceding Check Your Understanding (page 732), we 
presented data about superpower preferences for random samples of 
children from the United Kingdom and the United States. Here are the 
data once again: 


Total 
Fly 99 
Freeze time 96 
Superpower —— 
preference Invisibility 67 
Super 43 
strength 
Telepathy 110 
Total 415 


1. Verify that the conditions for inference are met. 


2. Use Table C to find the P-value. Then use your calculator’s y2Xx” cdf 
command. 


3. Interpret the P-value from the calculator. 
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4. What conclusion would you draw? 


Putting It All Together: The Chi-Square Test for 
Homogeneity 


In Section 11.1, we used a chi-square test for goodness of fit to test a 
hypothesized model for the distribution of a categorical variable. When we 
want to compare the distribution of a categorical variable in several 
populations or for several treatments, we use a chi-square test for 
homogeneity. 


This test is also known as a chi-square test for homogeneity of proportions. We 


prefer the simpler name. 


CHI-SQUARE TEST FOR HOMOGENEITY 


Suppose the conditions are met. To perform a test of 


HO0:0 : There is no difference in the distribution of a categorical 
variable for several populations or treatments 


compute the chi-square test statistic 


x2=>y. ( Observed count — Expected count)2Expected count 
2 


. ( Observed count — Expected count) 
fis yp ont Bee 


Expected count 


where the sum is over all cells (not including totals) in the two-way 
table. The P-value is the area to the right of y2x” under the chi-square 
density curve with degrees of freedom = (number of rows — 1)(number 
of columns — 1). 


Let’s look at an example of a chi-square test for homogeneity from 
start to finish. As usual, we follow the four-step process when performing 
a significance test. 
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EXAMPLE | Speaking English (>) 


The chi-square test for homogeneity 


Scott Olson/Getty Images 
PROBLEM: The Pew Research Center conducts surveys about a 
variety of topics in many different countries. In one survey, it wanted to 
investigate how residents of different countries feel about the 
importance of speaking the national language. Separate random samples 
of residents of Australia, the United Kingdom, and the United States 
were asked many questions, including the following: “Some people say 
that the following things are important for being truly [survey country 
nationality]. Others say they are not important. How important do you 
think it is to be able to speak English?” The two-way table summarizes 
the responses to this question. 


Country 

Australia U.K. US: Total 

Very 2569 
important 

Opinion Somewhat 713 
about important 

speaking Not very 118 
English — important 

Not at all 63 
important 

Total 1000 1460 1003 3463 


1617 


Do these data provide convincing evidence at the a=0.05@ = 0.05 level 
that the distributions of opinion about speaking English differ for 
residents of Australia, the U.K., and the U.S.? 


SOLUTION: 


STATE: Ho:/40 : There is no difference in the true distributions of 
opinion about speaking English for residents of Australia, the U.K., and 
the U.S. 


Use the four-step process! 


Ha:/4a : There is a difference in the true distributions of opinion about 
speaking English for residents of Australia, the U.K., and the U.S. 
We'll use a = 0.05. = 0.05. 

PLAN: Chi-square test for homogeneity. 


e Random: Independent random samples of residents from the three 
countries. / 


° 10%: 1000 is<10%10% : 1000 is < 10% of all Australian residents, 
1460 is <10%1460 is < 10% of all U.K. residents, and 1003 
is<10%1003 is < 10% of all U.S. residents. / 


e Large Counts: All expected counts are >5= ® (see table below). / 


Expected counts Country 
Australia U.K. USS. Total 
Very 741.8 1083.1 744.1 2569 
important 


Opinion Somewhat 205.9 300.6 206.5 713 
about important 


speaking Not very 118 
English important 
Not at all 63 
important 
Total 1000 1460 1003 3463 


1618 


Very 741.8 1083.1 744.1 2569 


important 
OPO career ere e050 300.6 206.5 713 
about -; 
; important 
speaking 
English Not very 34.1 49.7 34.2 118 
important 
Not at all 18:2 26.6 18.2 63 
important 
Total 1000 1460 1003 3463 


DO: 


e Test Statistic: 


2 = (690 — 741.8)2741.8+(1177 — 1083.1)21083.1+- = 68.57 


a (690—741.8)* (1177—1083.1)? é 
¥° = tt: dc = 68.57 


741.8 1083.1 


e P-value:df = (4 - 1)(3 — 1) = 6P-value : df = (4—1)(3—1) =6 
Using Table C:P—value<0.0005? — value < 0.0005 
Using technology: x2cdf(lower:68.57, upper:10000, df:6)*0 

x’ cdf (lower : 68.57, upper : 10000, df : 6) ~ 0 


CONCLUDE: Because the P-value of approximately 0<a = 0.05, 

0 < a= 0.05, we reject HOM0 There is convincing evidence that there is 
a difference in the true distributions of opinion about speaking English 
for residents of Australia, the U.K., and the U.S. 


Because the observed counts differ from the expected counts, there is 


some evidence for Ha/Ja 


Chi-square distribution 
with df =6 
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enough to claim that all the expected counts are at least 5. You must 
provide clear evidence. 


What if we want to compare several proportions? Many studies involve 
comparing the proportion of successes for each of several populations or 
treatments. The two-sample z test from Chapter 10 allows us to test the 
null hypothesis HO: p1=p2/o : Pi = P2 where p1P1 and p2?P2 are the true 
proportions of successes for the two populations or treatments. The chi- 
square test for homogeneity allows us to test HO: p1=p2=...=pk 
Ho : pi = P2 = --. = Pk This null hypothesis says that there is no 
difference in the proportions of successes for the k populations or 
treatments. The alternative hypothesis is Ha:40 : At least two of the pi’s 
Pi’S are different. Many students incorrectly stateHa/o as “all the 
proportions are different.” Think about it this way: the opposite of “all the 
proportions are equal” is “some of the proportions are not equal.” 


FOLLOW-UP ANALYSIS The chi-square test for homogeneity allows 
us to compare the distribution of a categorical variable for any number of 
populations or treatments. If the test allows us to reject the null hypothesis 
of no difference, we may want to do a follow-up analysis that examines the 
differences in detail. As with the chi-square test for goodness of fit, start 
by identifying the cells that contribute the most to the chi-square statistic. 
Then describe how the observed and expected counts differ in those 
categories, noting the direction of the difference. 


Our earlier restaurant study found significant differences among the 
distributions of entrées ordered under each of the three music conditions. 
We entered the two-way table for the study into Minitab software and 
requested a chi-square test. The output appears in Figure 11.6. Minitab 
repeats the two-way table of observed counts and puts the expected count 
for each cell below the observed count, followed by the nine individual 


2 — 
components that contribute to the y2* test statistic. 
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Chi-Square Test: None, French, Italian 
Expected counts are printed below observed counts 


Chi-Square contributions are printed below expected counts 


None French Italian Tota 
French 30 39 30 99 
entrée 34.22 30.56 34.22 

0.521 2.334 0.521 
Italian Ld 1 19 31 
entrée 10:72 9.57 10.72 

0.008 7.672 6.404 
Other 43 35 35 113 
entrée 39.06 34.88 39.06 

0.397 0.000 0.422 
Total 84 a3 84 243 


Chi-Sq = 18.279, DF = 4, P-Value = 0.001 


FIGURE 11.6 Minitab output for the two-way table in the restaurant study. The 
output gives the observed counts, the expected counts, and the individual 
components of the chi-square test statistic. 


Looking at the output, we see that just two of the nine components 
contribute about 14 (almost 77%) of the total y2=18.28X° = 18.28 
Comparing the observed and expected counts in these two cells, we see 
that orders of Italian entrées are far below what we expect when French 
music is playing and far above what we expect when Italian music is 
playing. We are led to a specific conclusion: orders of Italian entrées are 
strongly affected by Italian and French music. More advanced methods 
provide tests and confidence intervals that make this follow-up analysis 
more complete. 


CHECK YOUR UNDERSTANDING 


. 


¥ 


y Canada has universal health care. The United States does not but 
often offers more elaborate treatment to patients who have access. How 
do the two systems compare in treating heart attacks? Researchers 
compared random samples of U.S. and Canadian heart attack patients. 
One key outcome was the patients’ own assessment of their quality of 
life relative to what it had been before the heart attack. Here are the data 
for the patients who survived a year: 
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Quality of Canada __— United States 


life 

Much better 79 541 
Somewhat 71 498 
better 

About the 96 779 
same 

Somewhat 50 282 
worse 

Much worse 19 65 
Total 311 2165 


Is there a significant difference between the two distributions of quality- 
of-life ratings? Carry out an appropriate test at the a=0.01a = 0.01 
level. 


Relationships Between Two Categorical Variables 


Two-way tables can summarize data from different types of studies. The 
restaurant experiment compared entrées ordered for three music 
treatments. The observational study about speaking English compared 
independent random samples from three different populations. In both 
cases, we are comparing the distribution of a categorical variable for 
several populations or treatments. We use the chi-square test for 
homogeneity to perform inference in such settings. 

Another common situation that leads to a two-way table is when a 
single random sample of individuals is chosen from a single population 
and then classified based on two categorical variables. In that case, our 
goal is to analyze the relationship between the variables. 
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Francesco Carta fotografo/Getty Images 


Are people who are prone to sudden anger more likely to develop heart 
disease? An observational study followed a random sample of 8474 people 
with normal blood pressure for about four years.2 All the individuals were 
free of heart disease at the beginning of the study. Each person took the 
Spielberger Trait Anger Scale test, which measures how prone a person is 
to sudden anger. Researchers also recorded whether each individual 
developed coronary heart disease (CHD). This includes people who had 
heart attacks, as well as those who needed medical treatment for heart 
disease. Here is a two-way table that summarizes the data: 


Anger level 


Low Moderate High Total 
Yes 1 
CHD status us 
No 8284 
Total 8474 


The bar graph in Figure 11.7 shows the percent of people in each of the 
three anger categories who developed CHD. There is a clear trend: as the 
anger score increases, so does the percent who suffer heart disease. A 
much higher percent of people in the high anger category developed CHD 
(4.27%) than in the moderate (2.33%) and low (1.70%) anger categories. 
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Percent with CHD 
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Anger level 


FIGURE 11.7 Bar graph comparing the percents of people in each anger category 
who developed coronary heart disease (CHD). 


Do these data provide convincing evidence of an association between 
the variables in the larger population? Or is it plausible that there is no 
association between the variables in the population and that we observed 
an association in the sample by chance alone? To answer that question, we 
rely on a new significance test. 


Tests for Independence: Stating Hypotheses 


When we gather data from a single random sample and measure two 
categorical variables, we are often interested in whether the sample data 
provide convincing evidence that the variables have an association in the 
population. That is, does knowing the value of one variable help predict 
the value of the other variable for individuals in the population? To 
determine if evidence from the sample is convincing, we perform a chi- 
square test for independence. 


In this test, our null hypothesis is that there is no association between 
the two categorical variables in the population of interest. The alternative 
hypothesis is that there is an association between the variables. For the 
observational study of anger level and coronary heart disease, we want to 
test the hypotheses 


H0:4o : There is no association between anger level and heart-disease 
status in the population of people with normal blood pressure. 


Ha:4o : There is an association between anger level and heart-disease 
status in the population of people with normal blood pressure. 
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We could substitute the word dependent in place of not independent in the alternative 


hypothesis. We’ll avoid this practice, however, because saying that two variables are 
dependent sounds too much like saying that changes in one variable cause changes in 


the other. 


No association between two variables means that knowing the value of 
one variable does not help us predict the value of the other. That is, the 
variables are independent. An equivalent way to state the hypotheses is 


HO0:/40 : Anger and heart-disease status are independent in the 
population of people with normal blood pressure. 


Ha:/4a * Anger and heart-disease status are not independent in the 
population of people with normal blood pressure. 


Tests for Independence: Expected Counts 


As with the two previous types of chi-square tests, we begin by comparing 
the observed counts in a two-way table with the expected counts if HO 
is true. In the anger study, the null hypothesis is that there is no association 
between anger level and heart-disease status in the population of interest. 
If we assume that HO/Z0 is true, then anger level and CHD status are 
independent. We can find the expected cell counts in the two-way table 
using the definition of independent events from Chapter 5: P(A|B)=P(A) 
P(A\B) = P(A) The chance process here is randomly selecting a person 
and recording his or her anger level and CHD status. 


Anger level 


Low Moderate High Total 
Yes 
CHD status a 
No 8284 
Total 3110 4731 633 8474 


Let’s start by considering the events “Yes” and “Low anger.” We see 
from the two-way table that 190 of the 8474 people in the study had CHD. 
If we imagine choosing one of these people at random, P(Yes)=190/8474 
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P (Yes) = 190/8474 t¢ the null hypothesis is true and anger level and CHD 


status are independent, knowing that the selected individual is low anger 
does not change the probability that this person develops CHD. That is to 
say, P(Yes|Low anger)=P(Yes)=190/8474=0.02242 


P (Yes|Low anger) = P (Yes) = 190/8474 = 0.02242 
Of the 3110 low-anger people in the study, we’d expect 
190 


3110 -—— = 3110 (0.02242) = 69.73 
3110 - 1908474=3110(0.02242)=69.73 8474 


to get CHD. You can see that the general formula we developed earlier for 
a test for homogeneity applies in this situation also: 

expected count=row total - column totaltable total=190 -31108474=69.73 
row total- column total 190-3110 | 69.73 


expected count = eee 
table total 8474 


You can complete the table of expected counts by using the formula 
for each cell or by using the formula for some cells and subtracting to find 
the remaining cells. Here is the completed table of expected counts: 


Expected counts 
Anger level 
Low Moderate High Total 


CHD status 190 
» 8264 


Total 3110 4731 633 8474 


Tests for Independence: Conditions and Calculations 


The 10% and Large Counts conditions for the chi-square test for 
independence are the same as for the test for homogeneity. There is a 
slight difference in the Random condition for the two tests: a test for 
independence uses data from a single random sample, but a test for 
homogeneity uses data from two or more independent random samples or 
from two or more groups in a randomized experiment. 
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CONDITIONS FOR PERFORMING A CHI-SQUARE 
TEST FOR INDEPENDENCE 


e Random: The data come from a random sample from the population 
of interest. 


o 10%: When sampling without replacement, n<0.10 Nn < 0.10 N 


e Large Counts: All expected counts are at least 5. 


We can confirm that the conditions are met in the anger and heart 
disease study. 


e Random: The data came from a random sample of 8474 people with 
normal blood pressure. 


© 10%: It is reasonable to assume that 8474<10%8474 < 10% of all 
people with normal blood pressure. 


e Large Counts: All the expected counts are at least 5 (see table on 
previous page). 
- ae 2 a 
When the conditions are met, we use the familiar y2X test statistic to 
measure the strength of the association between the variables in the 
sample. P-values for this test come from a chi-square distribution with 


df=(number of rows—1)(number of columns-1) 
df = (number of rows — 1) x (number of columns — 1) 


For the anger and heart disease study, 
e Test statistic: y2=(53-69.73)269.73+(110-106.08)2106.08+:=16.077 


_9 _ (53-69.73)? (110-106.08)” a 
X= —to73 — + Foss + °° = 16.077 


° Using technology: With df=(2-1)(3-1)=24f = (2— 1) (8-1) =2) p 
—value=0.00032P — value = 0.00032 


Because the P-value of 0.00032<a=0.050.00032 < a = 0.05 we reject 
H0#0. We have convincing evidence of an association between anger 
level and heart-disease status in the population of people with normal 
blood pressure. 


A follow-up analysis reveals that two cells contribute most of the chi- 
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square test statistic: Low anger, Yes (4.014) and High anger, Yes (11.564). 
A much smaller number of low-anger people developed CHD than 
expected. And a much larger number of high- anger people got CHD than 
expected. 


Chi-square distribution 
with df = 2 


0 2 - 6 8 10 12 14 16\ 18 20 
3 = 16.077 

Can we conclude that proneness to anger causes heart disease? No. 
The anger and heart-disease study is an observational study, not an 
experiment. It isn’t surprising that some other variables are confounded 
with anger level. For example, people prone to anger are more likely than 
others to be men who drink and smoke. We don’t know whether the 
increased rate of heart disease among those with higher anger levels in the 
study is due to their anger or perhaps to their drinking and smoking or 
maybe even to gender. 


Putting It All Together: The Chi-Square Test for 
Independence 


When we want to test for an association between two categorical variables 
in a population, we use a chi-square test for independence. Here are the 
key details. 


The chi-square test for independence is also known as the chi-square test for 


association. 


CHI-SQUARE TEST FOR INDEPENDENCE 


Suppose the conditions are met. To perform a test of 
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HO0:0 : There is no association between two categorical variables in 
the population of interest 
compute the chi-square test statistic 
Xx2=)»: (Observed count — Expected count)2Expected count 
‘ (Observed count — Expected count)? 
ase 3 Expected count 


where the sum is over all cells in the two-way table (not including the 
totals). The P-value is the area to the right of yx” under the chi-square 
density curve with degrees of freedom = (number of rows — 1)(number 
of columns — 1). 


We’re now ready to perform a complete test for independence. 


EXAMPLE | Snowmobiles in Yellowstone » 


Chi-square test for independence 


korolOK/Depositphotos 


PROBLEM: In Chapter 1, you read about a random sample of winter 
visitors to Yellowstone National Park. Each of the 1526 visitors in the 


sample was asked two questions: 
1. Do you belong to an environmental club (like the Sierra Club)? 


2. What is your experience with a snowmobile: own, rent, or never 
used? 
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The two-way table summarizes the results. 


Environmental club status 


Total 

Snowmobile Never used 657 
experience Renter 574 
Total 1526 


Do the data provide convincing evidence of an association between 
environmental club status and type of snowmobile use in the population 
of winter visitors to Yellowstone National Park? 


SOLUTION: 


Use the four-step process! 


STATE: 

H0:4o : There is no association between environmental club status and 
type of snowmobile use in the population of winter visitors to 
Yellowstone. 


You could also say Ho:o : Environmental club status and type of 


snowmobile use are independent in the population of winter visitors to 
Yellowstone. 


Ha:a : There is an association between environmental club status and 
type of snowmobile use in the population of winter visitors to 
Yellowstone. 


We'll use a = 0.05. = 0.05. 


If no significance level is provided, use a=0.05a@ = 0.05 


PLAN: Chi-square test for independence. 


e Random: Random sample of 1526 winter visitors to Yellowstone. / 
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We know this is a test for independence and not homogeneity because 


there was only one random sample. 


o 10%: It is reasonable to assume that 1526<10%1526 < 10% of all 
winter visitors to Yellowstone. / 


e Large Counts: All the expected counts are at least 5 (see table below). 


J 


Expected counts 
Environmental club status 


Not amember| Member Total 

Snowmobile Never used 657 
experience Renter 574 
Owner 295 

Total PPA 305 1526 


Because the observed counts differ from the expected counts, there is 


some evidence for Hg. 


DO: 
e Test statistic: y2 = (445 — 525.7)2525.7+(212 — 131.3)2131.3+- = 116.6 
9 _ (445-525.7)? (212-1313)? 
_ 525.7 131.3 
= 116.6 


e P-value: df =(3-1)(2-1)= df = (3—1)(2—1) =2 

Using Table C:P-value<0.0005P — value < 0.0005 

Using technology: The calculator’s y2x” -Test gives x2 = 116.6 
x” = 116.6 and P-value = 4.82x10-26P — value = 4.82 x 10-*° 
using df = 24f = 2, 

CONCLUDE: Because the P-value of approximately 0<a = 0.05 

0 < a= 0.05 we reject HOM0 We have convincing evidence of an 
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association between environmental club status and type of snowmobile 
use in the population of winter visitors to Yellowstone National Park. 


JORMAL FLOAT AUTO REAL RADIAN MP 0 


x2-Test 
x2=116.5883 P=8 


FOR PRACTICE, TRY EXERCISE 43 


AP® EXAM TIP 


When the P-value is very small, the calculator will report it using 
scientific notation. Remember that P-values are probabilities and must 
be between 0 and 1. If your calculator reports the P-value with a 
number that appears to be greater than 1, look to the right, and you will 
see that the P-value is being expressed in scientific notation. If you 
claim that the P-value is 4.82, you will certainly lose credit. 


CHECK YOUR UNDERSTANDING 


. 


y A random sample of 1200 U.S. college students was asked, “What is 
your perception of your own body? Do you feel that you are overweight, 
underweight, or about right?” The two-way table summarizes the data 
on perceived body image by gender.® 


Gender 
Total 
About right 855 
Body image Overweight 235 
Underweight 110 


Total | 760 | 440 | 1200 


Do these data provide convincing evidence at the a=0.01@ = 0.01 level 
of an association between gender and perceived body image in the 
population of U.S. college students? 


Using Chi-Square Tests Wisely 


Both the chi-square test for homogeneity and the chi-square test for 
independence start with a two-way table of observed counts. They even 
calculate the test statistic, degrees of freedom, and P-value in the same 
way. The questions that these two tests answer are different, however. A 
chi-square test for homogeneity tests whether the distribution of a 
categorical variable is the same for each of several populations or 
treatments. The chi-square test for independence tests whether two 
categorical variables are associated in some population of interest. 


One way to help you distinguish these two tests is to consider the two 
sets of totals in the two-way table. In tests for homogeneity, one set of 
totals is known by the researchers before the data are collected. For 
example, in the experiment to determine if the gender of the interviewer 
affects the response to a question about a female president, Abby and Mia 
decided in advance to randomly assign 50 subjects to each treatment. 


Gender of interviewer 


Total 

Response to Yes 4 
question No ? 
Maybe ? 

Total 100 


Likewise, in the observational study comparing opinions about 
speaking English, researchers knew in advance that they would survey 
1000 people from Australia, 1460 from the U.K., and 1003 from the U.S. 
In both cases, only one set of totals was left to vary. This is consistent with 
the design of the study: select independent random samples (or randomly 
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assign treatments) and compare the distribution of a single categorical 
variable. 


However, in a test for independence, neither set of totals is known in 
advance. In the observational study about snowmobile use in Yellowstone, 
the researchers didn’t know anything about either variable ahead of time— 
they only knew that they would survey 1526 visitors. This is consistent 
with the design of the study: select one sample and record the values of 
two variables for each member. 


Environmental club status 


Total 
Never used ? 
Snowmobile manic , 
experience 
Owner ? 
Total 1526 


Unfortunately, it is quite common to see questions about association 
when a test for homogeneity applies; when a test for independence applies, 
questions about differences between proportions or the distribution of a 
variable are quite common. Many people avoid the distinction altogether 
and pose questions about the “relationship” between two variables. 


Instead of focusing on the question asked, the best plan is to consider 
how the data were produced. If the data come from two or more 
independent random samples or treatment groups in a randomized 
experiment, then do a chi-square test for homogeneity. If the data come 
from a single random sample, with the individuals classified according to 
two categorical variables, use a chi-square test for independence. 


EXAMPLE | Scary movies and fear > 
Choosing the right type of chi-square test 
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Alamy 


PROBLEM: Are men and women equally likely to suffer lingering fear 
from watching scary movies as children? Researchers asked a random 
sample of 117 college students to write narrative accounts of their 
exposure to scary movies before the age of 13. More than one-fourth of 
the students said that some of the fright symptoms are still present when 
they are awake.” The following table breaks down these results by 
gender. 


Gender 
Total 
Fright Yes 36 
symptoms? No 81 
Total 117 


Assume that the conditions for performing inference are met. Minitab 
output for a chi-square test using these data is shown. 


Chi-Square Test: Male, Female 
Expected counts are printed below observed counts 


Chi-Square contributions are printed below expected counts 


Male Female Total 
Yes I 29 36 
11.69 Ue. 
1.883 0.906 
No ot 50 81 
26.31 54.69 
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0.837 0.403 
Total 38 79 ley, 
Chi-Sq = 4.028, DF = 1, P-Value = 0.045 


a. Should a chi-square test for independence or a chi-square test for 
homogeneity be used in this setting? Explain your reasoning. 


b. State an appropriate pair of hypotheses for researchers to test in this 
setting. 


c. Which cell contributes most to the chi-square test statistic? In what 
way does this cell differ from what the null hypothesis suggests? 


d. Interpret the P-value. What conclusion would you draw at a=0.01? 
a = 0.01? 


SOLUTION: 


a. Chi-square test for independence. The data were produced using a 
single random sample of college students, who were then classified 
according to two variables: gender and whether or not they had 
lingering fright symptoms. The chi-square test for homogeneity 
requires independent random samples from each population. 


In this setting, the researchers wouldn’t know either set of totals until 


the data were collected. They would know only the overall total (117) 
in advance. Thus, a test for independence is the correct choice. 


b. H0:/40 : There is no association between gender and whether or not 
college students have lingering fright symptoms. 
Ha:/a : There is an association between gender and whether or not 
college students have lingering fright symptoms. 


c. Men who admit to having lingering fright symptoms account for the 
largest component of the chi-square test statistic (1.883). Far fewer 
men in the sample admitted to lingering fright symptoms (7) than we 
would expect if Hoo were true (11.69). 


d. If there is no association between gender and whether or not college 
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students have lingering fright symptoms, there is a 0.045 probability 
of obtaining an association as strong as or stronger than the one 
observed in the random sample of 117 students. Because the P-value 
of 0.045+a = 0.019-045 + a = 0.01 we fail to reject HOM#0. We do not 
have convincing evidence that there is an association between gender 
and whether or not college students have lingering fright symptoms. 


FOR PRACTICE, TRY EXERCISE 49 


WHAT IF WE WANT TO COMPARE TWO PROPORTIONS? 
Shopping at secondhand stores is becoming more popular and has even 


attracted the attention of business schools. A study of customers’ attitudes 


toward secondhand stores interviewed separate random samples of 
shoppers at two secondhand stores of the same chain in different cities. 
The two-way table shows the breakdown of respondents by gender.12 


Store 
Total 
Gender Male 106 
Female 353 
Total 459 


Do the data provide convincing evidence of a difference in the 
distributions of gender for shoppers at these two stores? To answer this 
question, we could perform a chi-square test for homogeneity. Our 
hypotheses are 


HO:/40 : There is no difference in the distributions of gender for 
shoppers at these two stores. 


Ha:/4a : There is a difference in the distributions of gender for 
shoppers at these two stores. 


A difference in distribution of gender would mean that there is a difference 


in the true proportion of female shoppers at the two stores. So we could 


also use a two-sample z test from Section 10.1 to compare two proportions. 


The hypotheses for this test are 
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Ho: pa — pp = 90 
HO: pA-pB=0Ha: pA-pB<0 Ha: Pa — PB # 0 
where pAP4A and pBP2 are the true proportions of female shoppers at 
Store A and Store B, respectively. 
The TI-84 screen shots show the results from a two-sample z test for 
pA-pBP4 ~ P8 and from a chi-square test for homogeneity. (We verified 


that the Random, 10%, and Large Counts conditions were met before 
carrying out the calculations.) 


ORMAL FLOAT AUTO REAL RADIAN CL fil ORMAL FLOAT AUTO REAL RADIAN CL 1 


PitP2 X2=15.33407007 
z=3. 915874113 P=9,0077244e -S 
P=9,0114092E-S df=1 


61=. 8423236515 
6 2=. 6880733945 
b=. 7690631808 
n1=241 
n2=218 


Note that the P-values from the two tests are the same except for 
rounding errors. You can also check that the chi-square test statistic is the 
square of the two-sample z statistic: (3.915--)2=15.334 
(3.915--+)? = 15.334. 


As the preceding example suggests, the chi-square test for 
homogeneity based on a 2x22 2 table is equivalent to the two-sample z 
test for p1—p2P1 ~ P2 with a two-sided alternative hypothesis. However, 
there are other settings where only one of the options is valid: 


e If the two-way table is larger than 2x22 x 2 the only option is a chi- 
square test for homogeneity. 


e Ifthe table is 2x22 x 2 and the alternative hypothesis is one-sided, use a 
two-sample z test for a difference in proportions rather than a chi-square 
test. 


e Ifthe table is 2x22 x 2 and you want to construct a confidence interval 
for a difference in proportions, the only option is a two-sample z 
interval. 


Because of the points made in the last two bullets, we recommend the 
Chapter 10 methods for comparing two proportions whenever you are 
given a choice. 
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GROUPING QUANTITATIVE DATA INTO CATEGORIES As we 
mentioned in Chapter 1, it is possible to convert a quantitative variable to a 
categorical variable by grouping together intervals of values. Here’s an 
example. Researchers surveyed independent random samples of shoppers 
at two secondhand stores of the same chain in different cities. The two- 
way table summarizes data on the incomes of the shoppers in the two 
samples. 


Total 

Under $10,000 132 
$10,000 to 115 
$19,999 

Income $20,000 to 119 
$24,999 
$25,000 to Al 
$34,999 
$35,000 or more Al 
Total 459 


Personal income is a quantitative variable. But by grouping the values 
of this variable, we create a categorical variable. We could use these data 
to carry out a chi-square test for homogeneity because the data came from 
independent random samples of shoppers at the two stores. Comparing the 
distributions of income for shoppers at the two stores would give more 
information than simply comparing their mean incomes. 


WHAT IF SOME OF THE EXPECTED CELL COUNTS ARE LESS 
THAN 5? Let’s look at a situation where this is the case. A sample survey 
asked a random sample of young adults, “Where do you live now? That is, 
where do you stay most often?” Here is a two-way table of all 2984 people 
in the sample (both men and women) classified by their age and by where 
they lived! Living arrangement is a categorical variable. Even though 
age is quantitative, the two-way table treats age as categorical by dividing 
the young adults into four categories. The table gives the observed counts 
for all 20 combinations of age and living arrangement. 
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— Te 


Parents’ 324 | 378 | 337 | 318 
home 
Another 
person’s 

Living arrangement home 
Your own 116 | 279 | 372 | 487 
place 


Group 
quarters 


Other 
Total 


Our null hypothesis is HO: : There is no association between age 
and living arrangement in the population of young adults. The following 
table shows the expected counts assuming H0/4 is true. We can see that 
two of the expected counts (circled in red) are less than 5. This violates the 
Large Counts condition. 


4 she 


Total 
Parents’ 245.57 | 348.35 | 364.26 | 398.82] 1357 
home 
Another 29.32} 41.59] 43.49} 47.61} 162 
person’s 
Living arrangement home 


Yourown | 226.93 321.90 336.61) 368.55) 1254 
place 


Group 34.75) 49.29) 51.54} 56.43) 192 
quarters 

Other | 6.44) G88) 5.10] 5.58] 19 
Total 540} 766; 801 877 | 2984 


To make all of the expected counts 5 or more, a clever strategy is to 
“collapse” the table by combining two or more rows or columns. In this 
case, it might make sense to combine the “Group quarters” and “Other” 
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living arrangements. Doing so and then running a chi-square test in 
Minitab gives the following output. Notice that the Large Counts condition 
is now met. 


Chi-Square Test: 19, 20, 21, 22 


Expected counts are printed below observed counts 


Chi-Square contributions are printed below expected counts 


19 20 22 
324 378 318 


245.57 348.35 398.82 
25.049) 2.525 16.379 
37 47 38 


29.32 41.59 47.61 
2.014 0.705 1.94 
116 279 487 


226.93 321.9 368.55 
94.226 5.719 38.068 
63 62 34 


38.18 54.16 62.01 
16.129 = 1.134 12.654 
940 766 877 2984 


DF = 9, P-Value = 0.000 


Parent’s home 


Another home 


Own place 


Total 
Chi-Sq = 182.961, 


Section 11.2 Summary 


e We use the chi-square test for homogeneity to compare the 
distribution of a single categorical variable for each of several 
populations or treatments. The null hypothesis is that there is no 
difference in the distribution of the categorical variable for each of the 
populations or treatments. 


e The conditions for performing a chi-square test for homogeneity are: 


: Random: The data come from independent random samples or groups 
in a randomized experiment. 
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° 10%: When sampling without replacement, n < 0.10 N for each 
sample. 


. Large Counts: All expected counts must be at least 5. 


We use the chi-square test for independence to test the association 
between two categorical variables. The null hypothesis is that there is no 
association between the two categorical variables in the population of 
interest. Another way to state the null hypothesis is that the two 
categorical variables are independent in the population of interest. 


The conditions for performing a chi-square test for independence are: 


: Random: The data come from a random sample from the population 
of interest. 


© 10%: When sampling without replacement, n < 0.10 N. 
. Large Counts: All expected counts must be at least 5. 


The expected count in any cell of a two-way table when Hg is true is 


expected count=row total - column totaltable total 


expected count = row total - column total 
table total 


The chi-square test statistic is 


x2=)>; (Observed count — Expected count)2Expected count 
i os (Observed count — Expected count)? 


Expected count 


where the sum is over all cells in the two-way table (not including the 
totals). 


Calculate the P-value by finding the area to the right of y2X under the 
chi-square density curve with (number of rows—1)(number of columns 


= 1)(number of rows — 1)(number of columns — 1) degrees of freedom. 


If the test finds a statistically significant result, consider doing a follow- 
up analysis that looks for the largest components of the chi-square test 
Statistic and compares the observed and expected counts in the 
corresponding cells. 
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11.2 Technology Corner (» 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 
29. Performing chi-square tests for two-way tables Page 735 


Section 11.2 Exercises 


27. The color of candy Inspired by the example about how background 
music influences choice of entrée at a restaurant, a statistics student 
decided to investigate other ways to influence a person’s behavior. 
Using 60 volunteers, she randomly assigned 20 volunteers to get a 
“red” survey, 20 volunteers to get a “blue” survey, and 20 volunteers 
to get a control survey. The first three questions on each survey were 
the same, but the fourth and fifth questions were different. For 
example, the fourth question on the “red” survey was “When you 
think of the color red, what do you think about?” On the blue survey, 
the question replaced red with blue. On the control survey, the last 
two questions were not about color. As a reward, each volunteer was 
allowed to choose a chocolate candy in a red wrapper or a chocolate 
candy in a blue wrapper. Here are segmented bar graphs showing the 
results of the experiment. Describe what you see. 


100 5 
ES Red candy 
80 - fe Blue candy 
= 0-4 
2 
= 
2 
& 40-4 
20 5 
0 


Red Blue Control 
survey survey survey 


Type of survey 


28. Python eggs How is the hatching of water python eggs influenced by 
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the temperature of the snake’s nest? Researchers randomly assigned 
newly laid eggs to one of three water temperatures: hot, neutral, or 
cold. Hot duplicates the extra warmth provided by the mother python, 
and cold duplicates the absence of the mother.42 Here are segmented 
bar graphs showing the results of the experiment. Describe what you 


see. 
1.07 
0.9 - Hatched? 
0.8 4 Hj No 
0.7 4 fa Yes 
0.6 4 
0.5 4 
0.4 4 
0.3 - 
0.2 4 
0.174 
0.0 


Cold Neutral Hot 
Water temperature 


Relative frequency 


29. pg 731 |») More candy The two-way table shows the results of the 


experiment described in Exercise 27. 


Survey type 
Total 
Color of Red 26 
candy Blue 34 
Total 20 20 20 60 


a. State the appropriate null and alternative hypotheses. 


b. Show the calculation for the expected count in the Red/Red cell. 
Then provide a complete table of expected counts. 


c. Calculate the value of the chi-square test statistic. 


30. More pythons The two-way table shows the results of the experiment 
described in Exercise 28. 


Water temperature 


Cold Neutral Hot Total 
Hatching Yes 129 
status No 58 


Total 27 56 104 187 


a. State the appropriate null and alternative hypotheses. 


b. Show the calculation for the expected count in the Cold/Yes cell. 
Then provide a complete table of expected counts. 


c. Calculate the value of the chi-square test statistic. 
31. pg 734 |p) Last candy Refer to Exercises 27 and 29. 
a. Verify that the conditions for inference are met. 


b. Use Table C to find the P-value. Then use your calculator’s y2xX” 
cdf command. 


c. Interpret the P-value from the calculator. 


d. What conclusion would you draw using a=0.01?a@ = 0.01? 


32. Last python Refer to Exercises 28 and 30. 
a. Verify that the conditions for inference are met. 


b. Use Table C to find the P-value. Then use your calculator’s y2x° 
cdf command. 


c. Interpret the P-value from the calculator. 


d. What conclusion would you draw using a=0.10?@ = 0.10? 


33. Sorry, no chi-square How do U.S. residents who travel overseas for 
leisure differ from those who travel for business? The following is the 
breakdown by occupation.12 


Occupation Leisure travelers (%) Business travelers (%) 


Professional/technical 36 30 
Manager/executive 23 48 
Retired 14 
Student 2 
Other 20 
Total 100 100 
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34. 


35. 


Explain why we can’t use a chi-square test to learn whether these two 
distributions differ significantly. 


Going nuts The UR Nuts Company sells Deluxe and Premium nut 
mixes, both of which contain only cashews, Brazil nuts, almonds, and 
peanuts. The Premium nuts are much more expensive than the Deluxe 
nuts. A consumer group suspects that the two nut mixes are really the 
same. To find out, the group took separate random samples of 20 
pounds of each nut mix and recorded the weights of each type of nut 
in the sample. Here are the data:/4 


Type of mix 
Type of Premium Deluxe 
nut 
Cashew 6 lb 5 |b 
Brazil nut 3 lb 4 |b 
Almond 5 lb 6 lb 
Peanut 6 lb 5 Ib 


Explain why we can’t use a chi-square test to determine whether 
these two distributions differ significantly. 


pg 738 |») Gummy bears Courtney and Lexi wondered if the 


distribution of color was the same for name-brand gummy bears 
(Haribo Gold) and store-brand gummy bears (Great Value). To 
investigate, they randomly selected 6 bags of each type and counted 
the number of gummy bears of each color.+2 Here are the data: 


Brand 

Name Store Total 

Red 349 

Green 157 

Color Yellow 135 
Orange 208 

White 146 

Total ala 622 995 
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36. 


37. 


Do these data provide convincing evidence that the distributions of 
color differ for name-brand gummy bears and store-brand gummy 
bears? 


How are schools doing? The nonprofit group Public Agenda 
conducted telephone interviews with three randomly selected groups 
of parents of high school children. There were 202 black parents, 202 
Hispanic parents, and 201 white parents. One question asked, “Are 
the high schools in your state doing an excellent, good, fair, or poor 
job, or don’t you know enough to say?” Here are the survey results:49 


Parents’ race/ethnicity 


Total 
Excellent 68 
Good 205 
Opinion 
about high {2 as 
schools Poor 72 
Don’t 64 
know 
Total 202 202 201 605 


Do these data provide convincing evidence that the distributions of 
opinion about high schools differ for the three populations of parents? 


How to quit smoking It’s hard for smokers to quit. Perhaps 
prescribing a drug to fight depression will work as well as the usual 
nicotine patch. Perhaps combining the patch and the drug will work 
better than either treatment alone. Here are data from a randomized, 
double-blind trial that compared four treatments.12 A “success” 
means that the subject did not smoke for a year following the start of 
the study. 


Group Treatment Subjects Successes 
1 Nicotine patch 244 40 
2 Drug 244 74 
3 Patch plus drug 245 87 
4 Placebo 160 25 
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a. Summarize these data in a two-way table. 


b. Do the data provide convincing evidence of a difference in the 
effectiveness of the four treatments at the a=0.05a = 0.05 
significance level? 


38. Preventing strokes Aspirin prevents blood from clotting and so helps 
prevent strokes. The Second European Stroke Prevention Study asked 
whether adding another anticlotting drug named dipyridamole would 
be more effective for patients who had already had a stroke. Here are 
the data on strokes during the two years of the study:48 


Group Treatment Number of patients §=Number who had a stroke 


1 ‘Placebo 1649 250 
2 Aspirin 1649 206 
3 Dipyridamole 1654 211 
4 Both 1650 157 


a. Summarize these data in a two-way table. 


b. Do the data provide convincing evidence of a difference in the 
effectiveness of the four treatments at the a=0.05a@ = 0.05 
significance level? 


39. How to quit smoking Refer to Exercise 37. Which treatment seems 
to be most effective? Least effective? Justify your choices. 


40. Preventing strokes Refer to Exercise 38. Which treatment seems to 
be most effective? Least effective? Justify your choices. 


41. Relaxing in the sauna Researchers followed a random sample of 
2315 middle-aged men from eastern Finland for up to 30 years. They 
recorded how often each man went to a sauna and whether or not he 
suffered sudden cardiac death (SCD). The two-way table shows the 


data from the study.22 


Weekly sauna frequency 


Yes 


SCD 


42. 


43. 


No 540 1394 191 2125 
Total 601 1513 201 2315 


a. State appropriate hypotheses for performing a chi-square test for 
independence in this setting. 


b. Compute the expected counts assuming that H0/40 is true. 
c. Calculate the chi-square test statistic, df, and P-value. 


d. What conclusion would you draw? 


Is astrology scientific? The General Social Survey (GSS) asked a 
random sample of adults their opinion about whether astrology is very 
scientific, sort of scientific, or not at all scientific. Here is a two-way 
table of counts for people in the sample who had three levels of 


higher education:22 


Degree held 
Associate’s} Bachelor’s | Master’s Total 
Not at all 539 
Opinion scientific 
about Very or 148 
astrology sort of 
scientific 
Total 234 321 132 687 


a. State appropriate hypotheses for performing a chi-square test for 
independence in this setting. 


b. Compute the expected counts assuming that H0/40 is true. 
c. Calculate the chi-square test statistic, df, and P-value. 
d. What conclusion would you draw? 


pg 744 |») Finger length Is your index finger longer than your ring 


finger? Or is it the other way around? It isn’t the same for everyone. 
To investigate if there is a relationship between gender and relative 
finger length, we selected a random sample of 460 U.S. high school 
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44. 


45. 


students who completed a survey. The two-way table shows the 
results. 


Gender 
Female Male Total 
Index longer 158 
ree gen Same eet r 
Ring longer 216 
Total 227 233 460 


Do these data provide convincing evidence at the a=0.10a = 0.10 
level of an association between gender and relative finger length in 
the population of students who completed the survey? 


Regulating guns The National Gun Policy Survey asked a random 
sample of adults, “Do you think there should be a law that would ban 
possession of handguns except for the police and other authorized 
persons?” Here are the responses, broken down by the respondent’s 


level of education:24 


Education 


Opinion 
about 


Do these data provide convincing evidence at the a=0.05a = 0.05 
level of an association between education level and opinion about a 
handgun ban in the adult population? 


Tuition bills A random sample of U.S. adults was recently asked, 
“Would you support or oppose major new spending by the federal 
government that would help undergraduates pay tuition at public 
colleges without needing loans?” The two-way table shows the 
responses, grouped by age.22 
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46. 


47. 


Do these data provide convincing evidence of an association between 
age and opinion about loan-free tuition in the population of U.S. 
adults? 


Online banking A recent poll conducted by the Pew Research Center 
asked a random sample of 1846 Internet users if they do any of their 
banking online. The table summarizes their responses by age.22 Is 
there convincing evidence of an association between age and use of 


online banking for Internet users? 


Age 
18-29 30-49 50-64 65+ Total 
Online Yes 1088 
banking No 758 
Total 395 942 593 356 1846 


Which test? Determine which chi-square test is appropriate in each 
of the following settings. Explain your reasoning. 


a. With many babies being delivered by planned cesarean section, 
Mrs. McDonald’s statistics class hypothesized that there would be 
fewer younger people born on the weekend. To investigate, they 
selected a random sample of people born before 1980 and a 
separate random sample of people born after 1993. In addition to 
year of birth, they also recorded the day of the week on which each 
person was born. 


b. Are younger people more likely to be vegan/vegetarian? To 


investigate, the Pew Research Center asked a random sample of 
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1480 U.S. adults for their age and whether or not they are 
vegan/vegetarian. 


48. Which test? Determine which chi-square test is appropriate in each 
of the following settings. Explain your reasoning. 


49. 


a. 


Does chocolate help heart-attack victims live longer? Researchers 
in Sweden randomly selected 1169 people who had suffered heart 
attacks and asked them about their consumption of chocolate in the 
previous year. Then the researchers followed these people and 


recorded whether or not they had died within 8 years.24 


Random-digit-dialing telephone surveys used to exclude cell- 
phone numbers. If the opinions of people who have only cell 
phones differ from those of people who have landline service, the 
poll results may not represent the entire adult population. The Pew 
Research Center interviewed separate random samples of cell-only 
and landline telephone users who were less than 30 years old and 


asked them to describe their political party affiliation.22 


pg 747 |p») Where do young adults live? A survey by the National 


Institutes of Health asked a random sample of young adults (aged 19 
to 25 years), “Where do you live now? That is, where do you stay 
most often?” Here is the full two-way table (omitting a few who 


refused to answer and one who reported being homeless):22 
Gender 
Female Male Total 
Parents’ 1909 
home 
Another 276 
Living person’s 
location home 
Own place 2423 
Group 246 
quarters 
Total 2488 2366 4854 
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a. Should we use a chi-square test for homogeneity or a chi-square 
test for independence in this setting? Justify your answer. 


b. State appropriate hypotheses for performing the type of test you 
chose in part (a). 


Here is Minitab output from a chi-square test. 


Chi-Square Test: Female, Male 


Expected counts are printed below observed counts 


Chi-Square contributions are printed below expected counts 


Female Male Total 

Parents’ home 923 986 1909 
978.49 930.51 
3.147 3.309 

Another home 144 132 276 
141.47 134.53 
0.045 0.048 

Own place 1294 1129 = 2423 
1241.95 1181.05 
2.181 2.294 

Group 127 119 246 
126.09 119.91 
0.007 0.007 

Total 2488 2366 4854 

Chi-Sq = 11.038, DF = 3, P-Value = 0.012 


c. Check that the conditions for carrying out the test are met. 


d. Interpret the P-value. What conclusion would you draw? 


50. Distance from home A study of first-year college students asked 
separate random samples of students from private and public 
universities the following question: “How many miles is this 
university from your permanent home?” Students had to choose from 
the following options: 5 or fewer, 6 to 10, 11 to 50, 51 to 100, 101 to 
500, or more than 500.2 Here is the two-way table summarizing the 
responses: 


Type of university 
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Distance 
from home 
(miles) 


5 or fewer 
6 to 10 

11 to 50 
51 to 100 
101 to 500 
Over 500 
Total 


Public 


1951 1028 
2688 1285 
10,971 5527 


6765 2211 
15,177 6195 
9811 9486 


43,363 


Private 


253732 


Total 
2979 
3973 
16,498 

8976 
21,372 
15,297 
69,095 


Should we use a chi-square test for homogeneity or a chi-square 


test for independence in this setting? Justify your answer. 


chose in part (a). 


Here is Minitab output from a chi-square test. 


Chi-Square Test: Public, Private 


Expected counts are printed below observed counts 


State appropriate hypotheses for performing the type of test you 


Chi-Square contributions are printed below expected counts 


5 or less 


6 to 10 


11 to 50 


o1 to 100 


101 to 500 


Over 500 


Public 
1951 
1869.6 
3.54 
2688 
2493.4 
15.19 
10971 
10354 
36.77 
6765 
5633.2 
227.4 
15177 
13413 
232 
5811 
9600.2 
1496 
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Private 
1028 
1109.4 
5.97 
1285 
1479.6 
25.59 
5527 
6144.1 
61.98 
2211 
3342.8 
383.2 
6195 
7959.2 
391 
9486 
5696.8 
2520.4 


Total 
2979 


3973 


16498 


8976 


21372 


15297 


Total 43363 
Chi-Sq = 5398.7, DF =5, 


25732 69095 
P-Value = 0.0000 


c. Check that the conditions for carrying out the test are met. 


d. Interpret the P-value. What conclusion would you draw? 


51. Where do you live? Conduct a follow-up analysis for the test in 
Exercise 49. 


52. How far away do you live? Conduct a follow-up analysis for the test 
in Exercise 5i). 


53. Treating ulcers Gastric freezing was once a recommended treatment 
for ulcers in the upper intestine. Use of gastric freezing stopped after 
experiments showed it had no effect. One randomized comparative 
experiment found that 28 of the 82 gastric-freezing patients improved, 
while 30 of the 78 patients in the placebo group improved.22 We can 
test the hypothesis of “no difference” in the effectiveness of the 
treatments in two ways: with a two-sample z test or with a chi-square 
test. 


a. State appropriate hypotheses for a chi-square test. 


b. Here is Minitab output for a chi-square test. Interpret the P-value. 
What conclusion would you draw? 


Chi-Square Test: Gastric freezing, Placebo 

Expected counts are printed below observed counts 

Chi-Square contributions are printed below expected counts 
Gastric freezing Placebo Total 


Improved 28 30 58 
29.73 20.27 
0.1 0.105 
Didn’t improve 54 48 102 
a2 27 49.73 
0.057 0.06 
Total 82 78 160 
Chi-Sq = 0.322, DF = 1, P-Value = 0.570 


c. Here is Minitab output for a two-sample z test. Explain how these 
results are consistent with the test in part (a). 


1655 


Test for Two Proportions 


Sample x N Sample p 
1 28 = 82 0.341463 
2 30 78 0.384615 


Difference = p(1) — p(2) 

Estimate for difference: —0.0431520 
Test for difference = 0 (vs not = 0): 
Z = —-0.57 P-Value = 0.570 


54. Opinions about the death penalty The General Social Survey (GSS) 
asked separate random samples of people with only a high school 
degree and people with a bachelor’s degree, “Do you favor or oppose 
the death penalty for persons convicted of murder?” Of the 1379 
people with only a high school degree, 1010 favored the death 
penalty, while 319 of the 504 people with a bachelor’s degree favored 
the death penalty. We can test the hypothesis of “no difference” in 
support for the death penalty among people in these educational 
categories in two ways: with a two-sample z test or with a chi-square 
test. 


a. State appropriate hypotheses for a chi-square test. 


b. Here is Minitab output for a chi-square test. Interpret the P-value. 
What conclusion would you draw? 


Chi-Square Test: HS, Bachelor 
Expected counts are printed below observed counts 
Chi-Square contributions are printed below expected counts 


HS Bachelor Total 

Favor 1010 319 1329 
973.28 355.72 
1.385 3.790 

Oppose 369 185 504 
405.72 148.28 
3.323 9.092 

Total 1379 504 1883 


Chi-Sq = 17.590, 


DF = 1, P-Value = 0.000 


c. Here is Minitab output for a two-sample z test. Explain how these 
results are consistent with the test in part (a). 
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Multiple Choice Select the best answer for Exercises 55—60. 


Test for Two Proportions 


Sample Xx N 
1 1010 1373 
2 319 504 


Difference = p(1) — p(2) 

Estimate for difference: 0.0994783 
Test for difference = 0 (vs not = 0) : 
Z = 4.19 P-Value = 0.000 


Sample p 
0.732415 
0.632937 


Exercises 55—58 refer to the following setting. The National Longitudinal 
Study of Adolescent Health interviewed a random sample of 4877 teens 

(grades 7 to 12). One question asked, “What do you think are the chances 
you will be married in the next 10 years?” Here is a two-way table of the 


responses by gender:22 


Opinion about pot 


marriage 


Almost no 
chance 


Some chance, 
but probably 


A 50-50 
chance 


A good chance 


Almost certain 


Total 2625 2252 


Total 
222 


6 fall 


959 


1445 
1930 
4877 


55. Which of the following is the appropriate null hypothesis for 
performing a chi-square test? 


a. Equal proportions of female and male teenagers are almost certain 
they will be married in 10 years. 


b. There is no difference between the distributions of female and 


male teenagers’ opinions about marriage in this sample. 


c. There is no difference between the distributions of female and 
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d. 


e. 


male teenagers’ opinions about marriage in the population. 


There is no association between gender and opinion about 
matriage in the sample. 


There is no association between gender and opinion about 
marriage in the population. 


56. Which of the following is the expected count of females who respond 
“Almost certain”? 


ai. 


58. 


a. 
b. 
C. 
d. 


e. 


487.7 
O20 
965 
1038.8 
1174 


Which of the following is the correct number of degrees of freedom 
for the chi-square test using these data? 


a. 
b. 


Cc. 


e. 


4 


4876 


For these data, y2=69.8X" = 69.8 with a P-value of approximately 0. 
Assuming that the researchers used a significance level of 0.05, which 
of the following is true? 


a. 
b. 
C. 


d. 


A Type I error is possible. 
A Type II error is possible. 
Both a Type I and a Type IJ error are possible. 


There is no chance of making a Type I or Type II error because the 
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59. 


60. 


P-value is approximately 0. 


e. There is no chance of making a Type I or Type II error because the 
calculations are correct. 


When analyzing survey results from a two-way table, the main 
distinction between a test for independence and a test for 
homogeneity is 


a. how the degrees of freedom are calculated. 
b. how the expected counts are calculated. 

c. the number of samples obtained. 

d. the number of rows in the two-way table. 


e. the number of columns in the two-way table. 


Cocaine addicts need cocaine to feel any pleasure, so perhaps giving 
them an antidepressant drug will help. A 3-year study with 72 chronic 
cocaine users compared an antidepressant drug called desipramine 
with lithium (a standard drug to treat cocaine addiction) and a 
placebo. One-third of the subjects were randomly assigned to receive 
each treatment. At the end of the study, researchers recorded whether 
or not the subjects relapsed.22 Which of the following conditions 
must be satisfied to perform the appropriate chi-square test using the 
data from this study? 


I. The population distribution is approximately Normal. 
II. The treatments were randomly assigned. 
III. The observed counts are all at least 5. 
a. I only 
b. Il only 
c. III only 
d. I and II 


e. I, II, and III 
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Recycle and Review 


For Exercises 61 and 62, you may find the inference summary chart inside 
the back cover helpful. 


61. 


62. 


Inference recap (8.1 to 11.2) In each of the following settings, state 
which inference procedure from Chapter 8, 9, 10, or 11 you would 
use. Be specific. For example, you might answer, “Two-sample z test 
for the difference between two proportions.” You do not have to carry 


out any procedures.24 


a. What is the average voter turnout during an election? A random 
sample of 38 cities was asked to report the percent of registered 
voters who voted in the most recent election. 


b. Are blondes more likely to have a boyfriend than the rest of the 


single world? Independent random samples of 300 blondes and 
300 nonblondes were asked whether they have a boyfriend. 


Inference recap (8.1 to 11.2) In each of the following settings, state 
which inference procedure from Chapter 8, 9, 10, or 11 you would 
use. Be specific. For example, you might answer, “Two-sample z test 
for the difference between two proportions.” You do not have to carry 


out any procedures.22 


a. Is there a relationship between attendance at religious services and 
alcohol consumption? A random sample of 1000 adults was asked 
whether they regularly attend religious services and whether they 
drink alcohol daily. 


b. Separate random samples of 75 college students and 75 high 


school students were asked how much time, on average, they spend 
watching television each week. We want to estimate the difference 
in the average amount of TV watched by high school and college 
students. 
Exercises 63 and 64 refer to the following setting. For their final 
project, a group of AP® Statistics students investigated the following 
question: “Will changing the rating scale on a survey affect how 
people answer the question?” To find out, the group took an SRS of 
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63. 


64. 


50 students from an alphabetical roster of the school’s just over 1000 
students. The first 22 students chosen were asked to rate the cafeteria 
food on a scale of 1 (terrible) to 5 (excellent). The remaining 28 
students were asked to rate the cafeteria food on a scale of 0 (terrible) 
to 4 (excellent). Here are the data: 


1 to 5 scale 
Rating 1 2 3 4 5 
Frequency 2 3 1 13 3 
0 to 4 scale 
Rating 0 1 2 3 
Frequency 0 0 2 18 


Design and analysis (4.2, 11.2) 


a. Was this an observational study or an experiment? Justify your 
answer. 


b. Explain why it would not be appropriate to perform a chi-square 


test in this setting. 


Average ratings (1.3, 2.1, 10.2) The students decided to compare the 
average ratings of the cafeteria food on the two scales. 


a. Find the mean and standard deviation of the ratings for the students 
who were given the 1-to-5 scale. 


b. For the students who were given the 0-to-4 scale, the ratings have a 
mean of 3.21 and a standard deviation of 0.568. Since the scales 
differ by one point, the group decided to add 1 to each of these 
ratings. What are the mean and standard deviation of the adjusted 
ratings? 


c. Would it be appropriate to compare the means from parts (a) and 
(b) using a two-sample t test? Justify your answer. 
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— FRAPPY! FREE RESPONSE 


AP® PROBLEM, 
YAY! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, concise 
response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you 
use, because you will be scored on the correctness of your methods 
as well as on the accuracy and completeness of your results and 
explanations. 


Two statistics students wanted to know if including additional 
information in a survey question would change the distribution of 
responses. To find out, they randomly selected 30 teenagers and 
asked them one of the following two questions. Fifteen of the 
teenagers were randomly assigned to answer Question A, and the 
other 15 students were assigned to answer Question B. 


Question A: When choosing a college, how important is a good 
athletic program: very important, important, somewhat important, not 
that important, or not important at all? 


Question B: It’s sad that some people choose a college based on its 
athletic program. When choosing a college, how important is a good 
athletic program: very important, important, somewhat important, not 
that important, or not important at all? 
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The table below summarizes the responses to both questions. For 
these data, the chi-square test statistic is y2=6.12X = 6-12, 


Question 
Total 

Very 9 

important 

Important 7 
Importance Somewhat fs) 
of a good important 
athletic Not that 3 
program important 

Not 6 

important at 

all 

Total 30 


a. State the hypotheses that the students are interested in testing. 


b. Describe a Type I error and a Type II error in the context of the 
hypotheses stated in part (a). 


c. For these data, explain why it would not be appropriate to use 
a chi-square distribution to calculate the P-value. 


d. To estimate the P-value, 100 trials of a simulation were 
conducted, assuming that the additional information didn’t 
have an effect on the response to the question. In each trial of 
the simulation, the value of the chi-square test statistic was 
calculated. These simulated chi-square test statistics are 
displayed in the dotplot shown here. 


« 
* ¢ © $8 


6 8 10 12 14 16 18 
Simulated chi-square statistic 


Based on the results of the simulation, what conclusion would 
you make about the hypotheses stated in part (a)? 
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After you finish, you can view two example solutions on the book’s 
website (highschool.bfwpub.com/tps6e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 
“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide you with a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own 


score. 
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Chapter 11 Review 


Section 11.1: Chi-Square Tests for Goodness of Fit 


In this section, you learned the details for performing a chi-square test for 
goodness of fit. The null hypothesis is that a single categorical variable 
follows a specified distribution in a population of interest. The alternative 
hypothesis is that the variable does not follow the specified distribution in 
the population of interest. 


The chi-square test statistic measures the difference between the 
observed distribution of a categorical variable and its hypothesized 
distribution. To calculate the chi-square test statistic, use the following 
formula that involves the observed and expected counts for each value of 
the categorical variable: 

Xx2=)». (Observed count — Expected count)2Expected count 
‘ (Observed count — Expected count)? 
2. a 


Expected count 


To calculate the expected counts, multiply the total sample size by the 
hypothesized proportion for each category. Larger values of the chi-square 
test statistic provide more convincing evidence that the distribution of the 
categorical variable differs from the hypothesized distribution in the 
population of interest. 


When the Random, 10%, and Large Counts conditions are satisfied, we 
can accurately model the sampling distribution of the chi-square test 
Statistic using a chi-square distribution (density curve). The Random 
condition says that the data are from a random sample from the population 
of interest. The 10% condition says that the sample size should be less than 
10% of the population size when sampling without replacement. The 
Large Counts condition says that the expected counts for each category 
must be at least 5. In a test for goodness of fit, use a chi-square distribution 
with degrees of freedom=number of categories—1 
degrees of freedom = number of categories — 1. 


When the results of a test for goodness of fit are significant, consider 
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doing a follow-up analysis. Identify which categories of the variable had 
the largest contributions to the chi-square test statistic and whether the 
observed values in those categories were larger or smaller than expected. 


Section 11.2: Inference for Two-Way Tables 


In this section, you learned two different tests to analyze categorical data 
that are summarized in a two-way table. A test for homogeneity compares 
the distribution of a single categorical variable for two or more populations 
or treatments. A test for independence looks for an association between 
two categorical variables in a single population. 


In a chi-square test for homogeneity, the null hypothesis is that there is 
no difference in the true distribution of a categorical variable for two or 
more populations or treatments. The alternative hypothesis is that there is a 
difference in the distributions. The Random condition is that the data come 
from independent random samples or groups in a randomized experiment. 
The 10% condition applies for each sample when sampling without 
replacement, but not in experiments. Finally, the Large Counts condition 
remains the same—the expected counts must be at least 5 in each cell of 
the two-way table. 


To calculate the expected counts for a test for homogeneity, use the 
following formula: 


expected count=row total - column totaltable total 
row total - column total 


expected count = 
*P table total 


To calculate the P-value, compute the chi-square test statistic and use a 


chi-square distribution with (number of rows—1)(number of columns~—1) 
(number of rows — 1) (number of columns — 1) 


In a chi-square test for independence, the null hypothesis is that there 
is no association between two categorical variables in one population (or 
that the two variables are independent in the population). The alternative 
hypothesis is that there is an association between the two variables (or that 
the two variables are not independent). For this test, the Random condition 
says that the data must come from a single random sample. The 10% 
condition applies when sampling without replacement. The Large Counts 
condition is still the same—the expected counts must all be at least 5. The 
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method for calculating expected counts, the chi-square test statistic, the 
degrees of freedom, and the P-value are exactly the same in a test for 
independence and a test for homogeneity. 


As with tests for goodness of fit, when the results of a test for 
homogeneity or independence are significant, consider doing a follow-up 
analysis. Identify which cells in the two-way table had the largest 
contributions to the chi-square test statistic and whether the observed 
values in those cells were larger or smaller than expected. 


Goodness of fit 


Number of 1 

samples/treatments 

Number of 1 

variables 

Null hypothesis The stated 
distribution of a 
categorical 
variable in the 
population of 
interest is correct. 

Random The data come 

condition from a random 


10% condition 


Large Counts 
condition 


Expected counts 


sample from the 
population of 
interest. 


Homogeneity 


2 or more 


There is no 
difference in the 
distribution of a 
categorical 
variable for 
several 
populations or 
treatments. 


The data come 
from independent 
random samples or 
groups ina 
randomized 
experiment. 


Independence 
1 


There is no 
association 
between two 
categorical 
variables in the 
population of 
interest. 


The data come 
from a random 
sample from the 
population of 
interest. 


When sampling without replacement, n < 0.10N for each 


sample. 


All expected counts > 5 


(samplesize) 


(expectedproportion) 


ea ( expected ) 
size proportion 


row total - column totaltable total 
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row total - column total 
table total 


Formula for test X2=> 
statistic (Observed count — Expected count)2Expected count 


sii = (Observed count — Expected count)” 
Expected count 


Degrees of (# categories — 1) (# rows — 1)(# columns — 1) 
freedom 
TI-83/84 name x2 GOF y2-testX” —test 

—test 


x? GOF—test 


What Did You Learn? 


Relevant 
Related Chapter 
Example Review 
Learning Target Section on Page(s) Exercise(s) 


State appropriate hypotheses and 11.1 712 R11.1 
compute the expected counts and chi- 

square test statistic for a chi-square test 

for goodness of fit. 


State and check the Random, 10%, and 11.1 718 R11.1 
Large Counts conditions for performing a 
chi-square test for goodness of fit. 


Calculate the degrees of freedom and P- —— 11.1 716 R11.1 
value for a chi-square test for goodness 

of fit. 

Perform a chi-square test for goodness of 11.1 718 Ril] 
fit. 

Conduct a follow-up analysis when the 11.1, Discussion R11.3 
results of a chi-square test are statistically 11.2 on 720, 

significant. 739 

State appropriate hypotheses and 11.2 731 R11.4, 
compute the expected counts and chi- R11.5 


square test statistic for a chi-square test 
based on data in a two-way table. 
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State and check the Random, 10%, and 
Large Counts conditions for a chi-square 
test based on data in a two-way table. 


Calculate the degrees of freedom and P- 
value for a chi-square test based on data 
in a two-way table. 


Perform a chi-square test for 
homogeneity. 


Perform a chi-square test for 
independence. 


Choose the appropriate chi-square test in 
a given setting. 
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734 


734 


738 


744 


747 


Chapter 11 Review Exercises 


These exercises are designed to help you review the important ideas and 
methods of the chapter. 


R11.1 |») Testing a genetic model Biologists wish to cross pairs of 


tobacco plants having genetic makeup Gg, indicating that each plant 
has one dominant gene (G) and one recessive gene (g) for color. 
Each offspring plant will receive one gene for color from each 
parent. The Punnett square shows the possible combinations of 
genes received by the offspring. 


Parent 2 passes on: 


G g 
Parent 1 G GG Gg 
Passes 
on: 
8 Gg 88 


The Punnett square suggests that the expected ratio of green (GG) to 
yellow-green (Gg) to albino (gg) tobacco plants should be 1:2:1. In 
other words, the biologists predict that 25% of the offspring will be 
green, 50% will be yellow-green, and 25% will be albino. To test 
their hypothesis about the distribution of offspring, the biologists 
mate 84 randomly selected pairs of yellow-green parent plants. Of 
84 offspring, 23 plants were green, 50 were yellow-green, and 11 
were albino. Do the data provide convincing evidence at the a=0.01 
a = 0.01 level that the true distribution of offspring is different from 
what the biologists predict? 


R11.2 |») Sorry, no chi-square We would prefer to learn from teachers 


who know their subject. Perhaps even preschool children are 
affected by how knowledgeable they think teachers are. Assign 48 
three- and four-year-olds at random to be taught the name of a new 
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toy by either an adult who claims to know about the toy or an adult 
who claims not to know about it. Then ask the children to pick out a 
picture of the new toy from a set of pictures of other toys and say its 
name. The response variable is the count of right answers in four 
tries. Here are the data:22 


Count of correct answers 


0 1 2 3 4 | Total 
Total 25 f. 9 3 10 48 


The researchers report that children who were taught by the teacher 
claiming to be knowledgeable did significantly better (y2=20.4, 


P<0.05)(X" = 20.4, P < 0.05) Explain why this result isn’t valid. 
R11.3 |p) Fewer TVs? The United States Energy Information 


Administration periodically surveys a random sample of U.S. 
households to determine how they use energy.22 One of the 
variables they track is how many TVs are in a household (None, 1, 
2, 3, 4, or 5 or more). The computer output compares the distribution 
of number of TVs for households in 2009 and 2015. 


2009 2015 All 

None 192 174 366 
252 114 
14.113 31.034 

1 3098 1680 4778 
3284 1494 
10.577 23.258 

2 4801 2190 6991 
4806 2185 
0.004 0.010 

3 3406 1495 4901 
3369 1532 
0.408 0.897 

4 1818 689 2507 
1723 784 
5.204 11.442 
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2009 2015 All 


5 or more 1242 392 1634 
1123 511 
12.564 27.628 

All 14557 6620 21177 


Cell Contents: Count 
Expected count 
Contribution to Chi-square 
Chi-Square = 137.137, DF = 5, P-Value = 0.000 


a. Which chi-square test is appropriate to analyze these data? 
Explain your answer. 


b. Show how the numbers 252 and 14.113 were obtained for the 
2009/None cell. 


c. Which 3 cells contribute most to the chi-square test statistic? 
How do the observed and expected counts compare for these 
cells? 


R11.4 |») Stress and heart attacks You read a newspaper article that 


describes a study of whether stress management can help reduce 
heart attacks. The 107 subjects all had reduced blood flow to the 
heart and so were at risk of a heart attack. They were assigned at 
random to three groups. The article goes on to say: 


One group took a four-month stress management program, 
another underwent a four-month exercise program, and the 
third received usual heart care from their personal 
physicians. In the next three years, only 3 of the 33 people in 
the stress management group suffered “cardiac events,” 
defined as a fatal or non-fatal heart attack or a surgical 
procedure such as a bypass or angioplasty. In the same 
period, 7 of the 34 people in the exercise group and 12 out of 


the 40 patients in usual care suffered such events.24 


a. Use the information in the news article to make a two-way table 
that describes the study results. 


b. Compare the success rates of the three treatments in preventing 
cardiac events. 
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c. Do the data provide convincing evidence at the a=0.05a = 0.05 
level that the true success rates for patients like these are not the 
same for the three treatments? 


R11.5 |») Popular kids Who were the popular kids at your elementary 


school? Did they get good grades or have good looks? Were they 
good at sports? A study was performed in Michigan to examine the 
factors that determine social status for children in grades 4, 5, and 6. 
Researchers administered a questionnaire to a random sample of 478 
students in these grades. One of the questions asked, “What would 
you most like to do at school: make good grades, be good at sports, 
or be popular?” The two-way table summarizes the students’ 
responses. 2© Is there convincing evidence of an association between 
gender and goal for students in grades 4, 5, and 6? 


Gender 
Total 
Grades 247 
Goal Popular 141 
Sports 90 
Total 478 
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Chapter 11 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


Exercises T11.1 and T11.2 refer to the following setting. Recent revenue 
shortfalls in a midwestern state led to a reduction in the state budget for 
higher education. To offset the reduction, the largest state university 
proposed a 25% tuition increase. It was determined that such an increase 
was needed simply to compensate for the lost support from the state. 
Separate random samples of 50 freshmen, 50 sophomores, 50 juniors, and 
50 seniors from the university were asked whether they were strongly 
opposed to the increase, given that it was the minimum increase necessary 
to maintain the university’s budget at current levels. Here are the results: 


Year 
Freshman |Sophomore Junior Senior Total 
Strongly Yes 122 
opposed? No 78 
Total 50 50 50 50 200 


T11.1 Which null hypothesis would be appropriate for performing a chi- 
Square test? 


a. The closer students get to graduation, the less likely they are to 
be opposed to tuition increases. 


b. The mean number of students who are strongly opposed is the 
same for each of the 4 years. 


c. The distribution of student opinion about the proposed tuition 
increase is the same for each of the 4 years at this university. 


d. Year in school and student opinion about the tuition increase 
are independent in the sample. 


e. There is an association between year in school and opinion 
about the tuition increase at this university. 
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T11.2 The conditions for carrying out the chi-square test in Exercise 
T11.1 are: 


I. Independent random samples from the populations of interest. 
II. All sample sizes are less than 10% of the populations of interest. 


If. All expected counts are at least 5. 


Which of the conditions is (are) satisfied in this case? 
a. I only 

b. Il only 

c. [and III only 

d. Il and III only 


e. I, II, and III 


Exercises T11.3—111.5 refer to the following setting. A random 
sample of traffic tickets given to motorists in a large city is 
examined. The tickets are classified according to the race or 
ethnicity of the driver. The results are summarized in the following 


table. 
Race/Ethnicity White Black Hispanic Other 
Number of tickets 69 52 18 9 


The proportion of this city’s population in each of the racial/ethnic 
categories listed is as follows. 


Race/Ethnicity White Black Hispanic Other 
Proportion 0.55 0.30 0.08 0.07 


We wish to test HO:/0 : The racial/ethnic distribution of traffic 
tickets in the city is the same as the racial/ethnic distribution of the 
city’s population. 


T11.3 Assuming HO is true, what is the expected number of Hispanic 
drivers who would receive a ticket? 


a. 8 
b. 10.36 
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12 


T11.4 We compute the value of the y2x” test statistic to be 6.57. 
Assuming that the conditions for inference are met, which of the 
following is the correct P-value? 


a. Greater than 0.20 


b. 


Between 0.10 and 0.20 


c. Between 0.05 and 0.10 
d. Between 0.01 and 0.05 


e. 


Less than 0.01 


2 
T11.5 The category that contributes the largest component to the y2* 
test statistic is 


a. 
b. 
C. 
d. 


e. 


White, with 12.4 fewer tickets than expected. 
White, with 12.4 more tickets than expected. 
Hispanic, with 6.16 fewer tickets than expected. 
Hispanic, with 6.16 more tickets than expected. 


Other, with 1.36 fewer tickets than expected. 


T11.6 Which of the following statements about chi-square distributions 
are true? 


I. For all chi-square distributions, P(y2>0)=1" (x? 20) =1 


II. A chi-square distribution with fewer than 10 degrees of freedom 


is roughly symmetric. 


III. The more degrees of freedom a chi-square distribution has, the 


Bo oF p 


larger the mean of the distribution. 
I only 
II only 
III only 
I and III 
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e. I, II, and III 


Exercises T11.7—T11.10 refer to the following setting. All current- 
carrying wires produce electromagnetic (EM) radiation, including 
the electrical wiring running into, through, and out of our homes. 
High-frequency EM radiation is thought to be a cause of cancer. 
The lower frequencies associated with household current are 
generally assumed to be harmless. To investigate the relationship 
between current configuration and type of cancer, researchers 
visited the addresses of a random sample of children who had died 
of some form of cancer (leukemia, lymphoma, or some other type) 
and classified the wiring configuration outside the dwelling as 
either a high-current configuration (HCC) or a low-current 
configuration (LCC). Here are the data: 


Type of cancer 


Wiring HCC 
configuration LCC 
Total 136 31 48 215 


Computer software was used to analyze the data. The output 
included the value y2=0.435X° = 0.435 


T11.7 Which of the following is the appropriate degrees of freedom for 
the y2X test? 

a 1 

b. 2 

@ 3 

d. 4 

e 5 


T11.8 Which of the following is the expected count of cases with 
lymphoma in homes with an HCC? 
79-31 


a. 79°31215 215 
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d. 


e. 


10-21 


10:°21215 215 
79-3110 10 
136-31 


136°31215 215 


None of these 


T11.9 Which of the following may we conclude, based on the chi- 
Square test results? 


a. 


There is convincing evidence of an association between wiring 
configuration and the chance that a child will develop some 
form of cancer. 


HCC either causes cancer directly or is a major contributing 
factor to the development of cancer in children. 


Leukemia is the most common type of cancer among children. 


There is not convincing evidence of an association between 
wiring configuration and the type of cancer that caused the 
deaths of children. 


There is convincing evidence that HCC does not cause cancer 
in children. 


T11.10 A Type I error would occur if we found convincing evidence 
that 


a. 
b. 


HCC wiring caused cancer when it actually didn’t. 
HCC wiring didn’t cause cancer when it actually did. 


there is no association between the type of wiring and the form 
of cancer when there actually is an association. 


there is an association between the type of wiring and the form 
of cancer when there actually is no association. 


the type of wiring and the form of cancer have a positive 
correlation when they actually don’t. 


Section II: Free Response Show all your work. Indicate clearly the 
methods you use, because you will be graded on the correctness of your 
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methods as well as on the accuracy and completeness of your results and 
explanations. 


T11.11 A large distributor of gasoline claims that 60% of all drivers 
stopping at their service stations choose regular unleaded gas and 
that premium and supreme are each selected 20% of the time. To 
investigate this claim, researchers collected data from a random 
sample of drivers who put gas in their vehicles at the distributor’s 
service stations in a large city. The results were as follows. 


Gasoline selected 
Regular Premium Supreme 
261 51 88 


Carry out a test of the distributor’s claim at the 5% significance 
level. 


T11.12 A study conducted in Charlotte, North Carolina, tested the 
effectiveness of three police responses to spouse abuse: (1) advise 
and possibly separate the couple, (2) issue a citation to the 
offender, and (3) arrest the offender. Police officers were trained to 
recognize eligible cases. When presented with an eligible case, a 
police officer called the dispatcher, who would randomly assign 
one of the three available treatments to be administered. There 
were a total of 650 cases in the study. Each case was classified 
according to whether the abuser was arrested within 6 months of 
the original incident.22 


Police response 


Advise | Citation | Arrest Total 
and 
Subsequen separate 
arrest? No ve 
Yes ‘07 
Total rs 


a. Explain the purpose of the random assignment in the design of 
this study. 
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b. State an appropriate pair of hypotheses for performing a chi- 
square test in this setting. 


c. Assume that all the conditions for performing the test in part (b) 
are met. The test yields y2=5.063X" = 5.063 and a P-value of 
0.0796. Interpret this P-value. 


d. What conclusion should we draw from the study? 


T11.13 In the United States, there is a strong relationship between 
education and smoking: well-educated people are less likely to 
smoke. Does a similar relationship hold in France? To find out, 
researchers recorded the level of education and smoking status of a 
random sample of 459 French men aged 20 to 60 years.28 The two- 
way table displays the data. 


Education 


Smoking 
status? 


Is there convincing evidence of an association between smoking 
status and educational level among French men aged 20 to 60 
years? 
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More About 


Regression 


Andrey Gudkov/Alamy 


Introduction 
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Cumulative AP® Practice Test 4 
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INTRODUCTION 


When a scatterplot shows a linear relationship between a quantitative 
explanatory variable x# and a quantitative response variable y,4¥» we can 
use the least-squares line calculated from the data to predict y¥ for a given 
value of x.@. If the data are a random sample from a larger population, we 
use Statistical inference to answer questions like these: 


e Is there really a linear relationship between x# and y¥ in the population, 
or is it plausible that the pattern we see in the scatterplot happened by 
chance alone? 


e Inthe population, how much will the predicted value of y¥ change for 
each increase of 1 unit in x?2? What’s the margin of error for this 
estimate? 


If the data come from a randomized experiment, the values of the 
explanatory variable correspond to the levels of some factor that is being 
manipulated by the researchers. For instance, malaria researchers might 
want to investigate how temperature affects the life span of mosquitoes. 
They could set up several tanks at each of several different temperatures 
and then randomly assign hundreds of mosquitoes to each of the tanks. 
The response variable of interest is the time (in days) from hatching to 
death. Suppose that a scatterplot of average life span versus temperature 
has a linear form. We use statistical inference to decide if these data 
provide convincing evidence that changes in temperature cause changes in 
life span. 


It is conventional to refer to a scatterplot of the points (x, y) (2 y) as a graph of y¥ 


versus x.£- So a scatterplot of life span versus temperature uses life span as the 


response variable and temperature as the explanatory variable. 


In Section 12.1, we will show you how to estimate and test claims 
about the slope of the population (true) regression line. Before you 
perform inference about a slope, you should understand the sampling 
distribution of the sample slope. The following activity gets you started. 
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ACTIVITY Sampling from Old Faithful 


In Chapter 7, you learned about the sampling distribution of a sample 
proportion p/? and the sampling distribution of a sample mean x .#- In 
this activity, you will explore the sampling distribution of the sample 
slope b191 when calculating least-squares regression lines from random 
samples. 


Benny Marty/Shutterstock 


Old Faithful geyser is one of the most popular attractions in 
Yellowstone National Park. As you saw in Chapter 3, it is possible to 
use a least-squares regression line to predict y=¥ ~ interval of time (in 
minutes) until the next eruption from x=£ = (in minutes) of an eruption. 
In one particular month, Old Faithful erupted 263 times. Your teacher 
has printed out this population on 263 cards. Each card gives the 
duration of a particular eruption on one side and the interval to the next 
eruption on the other side. 


1. Form teams of 2 or 3 students. 


2. Have one student from each team select a random sample of 15 cards 
from the population and return to the team. This student should read 
aloud the duration (x)(@ ) and interval (x)(# ) values for each eruption 
while the remaining team members enter these values into their 
calculators (or other technology). Replace the 15 cards in the 
population. 


3. After each team has recorded the values for its 15 randomly selected 
eruptions, the team should calculate the sample least-squares 


regression line for its data. Record the value of the sample slope b1. 
bj. 
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4. Your teacher will prepare an axis for a dotplot to display the 
distribution of the sample slope. Add the value you calculated in Step 
3 to this dotplot, using the symbol “b151” instead of a dot. Repeat 
Steps 2—4 as necessary to get at least 20 sample slopes. 


5. Asa class, describe this simulated sampling distribution. Remember 
to discuss shape, center, and variability. 


6. What do you think would happen to the simulated sampling 
distribution if the sample size was increased from 15 to 50? Discuss 
as a Class. 


Sometimes a scatterplot reveals that the relationship between two 
quantitative variables has a curved form. One strategy is to transform one 
or both variables so that the graph shows a linear pattern. Then we can use 
least-squares regression to fit a linear model to the data. Section 12.2 
examines methods of transforming data to achieve linearity. 
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SECTION 12.1 Inference for Linear Regression 


LEARNING TARGETS By the end of the section, you should be able to: 


e Check the conditions for performing inference about the slope B11 
of the population (true) regression line. 


Interpret the values of bO, b1, 5,50; bi, 8, and SEb1> Ph: in context, 


and determine these values from computer output. 


Construct and interpret a confidence interval for the slope B1°1 of 
the population (true) regression line. 


Perform a significance test about the slope 81°. of the population 
(true) regression line. 


Figure 12.1 shows the relationship between x=durationt = duration (in 
minutes) of an eruption and y=interval of timeY = interval of time (jn 
minutes) until the next eruption for all 263 eruptions of the Old Faithful 
geyser during a particular month. The least-squares regression line is also 
shown. 


Interval until next eruption (min) 


10 15 20 25 30 35 40 45 50 
Duration (min) 


FIGURE 12.1 Scatterplot of the duration and interval between eruptions of Old 
Faithful for all 263 eruptions in a single month. The population least-squares line is 


shown in blue. 


Because the scatterplot includes all the eruptions in a particular month, 
the least-squares regression line is called a population regression line (or 
true regression line). In most cases, we don’t have data from the entire 
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population, so we use the sample regression line (or estimated regression 
line) to estimate the population regression line. 


DEFINITION Population regression line, Sample 
regression line 


A regression line calculated from every value in the population is 
called a population regression line (true regression line). The 
equation of a population regression line is p=B0+B1x4 — Bo + Pix 
where 


e yyy is the mean yY-value for a given value of x.2. 
e 80% is the population y¥ intercept. 
e §1°1 is the population slope. 


A regression line calculated from a sample is called a sample 
regression line (estimated regression line). The equation of a sample 
regression line is y\=b0+b1xY = 50 + 12 where 


e y/¥ is the estimated mean y¥-value for a given value of x.z. 
e b0°0 is the sample yY intercept. 
e b11 is the sample slope. 


Note that the symbols p00 and B1 Pr here refer to the intercept and slope of the 


population regression line. They are in no way related to the probability of a Type II 


error, which is sometimes designated by the symbol pe (the Greek letter “beta”). 


In Chapter 3, we interpreted y/Y as the predicted value of y¥ for a 
given value of x.£. That interpretation is still valid. What if we want to use 
the sample regression line to make an inference about the population 
regression line p=B0+B1x?4 — Bo + P12? Now we can think of y\¥ as 
being an estimate for ry“Y the mean value of y¥ for all individuals in the 
population with the given value of x.@. 


How does the slope of the sample regression line b1®1 relate to the 
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slope of the population regression line B1?1° To find out, we’ll learn 
about the sampling distribution of the sample slope. 


Sampling Distribution of b1" 


Confidence intervals and significance tests about the slope of the 
population regression line are based on the sampling distribution of b1)1 
the slope of the sample regression line. Figure 12.2 shows the results of 
taking three different SRSs of 15 Old Faithful eruptions from the 
population described earlier. Each graph displays the selected points and 


the least-squares regression line for that sample (in green). The population 
(4, = 33.35 + 13.292) 


is also shown (in 


regression line (wy=33.35+13.29x) 
blue). 


eruption (min) 
eruption (min) 


Interval until next 
eruption (min) 
Interval until next 
Interval until next 


T T T T T T T T T T T T T T T T T T T T T T T T 
15 2.0 25 3.0 3.5 4.0 4.5 5.0 15 2.0 25 3.0 3.5 4.0 4.5 5.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 


Duration (min) Duration (min) Duration (min) 


Sample 1: } = 44 + 10.0x Sample 2: } = 39 + 12.5x Sample 3: } = 24 + 15.7x 
FIGURE 12.2 Scatterplots and least-squares regression lines (in green) for three 
different SRSs of 15 Old Faithful eruptions, along with the population regression 
line (in blue). 


Notice that the slopes of the sample regression lines (b1=10.0, 
b1=12.5, and b1=15.7)(01 = 10.0, b = 12.5, andb; = 15.7) vary quite a bit 
from the slope of the population regression line, B1=13.29°1 = 18.29 The 
pattern of variation in the sample slope b1°1 is described by its sampling 
distribution. 

To get a better picture of this variation, we used technology to simulate 
choosing 1000 SRSs of n=15n = 15 points from the Old Faithful data, 
each time calculating the sample regression line y\=b0+b1x.Y = bo + biz. 
Figure 12.3 displays the values of the slope b151 for the 1000 sample 
regression lines. We have added a vertical line (in blue) at 13.29 
corresponding to the slope of the population regression line B1,71- 
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Simulated sample slope 5; 


FIGURE 12.3 Dotplot of the sample slope b1 


by 


regression line in 1000 simulated SRSs of n=15 


) ae RS 


population slope (13.29) is marked with a blue vertical line. 


of the least-squares 


eruptions. The 
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Let’s describe this simulated sampling distribution of b1.1- 


Shape: We can see that the distribution of b1°1-values is roughly 
symmetric and single peaked. Figure 12.4(a) is a Normal probability 
plot of these sample regression line slopes. The strong linear pattern in 
the graph tells us that the simulated sampling distribution of b1°1 is 
close to Normal. 


Center: The mean of the 1000 b11-values is 13.34. This value is quite 
close to the slope of the population (true) regression line, 13.29. 


Variability: The standard deviation of the 1000 b1°1-values is 1.40. We 
will soon see that the standard deviation of the sampling distribution of 
b141 is actually 1.42. 


Expected z score 
\ 


8 10 12 14 16 18 8 10 12 14 16 18 
(a) Simulated sample slope 5, (b) Simulated sample slope 5, 


FIGURE 12.4 (a) Normal probability plot and (b) histogram of the 1000 sample 
regression line slopes from Figure 12.3. The blue density curve in Figure 12.4(b) is 
for a Normal distribution with mean 13.29 and standard deviation 1.42. 


Figure 12.4(b) is a histogram of the b1°1-values from the 1000 
simulated SRSs. We have superimposed the density curve for a Normal 
distribution with mean 13.29 and standard deviation 1.42. This curve 
models the approximate sampling distribution of the slope quite well. 

Let’s do a quick recap. For all 263 eruptions of Old Faithful in a single 

month, the population regression line is py=33.35+13.29x. 
Hy = 33.35 + 13.292. wre yse the symbols B0=33.3540 = 33.35 and 
B1=13.29°1 = 13.29 to represent the y¥ intercept and slope parameters. 
The standard deviation of the residuals for this line is the parameter 
0=6.47.0 = 6.47. 

Figure 12.4(b) shows the approximate sampling distribution of the 
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slope b141 of the sample regression line for samples of 15 eruptions. If we 
take all possible SRSs of size n=15n = 15 from this population, we get the 
sampling distribution of b1.91- Can you guess its shape, center, and 
variability? 


Shape: Approximately Normal 


Center: pb1=81=13.29"% ~ By = 13.29 (b1°1 is an unbiased estimator 
of B1.°1-) 


Sars o, = —— = “4 = 142, 
Variability: ob1=oo0xn=6.471.1815=1.42, oryn 1.18/15 


where ox? is the standard deviation of duration for the 263 eruptions. 


We interpret ob17": just like any other standard deviation: the slopes 
of the sample regression lines typically differ from the slope of the 
population regression line by about 1.42. 

Here’s a summary of the important facts about the sampling 
distribution of b1.1- 


SAMPLING DISTRIBUTION OF A SLOPE 


Choose an SRS of n” observations (x, y)(@ >Y) froma population of 
size NN with least-squares regression line 


wy=B0+B1xMy = Bo t Px 
Let b11 be the slope of the sample regression line. Then: 


e The mean of the sampling distribution of b11 is pb1=B1. = 91: 


e The standard deviation of the sampling distribution of b1°1 is 


Co 
Th, = 


_ 
obl=ooxn Oxryn 


as long as the 10% condition is satisfied: n<0.10 N.n < 0.10 N. 


e The sampling distribution of b1°1 will be approximately Normal if 
the values of the response variable y¥ follow a Normal distribution 
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| for each value of the explanatory variable xz (the Normal condition). | 


We’ll say more about the Normal condition in a moment. 


Think About It 


WHAT’S WITH THAT FORMULA FOR ob1?7%: * Three factors 
affect the standard deviation of the sampling distribution of b1:1 : 


1. o,%> the standard deviation of the residuals for the population 
regression line. Because o@ is in the numerator of the formula, when 
o@ is larger, so is ob1.7%* When the points vary more from the 
population (true) regression line, we should expect more variability in 
the slopes b141 of sample regression lines from repeated random 
sampling or random assignment. 


2. ox,7%«» the standard deviation of the explanatory variable. Because ox 
7x is in the denominator of the formula, when ox®= is larger, ob17% 
is smaller. More variability in the values of the explanatory variable 
leads to a more precise estimate of the slope of the true regression 
line. 


3. n,’: the sample size. As with every other formula for the standard 
deviation of a statistic, the variability of the statistic gets smaller as 
the sample size increases. A larger sample size will lead to a more 
precise estimate of the true slope. 


Conditions for Regression Inference 


We can fit a least-squares line to any data relating two quantitative 
variables, but the results are useful only if the scatterplot shows a linear 
pattern. Inference about regression involves more detailed conditions. 
Figure 12.5 shows the regression model in picture form when the 
conditions are met. 
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For any fixed x, the responses y 
follow a Normal distribution 
with standard deviation o. 


FIGURE 12.5 The regression model when the conditions for inference are met. 
The line is the population (true) regression line, which shows how the mean 


response [ly 
changes as the explanatory variable x 
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changes. For any 
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fixed value of x, the 
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observed response y 
varies according to a Normal distribution having mean pty 


1696 


and standard 


deviation o. 


Shape: For each possible value of the explanatory variable x,*> the 
values of the response variable y¥ follow a Normal distribution. 


Center: For each possible value of the explanatory variable x,”> the 
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mean value of the response variable py“ falls on the population (true) 
regression line py=B0+B1x.4y ~ So + Pi. Because the regression line 
goes through the mean value of y¥ for each value of x,» we can 
interpret the slope as the change in the average value of y¥ for each 1- 
unit increase in x.@. 


Variability: For each possible value of the explanatory variable x,”» the 
values of the response variable y¥ have the same standard deviation 0.7. 


Consider the population of all eruptions of the Old Faithful geyser in a 


given year. For each eruption, let x@ be the duration (in minutes) and y¥ 
be the interval of time (in minutes) until the next eruption. Suppose that 
the conditions for regression inference are met, the population regression 


line is Wwy=34+13x,My 


= 34+ 132, and the variability around the line is 


given by o=6.0 = 6. 


Let’s focus on the eruptions that lasted x=2% = 2 minutes. For this 


“subpopulation”: 


The average amount of time until the next eruption is py=34+13(2)=60 


minutes.4y = 34+ 18 (2) = 60 minutes. 


The amounts of time until the next eruption follow a Normal 
distribution with mean 60 minutes and standard deviation 6 minutes. 


For about 95% of these eruptions, the amount of time y¥ until the next 
eruption is between 60-2(6)=48 minutes®? — 2(6) = 48 minutes and 
60+2(6)=72 minutes. + 2(6) = 72 minutes. That is, if the previous 
eruption lasted 2 minutes, 95% of the time the next eruption will occur 
in 48 to 72 minutes. 


__— My = 34+ 13x 
Interval 
(min) 


Duration (min) 
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Here are the conditions for performing inference about the linear 
regression model. The acronym LINER should help you remember them! 


CONDITIONS FOR REGRESSION INFERENCE 


Suppose we have n? observations on a quantitative explanatory 
variable x® and a quantitative response variable y.¥- Our goal is to 
study or predict the behavior of y¥ for given values of x.@. 


e Linear: The actual relationship between x# and y¥ is linear. For any 
fixed value of x,”> the mean response py” falls on the population 


(true) regression line py=B0+B1x.4y ~ Bo + Piz. 


Independent: Individual observations are independent of each other. 
When sampling without replacement, check the 10% condition. 


Normal: For any fixed value of x,”> the response y¥ varies 
according to a Normal distribution. 


Equal SD: The standard deviation of y¥ (call it 07) is the same for 
all values of x.#. 


Random: The data come from a random sample from the population 
of interest or a randomized experiment. 


Although the conditions for regression inference are a bit complicated, 
it is not hard to check for major violations. Most of the conditions involve 
the population (true) regression line and the deviations of responses from 
this line. We usually can’t observe the population line, but the sample 
regression line estimates it. The residuals from the sample regression line 
estimate the deviations from the population line. We can check several of 
the conditions for regression inference by looking at graphs of the 
residuals. Start by making a residual plot and a histogram, dotplot, 
stemplot, boxplot, or Normal probability plot of the residuals. 


Here’s a summary of how to check the conditions one by one. 


e Linear: Examine the scatterplot to see if the overall pattern is roughly 
linear. Make sure there are no leftover curved patterns in the residual 
plot. 
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Good: Scatterplot has a linear form. Bad: Residual plot shows a curved pattern. 
45,000 4 100 4 


. 
40,000 . 
<a i 
35,000 4 ~~ 8 - i 
= ° 
° 
30,000 - 


25,000 4 


Price ($) 


20,000 4 


Residual ($1000) 
t¢ 


15,000 4 


10,000 4 


5000 + T T T T T T T 
0 40,000 80,000 120,000 160,000 
Miles driven Carat weight 


Independent: Knowing the value of the response variable for one 
individual shouldn’t help predict the value of the response variable for 
other individuals. If sampling is done without replacement, remember to 
check that the sample size is less than 10% of the population size (10% 
condition). There are also other issues that can lead to a lack of 
independence. One example is measuring the same variable over time, 
yielding what is known as time-series data. Knowing that a young girl’s 
height at age 6 is 48 inches would definitely give you additional 
information about her height at age 7. We will avoid doing inference for 
time-series data in this course. 


Normal: Make a histogram, dotplot, stemplot, boxplot, or Normal 
probability plot of the residuals and check for strong skewness or 
outliers. Ideally, we would check the Normality of the residuals at each 
possible value of x.«. Because we rarely have enough observations at 
each x#-value, however, we make one graph of all the residuals to 
check for Normality. 


Equal SD: Look at the scatter of the residuals above and below the 
“residual=0” “residual = 0” line in the residual plot. The variability of 
the residuals in the vertical direction should be roughly the same from 
the smallest to the largest x#-value. 


1700 


Good: Residuals have roughly equal variability 


Bad: The response variable y has greater variability for 
at all x-values in the data set. 


larger values of the explanatory variable x. 


5000 4 “ 2] 


Residual 
co 
Residual 


-50005 * 


—10,000 = ; 1 
0 40,000 


T T 


—4 
T T T T 
80,000 120,000 160,000 0 2 4 6 8 


Miles driven 


e Random: See if the data came from a random sample from the 
population of interest or a randomized experiment. If not, we can’t make 
inferences about a larger population or about cause and effect. 


Let’s look at an example that illustrates the process of checking 
conditions. 


EXAMPLE |The helicopter experiment » 
Checking conditions 


Fotosearch 


PROBLEM: Mrs. Barrett’s class did a fun experiment using paper 
helicopters. After making 70 helicopters using the same template, 
students randomly assigned 14 helicopters to each of five drop heights: 
152 centimeters (cm), 203 cm, 254 cm, 307 cm, and 442 cm. Teams of 
students released the 70 helicopters in a random order and measured the 
flight times in seconds. The class used computer software to carry out a 
least-squares regression analysis for these data. Here are a scatterplot, 
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residual plot, and histogram of the residuals. 


3.0 a 0.50 
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Check whether the conditions for performing inference about the 
regression model are met. 


SOLUTION: 


Use the LINER acronym! 


e Linear: The scatterplot shows a clear linear form and there is no 
leftover curved pattern in the residual plot. “ 


Although there are “stacks” in the residual plot, the residuals are 


centered on the horizontal line at 0 for each drop height used in the 


experiment. 


e Independent: Because the helicopters were released in a random order 
and no helicopter was used twice, knowing the result of one 
observation should not help us predict the value of another 
observation. / 


e Normal: There is no strong skewness or outliers in the histogram of 
the residuals. / 


Note that we do not have to check the 10% condition here because 


there was no random sampling. 


e Equal SD: The residual plot shows a similar amount of scatter about 
the residual=Oresidual = 0 line for each drop height. However, flight 
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times seem to vary a little more for the helicopters that were dropped 
from a height of 307 cm. ¥ 


e Random: The helicopters were randomly assigned to the five possible 
drop heights. ¥ 


FOR PRACTICE, TRY EXERCISE 5 


You will always see some irregularity when you look for Normality 
and equal standard deviation in the residuals, especially when you have 
few observations. oo Don’t overreact to minor issues in the graphs 


when checking the Normal and Equal SD conditions. 


Estimating the Parameters 


When the conditions are met, we can do inference about the regression 
model py=B0+B1x.4v = 90 + P1*- The first step is to estimate the 
unknown parameters. If we calculate the sample regression line 
y\=b0+b1x,Y = bo + 612, the sample slope b141 is an unbiased estimator 
of the true slope B1°1 and the sample y¥ intercept b0°0 is an unbiased 
estimator of the true y¥ intercept 80.50: The remaining parameter is the 
standard deviation 0,7» which describes the variability of the response y¥ 
about the population (true) regression line. 


AP® EXAM TIP 


We use the same notation as the AP® Statistics exam formula sheet for 
the equation of the sample regression line yA=b0+b1x.¥ = bo + by x. 


However, your graphing calculator probably uses the notation yA=a+bx. 
y =a-+ be. Just remember: The slope is always the coefficient of x,” 
no matter what symbol is used. 


The least-squares regression line computed from the sample data 
estimates the population (true) regression line. So the residuals estimate 
how much y¥ varies about the population line. Because o@ is the standard 
deviation of responses about the population (true) regression line, we 
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estimate it with the standard deviation of the residuals s:s : 
s=)‘residuals2n—2=)'(yi-y/i)2n-2 


| > residuals 


en a 


Recall from Chapter 3 that ss describes the size of a “typical” prediction 
error. Because s§ is estimated from data, it is sometimes called the 
regression standard error or the root mean squared error. 

It is possible to do inference about any of the three parameters in the 
regression model: BO, B10: 41 or 0.7. However, the slope B1°1 of the 
population (true) regression line is usually the most important parameter in 
a regression problem. So we’ ll restrict our attention to inference about the 
slope. 

When the conditions are met, the sampling distribution of the slope b1 
b1 is approximately Normal with mean pb1=61/ = 1 and standard 
deviation 

Co 


ob = = 
ob1=ooxn oryn 


In practice, we don’t know o@ for the true regression line. So we estimate 
it with the standard deviation of the residuals, s.s- We also don’t know the 
standard deviation ox%« for the population of x#-values. (For reasons 
beyond the scope of this text, we replace the denominator with sxn-1. 

Sz vn—1.) So we estimate the variability of the sampling distribution of b 
b with the standard error of the slope 


8 


i, =—_— 
SEb1=ssxn-1 srVn—-l 
This standard error has the same interpretation as the standard error of 
x © or the standard error of pr.P: It measures how far our estimate 
typically varies from the truth. In this case, it measures how far the sample 
slope typically varies from the population (true) slope if we repeat the data 
production process many times. 


Although we give the formula for the standard error of b11 you 
should rarely have to calculate it by hand. Computer output gives the 
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standard error SEb192h: immediately to the right of the sample slope b1. 
bj. 


AP® EXAM TIP 


The AP® Statistics exam formula sheet gives the formula for the 
standard error of the slope as 


sb1=)\(yi-y/i)2n-2) (xi-x )2 


The numerator is just a fancy way of writing the standard deviation of 
the residuals s. Can you show that the denominator of this formula is 
the same as ours? 


EXAMPLE |The helicopter experiment, part 2 » 


Estimating the parameters 


istockphoto 


PROBLEM: Earlier, Mrs. Barrett’s class used computer software to 
perform a least-squares regression analysis of their helicopter data. 
Recall that the data came from dropping 70 paper helicopters from 
various heights (in inches) and measuring the flight times (in seconds). 
Some output from this regression analysis is shown here. We checked 
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conditions for performing inference earlier. 


Regression Analysis: Flight time versus Drop height 


Predictor Coef SE Coef T P 

Constant —0.03761 0.05838 -0.64 0.522 
Drop height (cm) 0.0057244  0.0002018 28.37 0.000 
S = 0.168181 R-Sq = 92.2% R-Sq(adj) = 92.1% 


. What is the estimate for 8020’ Interpret this value. 


jab) 


b. What is the estimate for 8121" Interpret this value. 
c. What is the estimate for o?¢? Interpret this value. 


d. Give the standard error of the slope SEb1.9 2H: - Interpret this value. 


SOLUTION: 


a. b0=-0.03761;00 = —9-03761; if a helicopter is dropped from 0 cm, it 
will take -0.03761~—9-93761 second to land, on average. 


A negative flight time is clearly unrealistic and is likely due to 


extrapolating to x=02 = 0 from the data in the experiment. 


b. b1=0.0057244;51 = 0.0057244; for each increase of 1 cm in drop 
height, the average flight time increases by 0.0057244 second. 


c. s=0.168181;5 = 9-168181; the actual flight times typically vary by 
about 0.168181 second from the times predicted with the least- 


squares regression line using x=drop height. = drop height. 


Remember that the slope and y¥ intercept describe average flight 


times, not the actual flight times. You could also describe the slope 


as the increase in predicted flight time for each increase of 1 cm in 


drop height. 
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d. SEb1=0.0002018;9 ~": = 0.0002018; 


assignment many times, the slope of the sample regression line 


if we repeated the random 


would typically vary by about 0.0002018 from the slope of the true 
regression line for predicting flight time from drop height. 


The standard error of the slope is just to the right of the slope, in the 


column called “SE Coef.” 


FOR PRACTICE, TRY EXERCISE 7 


When we compute the least-squares regression line based on a random 
sample of data, we can think about doing inference for the population 
regression line. When our least-squares regression line is based on data 
from a randomized experiment, as in the helicopter example, the resulting 
inference is about the true regression line relating the explanatory and 
response variables. From now on, we’Il use the term population regression 
line in sampling situations and the term true regression line when 
describing experiments. 


Constructing a Confidence Interval for the Slope 


In a regression setting, we often want to estimate the slope B1°1 of the 
population (true) regression line. The slope b1°1 of the sample regression 
line is our point estimate for B1.°1- A confidence interval is more useful 
than the point estimate because it gives a set of plausible values for B1./1- 


The confidence interval for B1°1 has the familiar form 


Statistic+(critical value) - (standard deviation of statistic) 
statistic + (critical value) - (standard deviation of statistic) 


We use the statistic b1°1 as our point estimate and SEb1>Eh: to 
estimate the standard deviation of the statistic. Because we don’t know the 
true standard deviation ob1,7"* we must use a t*t* critical value rather 
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than a z*2* critical value. Get the t*t* critical value from a té distribution 
with n-2" — 2 degrees of freedom. (The explanation of why df=n-2 
df = n — 2 is beyond the scope of this book.) Here is the formula: 


b1itkSEb121 + & SEs, 


We call this a té interval for the slope. Here are the details. 


tt INTERVAL FOR THE SLOPE 


When the conditions are met, a C% confidence interval for the 
unknown slope B11 of the population (true) regression line is 


b1lit*SEb121 + & SEs, 


where t*¢* is the critical value for the tt distribution with n-2” — 2 
degrees of freedom and C% of the area between -t*~?” and t*.t*. 


rl The values of té given in the computer regression output are 


not the critical values for a confidence interval. They come from 
carrying out a significance test about the y¥ intercept or slope of the 
population (true) regression line. We’ll discuss tests in more detail shortly. 


You can find a confidence interval for the y¥ intercept B00 of the 
population (true) regression line in the same way, using b0% and SEbO 
SE) from the “Constant” row of the computer output. However, we are 
usually interested only in the point estimate for B00 that’s provided in the 


output. 


Here is an example using a familiar context that illustrates the four- 
step process for calculating and interpreting a confidence interval for the 
slope. 


EXAMPLE |How much is that truck worth? (») fi 


4 STEP 
Confidence interval for a slope 
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Oleksiy Maksymenko/imageBROKER/Newscom 


PROBLEM: Everyone knows that cars and trucks lose value the more 
they are driven. Can we predict the price of a used Ford F-150 
SuperCrew 4x44 4 if we know how many miles it has on the 
odometer? A random sample of 16 used Ford F-150 SuperCrew 4x4s 

4 x 4s was selected from among those listed for sale on autotrader.com. 
The number of miles driven and price (in dollars) were recorded for 
each of the trucks. Here are the data: 


Miles 70,583 129,484 29,932 29,953 24,495 75,678 8359 4447 
driven 

Price 21,994 9500 29,875 41,995 41,995 28,986 31,891 37,991 
(S) 

Miles 34,077 58,023 44,447 68,474 144,162 140,776 29,397 131,385 
driven 

Price 34,995 29,988 22,896 33,961 16,883 20,897 27,495 13,997 
(S) 


Here is some computer output from a least-squares regression analysis 
of these data. Construct and interpret a 90% confidence interval for the 
slope of the population regression line. 


45,000 4 10,000 + 
ee 


40,000 


35,000 5000 + 


. 
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Frequency 


-8000 —4000 0 4000 8000 
Residual ($) 


Regression Analysis: Price ($) versus Miles 
driven 

Predictor Coef SECoef T P 

Constant 38257 2446 15.640.000 


Miles —0.16292 0.03096 —5.260.000 
driven 
S= R-Sq = R-Sq(adj) = 64.0% 
5740.13 66.4% 

SOLUTION: 


Follow the four-step process! 


STATE: 90% CI for 81=the slope of the population regression 
6; = the slope of the population regression ine relating y=priceY = Price 


to x=milest = miles driven for used Ford F-150 SuperCrew 4x4s4 x 4s 
listed for sale on autotrader.com. 
PLAN: tt interval for the slope. 


Remember to use the acronym LINER when checking the conditions. 


Linear: The scatterplot shows a clear linear pattern. Also, the residual 
plot shows no leftover curved patterns. / 

Independent: Assume that 16<10%16 < 10% of all used Ford F-150 
SuperCrew 4x4s.4 x 4s. / 

Normal: There is no strong skewness or outliers in the histogram of 
residuals. ¥ 
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Equal SD: The scatter of points around the residual=Oresidual = 0 line 
appears to be about the same at all x-values. / 


Random: Random sample of 16 used Ford F-150 SuperCrew 4x4s. 
4x4s.f 


b1tt*SEb11 + # SEs, 


DO: With 


df=16-2=14,t*=1.761.—0.16292+1.761(0.03096)=—0.16292+0.05452= 
(-0.21744,-0.10840) 
df = 16-—2=14,t* = 1.761. — 0.16292 + 1.761 (0.03096) 


— —0.16292 + 0.05452 
= (0.21744, —0.10840) 


Refer to Technology Comer 30 for details on how to do calculations for 
at interval for slope. The calculator gives (—0.2173,—0.1084) 


(—0.2173, — 0.1084) using af=14.df = 14. 


CONCLUDE: We are 90% confident that the interval from -0.2174 

0.2174 to -0.1084—0-1084 captures the slope of the population 
regression line relating y=priceY = PTice to x=milest = miles driven for 
used Ford F-150 SuperCrew 4x4s4 = 4s listed for sale on 
autotrader.com. 


This means we are 90% confident that the average price of a used Ford 


F-150 goes down between $0.1084 and $0.2174 per mile. 


FOR PRACTICE, TRY EXERCISE 11 


The change in average price of a used Ford F-150 is quite small for a 1- 
mile increase in miles driven. What if miles driven increased by 1000 
miles? We can just multiply both endpoints of the confidence interval in 
the example by 1000 to get a 90% confidence interval for the 
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corresponding change in average price. The resulting interval is 
(-217.44,-108.40).(—217.44, —108.40). That is, we are 90% confident that 


the average price of a used Ford F-150 will decrease between $108.40 and 
$217.44 for every additional 1000 miles driven. 


So far, we have used computer regression output when performing 
inference about the slope of a population (true) regression line. The TI-84 
and other calculators can do the calculations for inference when the sample 
data are provided. 


30. TECHNOLOGY CORNER | CONSTRUCTING A 
CONFIDENCE INTERVAL FOR SLOPE 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 


Let’s use the data from the preceding example to construct a confidence 
interval for the slope of a population (true) regression line on the TI-84. 
Note: The TI-83 and older operating systems for the TI-84 do not 
include this option. 


LinResgTInt 


Calculate 


ORMAL FLOAT AUTO REAL RADIAN CL o 


LinRegTInt 
y=atbx 
(-.2173, -.1084) 
b= ~. 1628114837 
df=14 
s=5737.55499 
a=38254. 8639 
r2=, 664549225 
r=~-,. 8151988868 


e Enter the x#-values (miles driven) into L1 and the y¥-values (price) 
into L2. 


e Press [STAT] then choose TESTS and LinRegTInt.... 
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e In the LinRegTInt screen, adjust the inputs as shown. Then highlight 
“Calculate” and press [ENTER|, 


e The linear regression t? interval results are shown here. Note that ss 
is the standard deviation of the residuals, not the standard error of the 
slope. 


AP® EXAM TIP 


When you see a list of data values on an exam question, wait a moment 
before typing the data into your calculator. Read the question through 
first. Often, information is provided that makes it unnecessary for you 


to enter the data at all. This can save you valuable time on the AP® 
Statistics exam. 


CHECK YOUR UNDERSTANDING 


. 


y Many people believe that students learn better if they sit closer to 
the front of the classroom. Does sitting closer cause higher achievement, 
or do better students simply choose to sit in the front? To investigate, a 
Statistics teacher randomly assigned students to seat locations in the 
classroom for a particular chapter and recorded the test score for each 
student at the end of the chapter. Here are the data, along with output 
from a regression analysis. Construct and interpret a 95% confidence 
interval for the slope of the true regression line. 


Row 1 if 1 1 2 2 Z 2 
Score 
Row 
Score 
Row 
Score 
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Test score 
Residual 
Co 
7 
° 


T T T T rs 
3 as 5 6 7 


Row number Row number 


N= 


20 -15 -10 5 0O 5 10 15 20 


Residual 
Term Coef SECoef T-Value P-Value 
Constant 85.71 4.24 20.22 0.000 
Row = (117 0.947 =e 0.248 
S = 10.0673 R-sq = 4.73% R-sq(adj) = 


1.33% 


Performing a Significance Test for the Slope 


When the conditions for inference are met, we can use the slope b151 of 
the sample regression line to construct a confidence interval for the slope 
81° of the population (true) regression line. We can also perform a 
significance test to determine whether a specified value of B11 is 
plausible. The null hypothesis has the general form H0:$1=hypothesized 
slope.o : 8; = hypothesized slope. To do a test, calculate the standardized 
test statistic: 


standardized test statistic=statistic - parameterstandard deviation of 
Statistict=b1 — hypothesized slopeSEb1 


; a statistic — parameter 
standardized test statistic = 


standard deviation of statistic 
b; — hypothesized slope 


SEy, 


t —— 


To find the PP-value, use a tt distribution with n-2” — 2 degrees of 
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freedom. Here are the details for the t test for the slope. 


tt TEST FOR THE SLOPE 


Suppose the conditions are met. To perform a test of the hypothesis 
HO0:B1=hypothesized slope,/0 : 41 = hypothesized slope, compute the 
standardized test statistic 
b; —hypothesized slope 

E : - SEp 

t=b1—-hypothesized slopeSEb1 1 
Find the PP-value by calculating the probability of getting a tt statistic 
this large or larger in the direction specified by the alternative 
hypothesis Hao ina tt distribution with df=n—2.df = n — 2. 


If sample data suggest a linear relationship between two variables, how 
can we determine whether this happened just by chance or whether there is 
actually a linear association between x# and yY in the population? By 
performing a test of HO:81=0./40 : 1 = 9. A regression line with slope 0 
is horizontal. That is, the mean of y¥ does not change at all when xz 
changes. So HO: B1=0M0 : A = 0 says that there is no linear association 
between x and y¥Y in the population. Put another way, HO says that 
linear regression ofy¥Yonx@is no better for predictingy¥than the mean of 
the response variabley .¥: 


Does a null hypothesis of “no association” sound familiar? It should. In Chapter 11, 


you learned about chi-square tests for independence, which analyze the relationship 
between two categorical variables. The t test for slope looks for an association 


between two quantitative variables. 


Regression output from statistical software usually gives the value of t¢ 
for a test of HO:81=0.40 : 1 = 9. If you want to test a null hypothesis 
other than H0:81=0,/40 : 91 = 9, get the slope and standard error from the 
output and use the formula to calculate the té statistic. 

Computer software also reports the PP-value for a two-sided test of 
H0:81=0.4 0: 1 = 9. For a one-sided test in the proper direction, just 
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divide the PP-value in the output by 2. The following example shows 
what we mean. 


EXAMPLE |Crying and IQ (») fi. 
4 STEP 


Significance test for B 


=? ~~ —- 
H. Armstrong Roberts/ClassicStock 
/Archive Photos/Getty Images 


PROBLEM: Infants who cry easily may be more easily stimulated than 
others. This may be a sign of higher IQ. Child development researchers 
explored the relationship between the crying of infants 4 to 10 days old 
and their later IQ test scores. A snap of a rubber band on the sole of the 
foot caused the infants to cry. The researchers recorded the crying and 
measured its intensity by the number of peaks in the most active 20 
seconds. They later measured the children’s IQ at age three years using 
the Stanford—Binet IQ test. The table contains data from a random 
sample of 38 infants.4 


Crycount IQ | Crycount IQ 
20 90 17 94 


Crycount IQ Crycount IQ 


10 87 12 94 
12 97 16 100 19 103 12 103 

9 103 23 103 13 104 14 106 
16 106 ay 108 18 109 10 109 
18 109 15 112 18 112 20 113 
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ES) 114 21 114 16 118 fe) 119 


12 119 12 20 19 120 16 124 
20 132 15 133 22 135 31 135 
16 136 17 141 30 155 22 157 
8) 159 13 162 


Here is computer output from a least-squares regression analysis of 
these data. Do these data provide convincing evidence at the a=0.05 

a = 0.05 level of a positive linear relationship between count of crying 
peaks and IQ in the population of infants? 


180 
170 
160 
150 
140 
130 
120 
110 


se es Cs es ee ee 


IQ at age three years 
Residual 


100 
-10 4 
90 
80 4 20 
70 -30 4 
Qe ot tot T T T T T T T 
0 5 10 15 20 25 30 35 40 10 15 202-25 30 35 40 -20 0 20 40 60 
Count of crying peaks Count of crying peaks Residual 


Regression Analysis: IQ versus Crycount 
Predictor Coef SE Coef T 2 
Constant 91.268 8.934 1022 0.000 
Crycount 1.4929 0.4870 3.07 0.004 
S = 17.50 R-Sq = 20.7% R-Sq(adj) = 18.5% 


SOLUTION: 


Follow the four-step process! 


STATE: We want to test 

Hy : fi =0 
HO:81 = 0Ha:81>0 Ha : 61 > 0 
where B1=the slope?1 = the slope of the regression line relating y=IQ 
Score to x=countY = [Q Score tox = count of crying peaks in the 
population of infants. Use a=0.05.0 = 0.05. 


There is some evidence for Ha/4a because b1=1.4929>0.01 = 1.4929 >0. 
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PLAN: té test for the slope. 


Linear: The scatterplot shows a linear relationship between crying peaks 
and IQ, and there are no leftover curved patterns in the residual plot. / 


The tt statistic and PP-value are found in the row for “Crycount” under 
the corresponding headings. Because the P-value is for a two-sided test 


and there is evidence for Ha! a> we cut the PP-value in half for the one- 
sided test. 


Independent: 38 is less than 10% of all infants. / 


Normal: The histogram of residuals does not show strong skewness or 
clear outliers. / 


Equal SD: The residual plot shows a fairly equal amount of scatter 
around the horizontal line at 0 for all xz-values. Y 


Random: Random sample of 38 infants. / 
DO: 


e t=3.07t = 3.07 


e P-value=0.004/2=0.002/ “value = 0.004 /2 = 0.002 


CONCLUDE: Because the PP-value of 0.002<a=0.05,0-002 < a = 0.05, 
we reject H0O.40- There is convincing evidence of a positive linear 
relationship between the count of crying peaks and IQ score in the 
population of infants. 


t distribution 
with df = 36 


P-value = 0.002 
/ 


Refer to Technology Comer 31 for details on how to do calculations for 
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a tt test for slope. The calculator gives t=3.065t = 3.065 and P- 
value=0.002 using df=36,/-value = 0.002 using df = 36. 


FOR PRACTICE, TRY EXERCISE 15 


Based on the results of the crying and IQ study, should we ask doctors 
and parents to make infants cry more so that they’1] be smarter later in life? 
Hardly. This observational study gives statistically significant evidence of 
a positive linear relationship between the two variables. However, we can’t 
conclude that more intense crying as an infant causes an increase in IQ. 
Maybe infants who cry more are more alert to begin with and tend to score 
higher on intelligence tests. 


31. TECHNOLOGY CORNER |PERFORMING A 
SIGNIFICANCE TEST FOR SLOPE 


TI-Nspire and other technology instructions are on the book’s website at 

highschool.bfwpub.com/tps6e. 

Let’s use the data from the crying and IQ study to perform a 

significance test for the slope of the population regression line on the 

TI-83/84. 

e Enter the x#-values (crying count) into L1 and the y¥-values (IQ 
score) into L2. 


e Press [STAT], then choose TESTS and LinRegTTest.... 


e In the LinRegTTest screen, adjust the inputs as shown. Then 
highlight “Calculate” and press ENTER |. Note: Remember that the 
TI-83/84 uses B/ for the slope, not B1.°1- 
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ORMAL FLOAT AUTO REAL RADIAN CL it] 


LinRegTTest 


Frea:1 

B & p:#0 <0 
RegEQ: 
Calculate 


NORMAL FLOAT AUTO REAL RADIAN CL i 


LinkesgTTest 

y=atbx 

B>@ and P>O 

t=3. 065489379 

p=, 0020526501 
df=36 

a=91. 26829865 
b=1.492896598 
$s=17.49872122 


e The linear regression té test results take two screens to present. We 


show only the first screen. 


Think About It 


WHAT’S WITH THAT p>0? > 9 IN THE LinRegTTest SCREEN? 
The slope b1°1 of the least-squares regression line is closely related to 
the correlation rr” between the explanatory and response variables x 


and y¥Y (recall that bl=rsysx”! 3 ). In the same way, the slope B19 
of the population regression line is closely related to the correlation p? 
(the lowercase Greek letter rho) between x# and y¥ in the population. 
In particular, the slope is 0 when the correlation is 0. 


Testing the null hypothesis HOB=0/40° = 9 is exactly the same as testing 
that there is no correlation between x# and y¥ in the population from 
which we drew our data. You can use the test for zero slope to test the 
hypothesis H0:p=0/40 : P = 0 of zero correlation between any two 
quantitative variables. That’s a useful trick. Because correlation also 
makes sense when there is no explanatory— response distinction, it is 
handy to be able to test correlation without doing regression. 


CHECK YOUR UNDERSTANDING 


1720 


\ 


y The preceding Check Your Understanding (page 781) described 
some results from an experiment about seat locations and test grades. 
Here again is the output from a least-squares regression analysis for 

these data: 


Term Coef SE Coef T-Value P-Value 
Constant 85.71 4.24 20.22 0.000 
Row =1.117 ‘0.947 =(1¢8 0.248 


S = 10.0673 R-sq = 4.73% R-sq(adj) = 1.33% 


1. Do these data provide convincing evidence at the a=0.05a = 0.05 
significance level of a negative linear relationship between row and 
test score for students like those in the experiment? Assume that 
the conditions for regression inference are met. 

2. Describe a Type I error and a Type II error in this context. Which 
error is possible based on your conclusion? Explain your reasoning. 


Section 12.1 Summary 


e When an association between two quantitative variables is linear, use a 
least-squares regression line to model the relationship between the 
explanatory variable x# and the response variable y.4: 


e Inference in this setting uses the sample regression line y\=b0+b1x 
y = bo + i= to estimate or test a claim about the population (true) 


regression line j1y=B0+B1x.4¥ ~ Bo + Piz. 


e The conditions for regression inference are 


: Linear: The actual relationship between x# and y¥ is linear. For any 


fixed value of x,”> the mean response pryhy falls on the population 
(true) regression line py=B0+B1x.4y ~ Bo + Pix. 


« Independent: Individual observations are independent of each other. 
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When sampling without replacement, check the 10% condition. 


« Normal: For any fixed value of x,» the response y¥ varies according 
to a Normal distribution. 


: Equal SD: The standard deviation of y¥ (call it o7) is the same for all 
values of x.@. 


= Random: The data come from a random sample from the population 
of interest or a randomized experiment. 


When the conditions for inference are met, the sampling distribution of 
the sample slope b151 is approximately Normal with mean pb1=B1 


= Orn 
ry 


Me, = Bi and standard deviation ob1=ooxn. 


The slope b1®1 and intercept b0%0 of the sample regression line estimate 
the slope B1°1 and intercept B00 of the population (true) regression 
line. Use the standard deviation of the residuals ss to estimate o7 and 


S 8 
the standard error of the slope SEb1=ssxn-1 seV"~1 to estimate 


ob1.741° 


Confidence intervals and significance tests for the slope B1°1 of the 
population regression line are based on a t¢ distribution with n-2” ~ 2 
degrees of freedom. 


The ttinterval for the slope B1°1 is b1+t*SEb1.01 = SEp, - 


To test the null hypothesis HO: 8 1=hypothesized slope, 
Ho:; = hypothesized slope, carry out a tttest for the slope. This test 
uses the standardized test statistic 
b; —hypothesized slope 
-_ SE; 
t=b1—-hypothesized slopeSEb1 1 


The most common null hypothesis is H0:81=0,40:41 = 9, which says 
that there is no linear relationship between x# and y¥ in the population. 


12.1 Technology Corners | >» 


TI-Nspire and other technology instructions are on the book’s website at 
highschool.bfwpub.com/tps6e. 
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30. Constructing a confidence interval for slope Page 781 
31. Performing a significance test for slope Page 785 


Section 12.1 Exercises 


1. Predicting height Using the health records of every student at a high 


2. 


school, the school nurse created a scatterplot relating y=height 

y = height (jn centimeters) to x=age® = @8¢ (in years). After verifying 
that the conditions for the regression model were met, the nurse 
calculated the equation of the population regression line to be 
wy=105+4.2xHy = 105 + 4.22 ith g=70 = 7 cm. 


a. According to the population regression line, what is the average 
height of 15-year-old students at this high school? 


b. About what percent of 15-year-old students at this school are taller 


than 180 cm? 


c. If the nurse used a random sample of 50 students from the school to 
calculate the regression line instead of using all the students, would 
the slope of the sample regression line be exactly 4.2? Explain your 
answer. 

Predicting high temperatures Using the daily high and low 

temperature readings at Chicago’s O’ Hare International Airport for an 

entire year, a meteorologist made a scatterplot relating y=highY = high 
temperature to x=lowz = low temperature, both in degrees 

Fahrenheit.2 After verifying that the conditions for the regression 

model were met, the meteorologist calculated the equation of the 


= 16.6 + 1.022 


population regression line to be py=16.6+1.02xv with 


0=6.64°F.c = 6.64 F. 


a. According to the population regression line, what is the average 
high temperature on days when the low temperature is 40°F ?40 F? 


b. About what percent of days with a low temperature of 40°F40 F 


have a high temperature greater than 70°F?70 F? 


c. If the meteorologist used a random sample of 10 days to calculate 
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the regression line instead of using all the days in the year, would 

the slope of the sample regression line be exactly 1.02? Explain 

your answer. 
Oil and residuals Researchers examined data on the depth of small 
defects in the Trans-Alaska Oil Pipeline. The researchers compared the 
results of measurements on 100 defects made in the field with 
measurements of the same defects made in the laboratory.? The figure 
shows a residual plot for the least-squares regression line based on 
these data. Explain why the conditions for performing inference about 
the slope 81/1 of the population regression line are not met. 


305 
20-7 
104 % 
° % 
3 . * 
= “ wy 2 ‘ 
A} a 2.0 
= 0 _ 2. om fa ————— — a 
oS "Ze ® 
os oY fe 3 . 
» 
-10> e 
® 
-207 
-307 
T T T T 
0 20 40 60 80 


Laboratory measurement 


SAT Math scores Is there a relationship between the percent of high 
school graduates in each state who took the SAT and the state’s mean 
SAT Math score? Here is a residual plot from a linear regression 
analysis that used data from all 50 states in a recent year. Explain why 
the conditions for performing inference about the slope B11 of the 
population regression line are not met. 


Residual 
Co 
1 


“Tl T T T T T T T T Ir 
0 10 20 30 40 50 60 70 80 90 


Percent of high school graduates taking the SAT 
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5. pg 775 |») Beer and BAC How well does the number of beers a 


person drinks predict his or her blood alcohol content (BAC)? Sixteen 
volunteers aged 21 or older with an initial BAC of 0 took part in a 
study to find out. Each volunteer drank a randomly assigned number of 
cans of beer. Thirty minutes later, a police officer measured their 
BAC. A least-squares regression analysis was performed on the data 
using x=number = number of beers and y=BAC.Y = BAC. Here is a 
residual plot and a histogram of the residuals. Check whether the 
conditions for performing inference about the regression model are 
met. 


Residual 
So 
= 


0 1 2 3 4 5 6 7 8 9 
Number of beers 


Frequency 
MN 


0 


—0.03 -0.02 -0.01 0.00 0.01 0.02 0.03 0.04 
Residual 


6. Prey attracts predators Here is one way in which nature regulates the 
size of animal populations: high population density attracts predators, 
which remove a higher proportion of the population than when the 
density of the prey is low. One study looked at kelp perch and their 
common predator, the kelp bass. On each of four occasions, the 
researcher set up four large circular pens on sandy ocean bottoms off 
the coast of southern California. He randomly assigned young perch to 
1 of 4 pens so that one pen had 10 perch, one pen had 20 perch, one 


1725 


pen had 40 perch, and the final pen had 60 perch. Then he dropped the 
nets protecting the pens, allowing bass to swarm in, and counted the 
number of perch killed after two hours.’ A regression analysis was 
performed on the 16 data points using x=numberz = number of perch 
in pen and y=proportionY = Proportion of perch killed. Here is a 
residual plot and a histogram of the residuals. Check whether the 
conditions for performing inference about the regression model are 
met. 


Residual 


Number of perch 


w 


Frequency 
N 


04 03 02 01 00 O1 O02 03 
Residual 


7. pg 777 |») Beer and BAC Refer to Exercise 5. Here is computer 


output from the least-squares regression analysis of the beer and blood 
alcohol data. 


Dependent variable is: BAC 

No Selector 

R squared = 80.0% R squared (adjusted) = 78.6% 

s = 0.0204 with 16 — 2 = 14 degrees of freedom 

Variable Coefficient  s.e. of Coeff t-ratio prob 
Constant  -0.012701 0.0126 -1.00 0.3320 
Beers 0.017964 0.0024 7.84  <0.0001 
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10. 


11. 


. What is the estimate for 8020” Interpret this value. 


jab) 


oO 


. What is the estimate for 8121" Interpret this value. 
c. What is the estimate for o?¢? Interpret this value. 


d. Give the standard error of the slope SEb1.52h - Interpret this value. 
Prey attracts predators Refer to Exercise 6. Here is computer output 
from the least-squares regression analysis of the perch data. 


Predictor Coef Stdev.  t-ration p 
Constant 0.12049 0.09269 1.30 0.215 
Perch 0.0085690.002456 3.49 0.004 
S= R-Sq = 46.5% = R-Sq(adj) = 
0.1886 42.7% 


. What is the estimate for B0?0? Interpret this value. 


oOo p 


. What is the estimate for 8121" Interpret this value. 

c. What is the estimate for o?¢? Interpret this value. 

d. Give the standard error of the slope SEb1.92H - Interpret this value. 
Beer and BAC Refer to Exercises 5 and 7. 


a. Find the critical value for a 99% confidence interval for the slope of 
the true regression line. Then calculate the confidence interval. 


b. Interpret the interval from part (a). 


c. Explain the meaning of “99% confident” in this context. 
Prey attracts predators Refer to Exercises 6 and 8. 


a. Find the critical value for a 90% confidence interval for the slope of 
the true regression line. Then calculate the confidence interval. 


b. Interpret the interval from part (a). 

c. Explain the meaning of “90% confident” in this context. 

pg 779 |) Less mess? Kerry and Danielle wanted to investigate if 
tapping on a can of soda would reduce the amount of soda expelled 
after the can has been shaken. For their experiment, they vigorously 
shook 40 cans of soda and randomly assigned each can to be tapped 


for 0 seconds, 4 seconds, 8 seconds, or 12 seconds. After opening the 
cans and waiting for the fizzing to stop, they measured the amount 
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expelled (in milliliters) by subtracting the amount remaining from the 
original amount in the can.2 Here are their data: 


Amount expelled (mL) 


(0 sec) (4 sec) (8 sec) (12 sec) 
110 95 88 80 
100 105 84 72 
105 105 87 80 
105 105 85 75 
105 95 79 70 
110 90 100 65 
107 88 85 71 
105 95 85 ei 
104 94 80 76 
106 96 80 75 


Here is some computer output from a least-squares regression analysis 
of these data. Construct and interpret a 95% confidence interval for the 
slope of the true regression line. 
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12. 


Amount expelled (mL) 
8 
L 


o 
N 
> 


6 8 10 12 
Tapping time (sec) 


Residual (mL) 
o 
| 
+ 
ee 


Tapping time (sec) 


Frequency 


-15 -10 -5 0 5 10 15 = 20 


Residual (mL) 
Predictor Coef SE Coef T P 
Constant 106.36 1.3238 80.345 0.000 
Tapping time -2.6350 0.1769 ~-14.895 0.000 


S =5.00347 R-Sq= 85.4% R-Sq(adj) = 85.0% 


More mess? When Mentos are dropped into a newly opened bottle of 
Diet Coke, carbon dioxide is released from the Diet Coke very rapidly, 
causing the Diet Coke to be expelled from the bottle. To see if using 
more Mentos causes more Diet Coke to be expelled, Brittany and Allie 
used twenty-four 2-cup bottles of Diet Coke and randomly assigned 
each bottle to receive either 2, 3, 4, or 5 Mentos. After waiting for the 
fizzing to stop, they measured the amount expelled (in cups) by 
subtracting the amount remaining from the original amount in the 
bottle.° Here are their data: 


Amount expelled (cups) 
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(2 Mentos) 
1,125 

1,25 
1.0625 
1.25 

1.125 
1.0625 


(3 Mentos) 
1.1875 
1.125 
1,25 
1.1875 
1.3125 
1.1875 


(4 Mentos) 
1.25 
1.3125 
1.25 
1.375 
1.3125 
1.25 


(5 Mentos) 
1.25 
1.4375 
1.3125 
1.3125 
1.375 
1.4375 


Here is computer output from a least-squares regression analysis of 
these data. Construct and interpret a 95% confidence interval for the 


slope of the true regression line. 


1.45 4 


& 140-4 
g 1.35 4 
2 1304 
& 1.25 4 
2 120-4 
3 115 4 
1.10 4 
2 1.05 4 ae - se: 
2 3 4 5 
Number of Mentos 
0.10 
Fa 0.05 : 
- 0.00 
3 -0.05 i‘ 
m e 
-0.10 . 
T T T T 
2 3 4 5 
Number of Mentos 
10 
8 
oy 
g 6 
E. 
2 
0.10 0.00 0.10 
Residual (cups) 
Predictor Coef SE Coef T P 
Constant 1.0021 0.0451 22.215 0.000 
Mentos 0.0708 0.0123 5.770 0.000 
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S = 0.06724 R-Sq=60.2% R-Sq(adj) = 58.4% 


13. Beavers and beetles Do beavers benefit beetles? Researchers laid out 
23 circular plots, each 4 meters in diameter, at random in an area 
where beavers were cutting down cottonwood trees. In each plot, they 
counted the number of stumps from trees cut by beavers and the 
number of clusters of beetle larvae. Ecologists think that the new 
sprouts from stumps are more tender than other cottonwood growth so 
that beetles prefer them. If so, more stumps should produce more 


beetle larvae.Z 


Here is computer output for a regression analysis of these data. 
Construct and interpret a 99% confidence interval for the slope of the 
population regression line. Assume that the conditions for performing 
inference are met. 


Predictor Coef SE Coef T P 
Constant —1.286 2.853 —0.45 0.657 
Stumps 11.894 1.136 10.47 0.000 


S=6.41939 R-Sq=83.9% R-Sq(adj) = 83.1% 


14. Ideal proportions The students in Mr. Shenk’s class measured the arm 
spans and heights (in inches) of a random sample of 18 students from 
their large high school. Here is computer output from a least-squares 
regression analysis of these data. Construct and interpret a 90% 
confidence interval for the slope of the population regression line. 
Assume that the conditions for performing inference are met. 


Predictor Coef Stdev t-ratio Pp 
Constant 11.547 5.600 2.06 0.056 
Armspan 0.84042 0.08091 10.39 0.000 


S=1.613 R-Sq=87.1% R-Sq(adj) = 86.3% 


15. pg 783 |p) Weeds among the corn Lamb’s quarters is a common 


weed that interferes with the growth of corn. An agriculture researcher 
planted corn at the same rate in 16 small plots of ground and then 
weeded the plots by hand to allow a fixed number of lamb’s quarters 


1731 


plants to grow in each meter of corn row. The decision on how many 
of these plants to leave in each plot was made at random. No other 
weeds were allowed to grow. Here are the yields of corn (bushels per 
acre) in each of the plots:8 


Weeds per meter Yield Weeds per meter Yield 
0 166.7 3 158.6 
0 172.2 3 176.4 
0 165.0 3 153, 
0 176.9 3 156.0 
1 166.2 9 162.8 
1 157.3 9 142.4 
1 166.7 o 162.8 
1 161.1 9 162.4 


Here is some computer output from a least-squares regression analysis 
of these data. Do these data provide convincing evidence at the a=0.05 
a = 0.05 level that more lamb’s quarters reduce corn yield? 
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Yield (bushels/acre) 
3 
| 


Weeds per meter 


15-4 
® 10-7 
8 
4 5-4” . 
B oopt— 
3 -104 : 
-15 4 , 
hee ibe IN os ae A 
0 1 Z | 4 5 6 7 8 9 
Weeds per meter 
15 -10 5 0 5 10 15 
Residual (bushels/acre) 
Predictor Coef SE Coef T P 
Constant 166.483 Z325 61.11 0.000 
Weeds per —1.0987 0.5712 =1.92 0.075 


meter 
S=7.97665 R-Sq=20.9% R-Sq(adj) = 15.3% 


16. Time at the table Does how long young children remain at the lunch 
table help predict how much they eat? Here are data on a random 
sample of 20 toddlers observed over several months.2 “Time” is the 
average number of minutes a child spent at the table when lunch was 
served. “Calories” is the average number of calories the child 
consumed during lunch, calculated from careful observation of what 
the child ate each day. 


Time Calories Time Calories 


21.4 472 42.4 450 
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30.8 498 43.1 410 


Bled 465 Zod 504 
33.5 456 3L3 437 
32.8 423 28.6 489 
he 437 o2.0 436 
22.8 508 30.6 480 
34.1 431 35.1 439 
33.9 479 33.0 444 
43.8 454 43.7 408 


Here is some computer output from a least-squares regression analysis 
of these data. Do these data provide convincing evidence at the a=0.01 
a = 0.01 level of a linear relationship between time at the table and 
calories consumed in the population of toddlers? 


$ 
‘E 
< 
a 
a) 
Time (min) 
40 
30 
S 20 
= 10 
i. as 
S -10 
7 
-20 
-30 
-40 
T T T T T 
20 25 30 35 40 45 
Time (min) 
py te or 
40 -30 -20 -10 0 10 2 30 40 
Residual (cal) 
Predictor Coef SE Coef T P 
Constant 560.65 29.37 19.09 0.000 
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Time -3.0771 0.8498 -3.62 0.002 
S = 23.3980 R-Sq=42.1%  R-Sq(adj) = 38.9% 


17. Is wine good for your heart? A researcher from the University of 
California, San Diego, collected data on average per capita wine 
consumption and heart disease death rate in a random sample of 19 
countries for which data were available. The following table displays 
the data.12 


Alcohol from Alcohol from 

wine Heart disease death wine Heart disease death 
(liters/year) rate (per 100,000) (liters/year) rate (per 100,000) 
2.5 211 7.9 107 
3.9 167 1.8 167 
29 131 1.9 266 
2.4 191 0.8 227 
2.9 220 6.5 86 
0.8 297 1.6 207 
9.1 71 5.8 115 
2.7 172 1.3 285 
0.8 211 1.2 199 
0.7 300 


Is there convincing evidence of a negative linear relationship between 
wine consumption and heart disease deaths in the population of 
countries? 

18. The professor swims Here are data on the time (in minutes) Professor 
Moore takes to swim 2000 yards and his pulse rate (beats per minute) 
after swimming on a random sample of 23 days: 


Time 34.12 35.72 34.72 34.05 34.13 35.72 
Pulse 152 124 140 152 146 128 
Time 36.17 35.57 39.37 35.57 35.43 36.05 
Pulse 136 144 148 144 136 124 
Time 34.85 34.70 34.75 oe Fes fe 34.60 34.00 
Pulse 148 144 140 156 136 148 
Time 34.35 35.62 35.68 35.28 cloned 

Pulse 148 132 124 132 139 
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Is there convincing evidence of a negative linear relationship between 
Professor Moore’s swim time and his pulse rate in the population of 
days on which he swims 2000 yards? 

19. Turn up the volume? Nicole and Elena wanted to know if listening to 
music at a louder volume negatively impacts test performance. To 
investigate, they recruited 30 volunteers and randomly assigned 10 
volunteers to listen to music at 30 decibels, 10 volunteers to listen to 
music at 60 decibels, and 10 volunteers to listen to music at 90 
decibels. While listening to the music, each student took a 10-question 
math test. Here is computer output from a least-squares regression 
analysis using x=volumex = volume and y=numberY = number 


correct:44 

Predictor Coef SE Coef T P 
Constant 9.9000 0.7525 13.156 0.0000 
Volume -—0.0483 0.0116 —4.163 0.0003 


S=1.55781  R-Sq=38.2%  R-Sq(adj) = 36.0% 


Is there convincing evidence that listening to music at a louder volume 
hurts test performance? Assume the conditions for inference are met. 

20. Pencils and GPA Is there a relationship between a student’s GPA and 
the number of pencils in his or her backpack? Jordynn and Angie 
decided to find out by selecting a random sample of students from 
their high school. Here is computer output from a least-squares 
regression analysis using x=numberz = number of pencils and 
y=GPA:¥ = GPA :12 


Predictor Coef SE Coef T P 
Constant 3.2413 0.1809 17.920 0.0000 
Pencils —0.0423 0.0631 —0.670 0.5062 


S = 0.738533 R-Sq=0.9% R-Sq(adj) = 0.0% 


Is there convincing evidence of a linear relationship between GPA and 
number of pencils for students at this high school? Assume the 
conditions for inference are met. 

21. Stats teachers’ cars A random sample of 21 AP® Statistics teachers 
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was asked to report the age (in years) and mileage of their primary 
vehicles. Here is a scatterplot of the data: 


150,000 
120,000 - 


90,000 


Mileage 


60,000 


30,000 


oe T T 
0 2.5 5.0 75 10.0 
Car age (yr) 
Here is some computer output from a least-squares regression analysis 
of these data. Assume that the conditions for regression inference are 
met. 


Variable Coef SE Coef t-ratio prob 
Constant 7288.54 6591 1.11 0.2826 
Car age 11630.6 1249 6.31 <0.0001 


S=19280 R-Sq=82.0% RSq(adj) = 81.1% 


a. Verify that the 95% confidence interval for the slope of the 
population regression line is (9016.4, 14,244.8). 


b. A national automotive group claims that the typical driver puts 
15,000 miles per year on his or her main vehicle. We want to test 
whether AP® Statistics teachers are typical drivers. Explain why an 


appropriate pair of hypotheses for this test is HO:81=15,000 
Ho : 8; = 15,000 versus Ha:81#15,000.4e : F1 # 15,000. 


c. Compute the standardized test statistic and P P-value for the test in 
part (b). What conclusion would you draw at the a=0.05a@ = 0.05 
significance level? 


d. Does the confidence interval in part (a) lead to the same conclusion 
as the test in part (c)? Explain your answer. 


1737 


22. Paired tires Exercise 75 in Chapter 10 (page 686) compared two 
methods for estimating tire wear. The first method used the amount of 
weight lost by a tire. The second method used the amount of wear in 
the grooves of the tire. A random sample of 16 tires was obtained. 
Both methods were used to estimate the total distance traveled by each 
tire. The following scatterplot displays the two estimates (in thousands 


of miles) for each tire.2 


Groove method (1000 mi) 
8 
! 


10 - 


T T T T T 
10 20 30 40 50 


Weight method (1000 mi) 


Here is some computer output from a least-squares regression analysis 
of these data. Assume that the conditions for regression inference are 


met. 

Predictor Coef SE Coef T P 
Constant 1.351 2.105 0.64 0.531 
Weight 0.79021 0.07104 11.12 0.000 


S = 2.62078 R-Sq=89.8% R-Sq(adj) = 89.1% 


a. Verify that the 99% confidence interval for the slope of the 
population regression line is (0.5787, 1.0017). 


b. Researchers want to test whether there is a difference in the two 
methods of estimating tire wear. Explain why the researchers might 


want to test the hypotheses H0:81=140 : 61 = 1 versus Ha:8141. 
H,: pi #l 


c. Compute the standardized test statistic and PP-value for the test in 
part (b). What conclusion would you draw at the a=0.01a = 0.01 
significance level? 


d. Does the confidence interval in part (a) lead to the same conclusion 
as the test in part (c)? Explain your answer. 
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25s 


Multiple Choice Select the best answer for Exercises 23-28. 
Exercises 23—28 refer to the following setting. To see if students with 
longer feet tend to be taller, a random sample of 25 students was 
selected from a large high school. For each student, x=foot length 

« = foot length (cm) and y=height¥ = height (cm) were recorded. We 
checked that the conditions for inference about the slope of the 
population regression line are met. Here is a portion of the computer 
output from a least-squares regression analysis using these data: 


Predictor Coef SE Coef T P 
Constant 91.9766 10.2204 8.999 0.000 
Foot length 3.0867 0.4117 7.498 0.000 


S=6.47044 R-Sq=72.8% R-Sq(adj)= 71.5% 


Which of the following is the equation of the least-squares regression 


line for predicting height from foot length? 


a. height)=10.2204+0.4117 (foot length) 


height = 10.2204 + 0.4117 (foot length) 


b. height(=0.4117+3.0867 (foot length) 


height = 0.4117 + 3.0867 (foot length) 


c. height\=91.9766+3.0867 (foot length) 


height = 91.9766 + 3.0867 (foot length) 


d. height\=91.9766+6.47044 (foot length) 


height = 91.9766 + 6.47044 (foot length) 


e. height\=3.0867+6.47044 (foot length) 


height = 3.0867 + 6.47044 (foot length) 


24. The slope B1°1 of the population regression line describes 


a. the exact increase in height (cm) for students at this high school 


when foot length increases by 1 cm. 


b. the average increase in foot length (cm) for students at this high 


school when height increases by 1 cm. 


c. the average increase in height (cm) for students at this high school 
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d. 


e. 


when foot length increases by 1 cm. 


the average increase in foot length (cm) for students in the sample 
when height increases by 1 cm. 


the average increase in height (cm) for students in the sample when 
foot length increases by 1 cm. 


25. Is there convincing evidence that height increases as foot length 
increases? To answer this question, test the hypotheses 


26. 


27s 


a. 
b. 
C. 
d. 


e. 


H0:81=040 : 91 = 9 versus Ha:B1&gt;0.4e : fi > 9. 
HO0:81=0/0 : 51 = 9 versus Ha: B1<0.Ha : fi &lt; 0. 
HO0:81=040 : 61 = 9 versus Ha:B140./0 : Fi # 9: 

HO:61&gt;0/40 : 91 > 0 versus Ha:B1=0.4a : 1 = 0. 
HO:B1=14o : 1 = 1 versus Ha:B1&gt;1.4a : 1 > 1. 


Which of the following is the best interpretation of the value 0.4117 in 
the computer output? 


a. 


For each increase of 1 cm in foot length, the average height 
increases by about 0.4117 cm. 


. When using this model to predict height, the predictions will 


typically be off by about 0.4117 cm. 


The linear relationship between foot length and height accounts for 
41.17% of the variation in height. 


The linear relationship between foot length and height is moderate 
and positive. 


In repeated samples of size 25, the slope of the sample regression 
line for predicting height from foot length will typically vary from 
the population slope by about 0.4117. 


Which of the following is a 95% confidence interval for the population 
slope B1?°1? 


a. 
b. 
C. 


d. 


3.0867+0.41173.0867 = 0.4117 
3.0867+0.85183.0867 = 0.8518 
3.0867+0.80693.0867 = 0.8069 
3.086740.84973.0867 = 0.8497 
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e. 3.086740.84813.0867 + 0.8481 
28. Which of the following would have resulted in a violation of the 
conditions for inference? 


a. If the entire sample was selected from one classroom 
b. If the sample size was 15 instead of 25 


c. If the scatterplot of x=foott = foot length and y=height¥ = height 
did not show a perfect linear relationship 


d. If the histogram of heights had an outlier 


e. If the standard deviation of foot length was different from the 
standard deviation of height 


Recycle and Review Exercises 29-31 refer to the following setting. 
Does the color in which words are printed affect your ability to read 
them? Do the words themselves affect your ability to name the color in 
which they are printed? Mr. Starnes designed a study to investigate 
these questions using the 16 students in his AP® Statistics class as 
subjects. Each student performed the following two tasks in random 
order while a partner timed his or her performance: (1) Read 32 words 
aloud as quickly as possible, and (2) say the color in which each of 32 
words is printed as quickly as possible. Try both tasks for yourself 
using the word list given. 


BROWN RED BLUE GREEN 
RED GREEN BROWN BROWN 
GREEN RED BLUE BLUE 
BROWN BLUE GREEN RED 
BLUE BROWN RED RED 
RED BLUE BROWN GREEN 
BLUE GREEN GREEN BLUE 
GREEN BROWN RED BROWN 


29. Color words (4.2) Let’s review the design of the study. 
a. Explain why this was an experiment and not an observational study. 


b. Did Mr. Starnes use a completely randomized design or randomized 
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Cc. 


block design? Why do you think he chose this experimental design? 


Explain the purpose of the random assignment in the context of the 
study. 


Here are the data from Mr. Starnes’s experiment. For each subject, the 
time to perform the two tasks is given to the nearest second. 


Subject Words Colors Subject Words Colors 


1 13 20 a 10 16 
2 10 21 10 , 13 
3 15 22 11 11 11 
4 12 25 12 17 26 
fs) 13 17 13 15 20 
6 11 13 14 15 15 
A 14 Be 15 2 18 
8 16 21 16 10 18 


30. Color words (10.3) Now let’s analyze the data. 


a. 


Calculate the difference (Colors-Words)(Colors ~ Words) for each 


subject and summarize the distribution of differences with a 
boxplot. Does the graph provide evidence of a difference in the 
average time required to perform the two tasks? Explain your 
answer. 


Explain why it is not safe to use paired t¢ procedures to do 
inference about the mean difference in time to complete the two 
tasks. 


31. Color words (3.1, 3.2, 12.1) Can we use a student’s word task time to 
predict his or her color task time? 


a. 


Make an appropriate scatterplot to help answer this question. 
Describe what you see. 


. Use technology to find the equation of the least-squares regression 


line. Define any variables you use. 


Find and interpret the residual for the student who completed the 
word task in 9 seconds. 


Assume that the conditions for performing inference about the slope 
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of the true regression line are met. The PP-value for a test of HO: 
B1=040:51 = 9 versus Ha: B1&gt;04a:F1 > 9 is 0.0215. Interpret 
this value. 
Note: John Ridley Stroop is often credited with the discovery in 1935 
of the fact that the color in which “color words” are printed interferes 
with people’s ability to identify the color. The paper outlining the so- 
called Stroop effect, though, was originally published by German 
researchers in 1929. 
32. Yahtzee (5.3, 6.3) In the game of Yahtzee, 5 six-sided dice are rolled 
simultaneously. To get a Yahtzee, the player must get the same 
number on all 5 dice. 


a. Luis says that the probability of getting a Yahtzee in one roll of the 
] 3) 
dice is ieee ;) ’ Explain why Luis is wrong. 

b. Nassir decides to keep rolling all 5 dice until he gets a Yahtzee. He 
is surprised when he still hasn’t gotten a Yahtzee after 25 rolls. 
Should he be? Calculate an appropriate probability to support your 
answer. 
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SECTION 12.2 Transforming to Achieve Linearity 


LEARNING TARGETS By the end of the section, you should be able to: 


e Use transformations involving powers and roots to find a power 
model that describes the relationship between two quantitative 
variables, and use the model to make predictions. 


Use transformations involving logarithms to find a power model 
that describes the relationship between two quantitative variables, 
and use the model to make predictions. 


Use transformations involving logarithms to find an exponential 
model that describes the relationship between two quantitative 
variables, and use the model to make predictions. 


Determine which of several transformations does a better job of 
producing a linear relationship. 


In Chapter 3, we learned how to analyze relationships between two 
quantitative variables that showed a linear pattern. When two-variable data 
show a curved relationship, we must develop new techniques for finding 
an appropriate model. This section describes several simple 
transformations of data that can straighten a nonlinear pattern. Once the 
data have been transformed to achieve linearity, we can use least-squares 
regression to generate a useful model for making predictions. And if the 
conditions for regression inference are met, we can estimate or test a claim 
about the slope of the population (true) regression line using the 
transformed data. 

The Gapminder website (www.gapminder.org) provides loads of data 
on the health and well-being of the world’s inhabitants. Figure 12.6 shows 
a scatterplot of data from Gapminder./4 The individuals are all the world’s 
nations for which data were available in 2015. The explanatory variable, 
income per person, is a measure of how rich a country is. The response 
variable is life expectancy at birth. 


1744 


. es) 2, & = be 3 ss 

m 80-4 @ pn atte e @) e ‘. 
i : 3. § © - rs 
Ras] 75 = - eH o> < 2 ° + 
$ 65 - --—{ Gabon] United States 
a 60 4 a © ® - 2 
. S+ & : Equatorial Guinea BH Americas 

° _ 
5 am Europe & Russia 


® Asia & Pacific 


40 - South Africa | [| Africa 
l I T T 
20,000 40,000 60,000 80,000 100,000 120,000 140,000 


Income per person ($) 
FIGURE 12.6 Scatterplot of the life expectancy of people in many nations against 
each nation’s income per person. The color of each circle indicates the geographic 
region in which that country is located. The size of each circle is based on the 
population of the country—bigger circles indicate larger populations. 


We expect people in richer countries to live longer because they have 
better access to medical care and typically lead healthier lives. The overall 
pattern of the scatterplot does show this, but the relationship is not linear. 
Life expectancy rises very quickly as income per person increases and then 
levels off. People in very rich countries such as the United States live no 
longer than people in poorer but not extremely poor nations. In some less 
wealthy countries, people live longer than in the United States. 


Four African nations are outliers. Their life expectancies are similar to 
those of their neighbors, but their income per person is higher. Gabon and 
Equatorial Guinea produce oil, and South Africa and Botswana produce 
diamonds. It may be that income from mineral exports goes mainly to a 
few people and so pulls up income per person without much effect on 
either the income or the life expectancy of ordinary citizens. That is, 
income per person is a mean, and we know that mean income can be much 
higher than median income. 

The scatterplot in Figure 12.6 shows a curved pattern. We can 
straighten things out using logarithms. Figure 12.7 plots the logarithm of 
income per person against life expectancy for these same countries. The 
effect is remarkable. This graph has a clear, linear pattern. 
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FIGURE 12.7 Scatterplot of life expectancy against log(income per person) for 
many nations. 


Applying a function such as the logarithm or square root to a 
quantitative variable is called transforming the data. Transforming data 
amounts to changing the scale of measurement that was used when the 
data were collected. In Chapter 2 , we discussed linear transformations, 
such as converting temperature in degrees Fahrenheit to degrees Celsius or 
converting distance in miles to kilometers. However, linear 
transformations cannot straighten a curved relationship between two 
variables. To do that, we resort to functions that are not linear. The 
logarithm function, applied in the income and life expectancy example, is 
a nonlinear function. We’ll return to transformations involving logarithms 
later. 


Transforming with Powers and Roots 
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Vasko/Getty Images 


When you visit a pizza parlor, you order a pizza by its diameter—say, 10 
inches, 12 inches, or 14 inches. But the amount you get to eat depends on 
the area of the pizza. The area of a circle is m7 times the square of its 
radius r.”- So the area of a round pizza with diameter x@ is 


area =m? =0(=) =n x? ae 
= oe 4] 4 


This is a power model of the form y=axpY = @@" with a=n/4¢ = 7/4 and 


area=mr2=m(x2)2=n(x24)=n4x2 


When we are dealing with things of the same general form, whether 
circles or fish or people, we expect area to go up with the square of a 
dimension such as diameter or height. Volume should go up with the cube 
of a linear dimension. That is, geometry tells us to expect power models in 
some settings. There are other physical relationships between two 
variables that are described by power models. Here are some examples 
from science. 


e The distance that an object dropped from a given height falls is related 
to time since release by the model 


; : > 
distance=a(time)2“istance — i (time) 


e The time it takes a pendulum to complete one back-and-forth swing (its 
period) is related to its length by the model 
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: 1/2 
period=alength=a(length)1/2Petiod = a,/length = a(length) 


e The intensity of a light bulb is related to distance from the bulb by the 
model 


intensity = = a(distance) ” 


intensity=adistance2=a(distance)-2 distance? 


Although a power model of the form y=axpY = @2” describes the 
nonlinear relationship between x# and y¥ in each of these settings, there is 
a linear relationship between xp2” and y.¥: If we transform the values of 
the explanatory variable x# by raising them to the pP? power, and graph the 
points (xp, y), (2 ¥)> the scatterplot should have a linear form. The 
following example shows what we mean. 


EXAMPLE |Go fish! @ 


Transforming with powers 


Les Breault/Alamy 


PROBLEM: Imagine that you have been put in charge of organizing a 
fishing tournament in which prizes will be given for the heaviest 
Atlantic Ocean rockfish caught. You know that many of the fish caught 
during the tournament will be measured and released. You are also 
aware that using delicate scales to try to weigh a fish that is flopping 


1748 


around in a moving boat will probably not yield very accurate results. It 
would be much easier to measure the length of the fish while on the 
boat. What you need is a way to convert the length of the fish to its 
weight. 


You contact the nearby marine research laboratory, and it provides 
reference data on the length (in centi- meters) and weight (in grams) for 
Atlantic Ocean rockfish of several sizes.42 Here is a scatterplot of the 
data. Note the clear curved form. 


Length 52) ep 115) 143 316.6) 2192 28s 23.3. 20.0) 26.7 


Weight 2 8 21 38 69 117. 148 »=©190 264 = 293 


Length 28.2. 29.6 308 32.0 33.0 340 349 364 37.1 37.7 
Weight 318 371 455 «65504 0518) 6537) =6651—S 719) 726_~—Ss 8:10 


800 + . 
700 4 ” 
° 
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u | - 
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. 
100 — 
0- . ° : 
T T T T T 
8 16 24 32 40 
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Because length is one-dimensional and weight (like volume) is three- 
dimensional, a power model of the form weight=a (length)3 


8 = 3 
weight = a(length)” should describe the relationship. Here is a 


scatterplot of weight versus length?: 


Weight (g) 
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Because the transformation made the association roughly linear, we 
used computer software to perform a linear regression analysis of 


=f: 3 
y=weight¥ ~ weight versus x=length3.% = length”. 


Regression Analysis: Weight versus Length\3 

Predictor Coef SE Coef T 2 
Constant 4.066 6.902 0.59 0.563 
Length3 0.0146774 0.0002404 61.07 0.000 
S=18.8412 R-Sq=99.5% R- 


a. Give the equation of the least-squares regression line. Define any 
variables you use. 


b. Suppose a contestant in the fishing tournament catches an Atlantic 
Ocean rockfish that’s 36 centimeters long. Use the model from part 
(a) to predict the fish’s weight. 


SOLUTION: 


a. weight\=4.066+0.0146774 (length)3 
weight — 4.066 + 0.0146774 (length)* 


b. weight\=4.066+0.0146774(36)3weight\=688.9 grams 
weight — 4.066 + 0.0146774(36)° 


weight = 688.9 grams 


If you write the equation as y\=4.066+0.0146774x3, 


y = 4.066 + 0.01467742°, make sure to define y=weight¥ = weight and 
x=length.” = length. 


FOR PRACTICE, TRY EXERCISE 33 


There’s another way to transform the data in the “Go fish!” example to 
achieve linearity. We can take the cube root of the weight values and graph 
S/orataht . : 
weight3V W¢isht versus length. Figure 12.8 shows that the resulting 
scatterplot has a linear form. 
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FIGURE 12.8 The scatterplot of weight3 


weight 


versus length is 
linear. 


Why does this transformation work? Start with weight=a(length)3 


3 
weight = a(length)” and take the cube root of both sides of the equation: 


\/weight = ,)/a(length)° 
= - ,/weight = \/a (length) 
weight3=a(length)33weight3=a3(length) 


That is, there is a linear relationship between length and weight3. 


\/ weight. 


EXAMPLE |Go fish! @ 


Transforming with roots 
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Doug Wilson/Alamy 


PROBLEM: Figure 12.8 shows that the relationship between length 


and weight3 V weight j. roughly linear for Atlantic Ocean rockfish. Here 
is computer output from a linear regression analysis of y=weight3 


y = ./ weight versus x=length® = length 


3 > 
Regression Analysis: weight3 V weight versus Length 


Predictor Coef SE Coef T P 
Constant —0.02204 0.07762 -0.28 0.780 
Length 0.246616 0.002868 86.00 0.000 


S = 0.124161 R-Sq = 99.8% R-Sq(adj) = 99.7% 


. Give the equation of the least-squares regression line. Define any 
variables you use. 


. Suppose a contestant in the fishing tournament catches an Atlantic 
Ocean rockfish that’s 36 centimeters long. Use the model from part 
(a) to predict the fish’s weight. 


SOLUTION: 


a. weight3\=—0.02204+0.246616 (length) 
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Ls 


\/weight = —0.02204 + 0.246616 (length) 


If you write the equation as y3=—0.02204+0.246616x, 
'/y = —0.02204 + 0.246616z, 


make sure to define y=weight 
y = weight and x=length.% = length. 


b. weight34=—0.02204+0.246616(36)=8.856weight“=8.8563 =694.6 
\/weight = —0.02204 + 0.246616 (36) = 8.856 


Bane weight = 8.856° = 694.6 grams 


The least-squares regression line gives the predicted value of the cube 


root of weight. To get the predicted weight, reverse the cube root by 
raising the result to the third power. 


FOR PRACTICE, TRY EXERCISE 35 


When experience or theory suggests that the relationship between two 
variables is described by a power model of the form y=axpY = 2” where 
p? is known, there are two methods for transforming the data to achieve 
linearity. 

.. Raise the values of the explanatory variable xz to the pP power and plot 
the points (xp, y).(e?s y)- 

’. Take the pPth root of the values of the response variable y¥ and plot the 
points (xp, y).(2?; y)- 

What if you have no idea what value of p? to use? You could guess 
and test until you find a transformation that works. Some technology 
comes with built-in sliders that allow you to dynamically adjust the power 
and watch the scatterplot change shape as you do. 

It turns out that there is a much more efficient method for linearizing a 
curved pattern in a scatterplot. Instead of transforming with powers and 
roots, we use logarithms. This more general method works when the data 
follow an unknown power model or any of several other common 
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mathematical models. 


Transforming with Logarithms: Power Models 


To achieve linearity from a power model, we apply the logarithm 
transformation to both variables. Here are the details: 


. A power model has the form y=axp,¥ ~ ax”, where a and p?P are 
constants. 
’. Take the logarithm of both sides of this equation. Using properties of 


logarithms, we get 


log y=log (axp)=logatlog(xp)=log atp log x 
log y = log (az?) = loga + log (x”) = loga + plogaz 


The equation 
log y=log at+p log xlog y= log a-t plog H 


shows that taking the logarithm of both variables results in a linear 
relationship between log x# and log y.¥-Note: You can use base-10 
logarithms or natural (base-e) logarithms to straighten the association. 


3. Look carefully: the powerpP in the power model becomes the slope of 
the straight line that links log y!°8 ¥ to log x.l08 @- 


If a power model describes the relationship between two variables, a 
scatterplot of the logarithms of both variables should produce a linear 
pattern. Then we can fit a least-squares regression line to the transformed 
data and use the linear model to make predictions. Here’s an example. 


EXAMPLE |Go fish! @ 


Transforming with logarithms: Power models 
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RGB Ventures/SuperStock 

/Alamy 
PROBLEM: In the preceding examples, we used powers and roots to 
find a model for predicting the weight of an Atlantic Ocean rockfish 
from its length. We still expect a power model of the form 


. 3 
weight=a(length)3¥ «ight = a(length)” based on geometry. Here once 
again is a scatterplot of the data from the local marine research lab: 


Weight (g) 


Length (cm) 


We took the logarithm (base 10) of the values for both variables. Here is 
some computer output from a linear regression analysis of the 
transformed data. 
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log (weight) 
| 
Residual 


T T T T 
0.6 0.8 1.0 1.2 1.4 1.6 0.6 0.8 1.0 1.2 1.4 1.6 
log (length) log (length) 


Regression Analysis: log(Weight) versus log(Length) 
Predictor Coef SE Coef T e 
Constant -1.89940 0.03799 -49.99 0.000 


log(Length) 3.04942 0.02764 110.31 0.000 
S = 0.0281823 R-Sq = 99.9% R-Sq(adj) = 99.8% 


a. Based on the output, explain why it would be reasonable to use a 
power model to describe the relationship between weight and length 
for Atlantic Ocean rockfish. 


b. Give the equation of the least-squares regression line. Be sure to 
define any variables you use. 


c. Suppose a contestant in the fishing tournament catches an Atlantic 
Ocean rockfish that’s 36 centimeters long. Use the model from part 
(b) to predict the fish’s weight. 


SOLUTION: 


a. The scatterplot of log(weight) versus log(length) has a linear form, 
and the residual plot shows a fairly random scatter of points about 
the residual = 0 line.residual = 0 line. So a power model seems 
reasonable here. 


If a power model describes the relationship between two variables x 


and y, then a linear model should describe the relationship between 
log x and log y. 


b. log(weight)\=—1.89940+3.04942 log(length) 
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V— 


log (weight) = -—1.89940 + 3.04942 log (length) 


If you write the equation as log(y)\=—1.89940+3.04942 log(x), 


i) 


log (y) = —1.89940 + 3.04942 log(), make sure to define y=weight 
y = weight and x=length.% = length. 


c. log(weight)\=—1.89940+3.04942 
log(36)=2.8464log(weight)\=2.8464weight\=102.8464~702.1 grams 


log (weight) = —1.89940 + 3.04942 log (36) — 2.8464 
log (weight) = 2.8464 
weight = 107544 ~ 702.1 grams 


The least-squares regression line gives the predicted value of the 


base-10 logarithm of weight. To get the predicted weight, undo the 
logarithm by raising 10 to the 2.8464 power. 


FOR PRACTICE, TRY EXERCISE 37 


, 
Pa 
6 oe 
-~ 5 e 
F 
B 4 
E 
£ 
7) . 
1 . 
T T T T T 
2.0 2.5 3.0 3.5 4.0 
In (length) 
On the TI-83/84, you can “undo” the logarithm using the function 


keys. To solve log(weight)\=2.8464, 0s (weight) = 2.8464, press 
2.8464 

In addition to base-10 logarithms, you can also use natural (base-e) 
logarithms to transform the variables. Using the same Atlantic Ocean 
rockfish data, here is a scatterplot of In(weight) versus In(length). 
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The least-squares regression line for these data is 


In(weight)/=—4.3735+3.04942 In(length) 
In (weight) = —4.3735 + 3.04942 In (length) 


To predict the weight of an Atlantic Ocean rockfish that is 36 centimeters, 
we Start by substituting 36 for length. 


In(weight)\=—4.3735+3.04942 In(36)=6.55415 
In (weight) = —4.3735 + 3.04942 In (36) = 6.55415 


To get the predicted weight, we then undo the natural logarithm by raising 
e to the 6.55415 power. 


weight\=e6.55415=702.2 gramsWeight = e°°°*!° = 702.2 grams 


On the TI-83/84, you can “undo” the natural logarithm using the 
function keys. To solve In(weight)=6.55415,2(weight) = 6.55415, press 


6.55415 [ENTER] 


Your calculator and most statistical software will calculate the 
logarithms of all the values of a variable with a single command. The 
important thing to remember is that if the relationship between two 
variables is described by a power model, then we can linearize the 
relationship by taking the logarithm of both the explanatory and response 
variables. 


Think About It 


HOW DO WE FIND THE POWER MODEL FOR PREDICTING 
YY from X?X° The least-squares line for the transformed rockfish 
data is 


log(weight)\=—1.89940+3.04942 log(length) 
log (weight) = —1.89940 + 3.04942 log (length) 


If we use the definition of the logarithm as an exponent, we can 
rewrite this equation as 


weight\=10-1.89940 + 3.04942 log(length) 
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RIC ie Qo | 4949 v{ lene, 
weight = 10 1.89940+-3.04942 log(length) 


Using properties of exponents, we can simplify this as follows: 


weight4=10-1.89940- 103.04942log(length)using the fact 
thatbmbn=bm-+nweight4=10-1.89940- 10log(length)3.04942using the fact 
that plog x=logxpweight/=0.0126(length)3.04942using the fact that 
10logx=x 
weight = 10 1.89940 | 1093-04942 log(length) using the fact thatb™b" — p™t" 


weight = 107189940 . jglos(length)3.041942 sing the fact that plog x = log x? 


weight = 0.0126(length)* 4?” 


using the fact that 10logx = z 
This equation is now in the familiar form of a power model y=axp 
y = az" with a=0.0126a = 0.0126 and b=3.04942.b = 3.04942. Notice 
how close the power is to 3, as expected from geometry. 
We could use the power model to predict the weight of a 36- 
centimeter-long Atlantic Ocean rockfish: 
weight = 0.0126(36)3.04942~701.76 grams 
weight = 0.0126(36)°°" ~ 701.76 grams 


This is roughly the same prediction we got earlier. Here is the 
scatterplot of the original rockfish data with the power model added. Note 
how well this model fits the association! 


= § 
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Transforming with Logarithms: Exponential Models 


A linear model has the form y=b0+b1x.¥ = %0 + 61”. The value of y¥ 
increases (or decreases) at a constant rate as x® increases. The slope b1)1 
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describes the constant rate of change of a linear model. That is, for each 1- 
unit increase in x,” the model predicts an increase of b141 units in y.¥- 
You can think of a linear model as describing the repeated addition of a 
constant amount. Sometimes the relationship between y¥ and x is based 
on repeated multiplication by a constant factor. That is, each time x# 
increases by 1 unit, the value of y¥ is multiplied by b.b. An exponential 
model of the form y=abx¥ ~ ab” describes such growth by multiplication. 


Populations of living things tend to grow exponentially if not 
restrained by outside limits such as lack of food or space. More pleasantly 
(unless we’re talking about credit card debt!), money also displays 
exponential growth when interest is compounded each time period. 
Compounding means that the last period’s income earns income in the 
next period. Figure 12.9 shows the balance of a savings account where 
$100 is invested at 6% interest, compounded annually (assuming no 
additional deposits or withdrawals). After x@ years, the account balance y 
Y is given by the exponential model y=100(1.06)x.4¥ ~ 100(1.06)". 


35,000 4 


10,000 4 
5,000 4 
04 


T T T T T 
0 20 40 60 80 100 


Time (years) 
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FIGURE 12.9 Scatterplot of the exponential growth of a $100 investment in a 
savings account paying 6% interest, compounded annually. 


An exponential model of the form y=abx¥ = @" describes the 
relationship between x and y,¥> where a@ and bb are constants. We can 
use logarithms to produce a linear relationship. Start by taking the 
logarithm of each side (we’ll use base 10, but the natural logarithm In 
using base e€ would work just as well). Then use algebraic properties of 
logarithms to simplify the resulting expressions. Here are the details: 


log y=log (abx)taking the logarithm of both sideslog y=log a+log (bx)using 
the property log(mn)=log m+log nlog y=log a+x log busing the property 
log mp=p log m 
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log y = log (ab*) taking the logarithm of both sides 
log y = loga + log (b”) using the property log (mn) = logm + logn 
logy=loga+alogb using the property log m? = plogm 
We can then rearrange the final equation as 
log y=log a+(log b)xlog y = loga + (logb) x 


Notice that log al°8 @ and log blog © are constants because a and b are 
constants. So the equation gives a linear model relating the explanatory 
variable x to the transformed variable log y.¥- Thus, if the relationship 
between two variables follows an exponential model, a scatterplot of the 
logarithm of y¥ against x# should show a roughly linear association. 


EXAMPLE | Moore’s law and computer chips >» 


Transforming with logarithms: Exponential models 


Nandana de silva/Alamy 


PROBLEM: Gordon Moore, one of the founders of Intel Corporation, 
predicted in 1965 that the number of transistors on an integrated circuit 
chip would double every 18 months. This is Moore’s law, one way to 
measure the revolution in computing. Here are data on the dates and 


number of transistors for Intel microprocessors:1& 


Processor Year Transistors | Processor Year Transistors 

Intel 4004 1971 2,300) Pentium III 2001 45,000,000 
Tualatin 

Intel 8008 1972 3,500 | Itanium 2 2002 220,000,000 
McKinley 

Intel 8080 1974 4,500|Itanium 2 2003 410,000,000 
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Intel 8086 


Intel 80286 


Intel 80386 


Intel 80486 


Pentium 


Pentium Pro 


Pentium II 
Klamath 


Pentium II 
Katmai 


Pentium 4 
Willamette 


1978 


1982 


1985 


1989 


1993 


1995 
oo 


1999 


Madison 6M 


29,000|Itanium 2 with9 2004 
MB cache 


134,000] Dual-core 2006 
Itanium 2 


275,000 |Six-core Xeon 2008 
7400 


1,180,235|8-core Xeon 2010 
Nehalem-EX 


3,100,000|10-core Xeon 2011 
Westmere-EX 


5,500,000|61-core Xeon Phi 2012 


7,500,000/18-core Xeon 2014 
Haswell-E5 


9,500,000 |22-core Xeon 2016 
Broadwell-E5 


2000 42,000,000 


592,000,000 


1,700,000,000 


1,900,000,000 


2,300,000,000 


2,600,000,000 


5,000,000,000 
5,060,000,000 


7,200,000,000 


Here is a scatterplot that shows the growth in the number of transistors 
on a computer chip from 1971 to 2016. Notice that we used “years since 
1970” as the explanatory variable. We’ll explain this on page 806. If 
Moore’s law is correct, then an exponential model should describe the 
relationship between the variables. 


a. Here is a scatterplot of the natural (base-ee) logarithm of the number 
of transistors on a computer chip versus years since 1970. Based on 
this graph, explain why it would be reasonable to use an exponential 
model to describe the relationship between number of transistors and 
years since 1970. 
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b. Here is some computer output from a linear regression analysis of the 
transformed data. Give the equation of the least-squares regression 
line. Be sure to define any variables you use. 


c. Use your model from part (b) to predict the number of transistors on 
an Intel computer chip in 2020. 


Predictor Coef SE Coef Tp P 
Constant 7.2272 0.3058 23.64 0.000 
Years since 1970 0.3542 0.0102 34.59 0.000 


S = 0.6653 R-Sq = 98.2% R-Sq(adj) = 98.2% 
SOLUTION: 


a. The scatterplot of In(transistors) versus years since 1970 has a fairly 
linear pattern. So an exponential model seems reasonable here. 


If an exponential model describes the relationship between two 


variables x# and y,¥> we expect a scatterplot of (x,%> In y¥) to be 
roughly linear. 
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b. In(transistors)\=7.2272+0.3542(years since 1970) 
In (transistors) = 7.2272 + 0.3542 (years since 1970) 


2020 is 50 years since 1970. 


c. In(transistors)A=7.2272+0.3542(50) = 24.9372transistors\=e24.9372 
= 67,622,053,360 
In(transistors) = 7.2272 + 0.3542 (50) = 24.9372 


—_— 


transistors = e74-%972 — §7, 622,053, 360 


This model predicts that an Intel chip made in 2020 will have about 
68 billion transistors. 


The least-squares regression line gives the predicted value of In 


(transistors). To get the predicted number of transistors, undo the 
logarithm by raising e€ to the 24.9372 power. 


FOR PRACTICE, TRY EXERCISE 43 


Here is a residual plot for the linear regression in part (b) of the 
example: 


Residual 
S 


T T T T T T 
0 10 20 30 40 50 


Years since 1970 
The residual plot shows a leftover pattern, with the residuals going 
from positive to negative to positive to negative as we move from left to 
right. However, the residuals are small in size relative to the transformed y 
Y-values, and the scatterplot of the transformed data is much more linear 
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than the original scatterplot. We feel reasonably comfortable using this 
model to make predictions about the number of transistors on a computer 
chip. 

Let’s recap this big idea: When an association follows an exponential 
model, the transformation to achieve linearity is carried out by taking the 
logarithm of the response variable. The crucial property of the logarithm 
for our purposes is that if a variable grows exponentially, its logarithm 
grows linearly. 


Think About It 


HOW DO WE FIND THE EXPONENTIAL MODEL FOR 
PREDICTING YY FROM X?X° The least-squares line for the 
transformed data in the computer chip example is 


In(transistors)\=7.2272+0.3542(years since 1970) 
In (transistors) = 7.2272 + 0.3542 (years since 1970) 


If we use the definition of the logarithm as an exponent, we can 
rewrite this equation as 


transistors\=e7.2272+0.3542(years since 1970) 


: 7.99794 42(vears since 1970) 
transistors = e' 22"? 0.3542(years since 1970) 


Using properties of exponents, we can simplify this as follows: 


transistors\=e7.2272 : e0.3542(years since 1970)using the fact that 
bmbn=bm-+ntransistors“=e7.2272 :(e0.3542)(years since 1970)using the 
fact that (bm)n=bmntransistors\=1376.4 : (1.4250)(years since 


1970)simplifying 
—_— aes a ee . Fi 
transistors = e”-2272 . ¢9.3542(years since 1970) using the fact that bb" = b™*" 
ae 7.227: 3542 \(years since 1970) . 
transistors = e727? . (¢93542)(veass since 1979) using the fact that (b")" = b™ 


transistors = 1376.4 - (1.4250) %"* "°° 187) simplifying 


This equation is now in the familiar form of an exponential model 
y=abxY = 2" with a=1376.4a = 1376.4 and b=1.4250.b = 1.4250. Here is 
the scatterplot of the original transistor data with the exponential model 
added: 
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Transistors (billions) 


0 5 10 1S 20 25 30 £35 40 45 
Years since 1970 
We could use the exponential model to predict the number of 
transistors on an Intel chip in 2020: 
transistors\=1376.4(1.4250)50*6.7529 - 1010. 


aE Ea = 50 1 
transistors = 1376.4(1.4250)”” ~ 6.7529 - 10°". This is roughly the same 


prediction we obtained earlier. 


The calculation at the end of the Think About It feature might give you 
some idea of why we used years since 1970 as the explanatory variable in 
the example. To make a prediction, we substituted the value x=50z = 50 
into the equation for the exponential model. This value is the exponent in 
our calculation. If we had used year as the explanatory variable, our 
exponent would have been 2020. Such a large exponent can lead to 
overflow errors on a calculator. 


Putting It All Together: Which Transformation Should 
We Choose? 


Suppose that a scatterplot shows a curved relationship between two 
quantitative variables xz and y.¥: How can we decide whether a power 
model or an exponential model better describes the relationship? The 
following example shows the strategy we should use. 


EXAMPLE | What’s a planet, anyway? Py 


Choosing a model 
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NASA/JPL-Caltech 


PROBLEM: On July 31, 2005, a team of astronomers announced that 
they had discovered what appeared to be a new planet in our solar 
system. They had first observed this object almost two years earlier 
using a telescope at Caltech’s Palomar Observatory in California. 
Originally named UB313, the potential planet is bigger than Pluto and 
has an average distance of about 6.3 billion miles from the sun. (For 
reference, Earth is about 93 million miles from the sun.) Could this new 
astronomical body, now called Eris, be a new planet? 


At the time of the discovery, there were nine known planets in our solar 
system. Here are data on the distance from the sun and period of 
revolution of those planets, along with a scatterplot. Note that distance 
is measured in astronomical units (AU), the number of Earth distances 
the object is from the sun./Z There appears to be a strong curved 
relationship between distance from the sun and period of revolution. 


In August 2006, the International Astronomical Union agreed on a new definition 


of planet. Both Pluto and Eris were classified as “dwarf planets.” 


Distance Period of 
fromsun revolution (Earth 
Planet (AU) years) 
Mercury 0.387 0.241 
Venus 0.723 0.615 
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Earth 1.000 1.000 


Mars 1.524 1.881 
Jupiter 5.203 11.862 
Saturn 9.539 29.456 
Uranus 19.191 84.070 
Neptune 30.061 164.810 
Pluto 39529 248.530 
250 4 
- 200 4 
Ss 150 4 
= 100 
-¥ 
50 
o7 é T I 
0 10 20 30 40 


Distance (AU) 


The following graphs show the results of two different transformations 
of the data. The graph on the left plots the natural logarithm of period 
against distance from the sun for all nine planets. The graph on the right 
plots the natural logarithm of period against the natural logarithm of 
distance from the sun for the nine planets. 


In (period) 
ma NO Ww 
1 
In (period) 
N 
1 


| T T 


0 10 20 30 40 -1 0 1 2 3 
Distance (AU) In (distance) 
a. Based on the scatterplots, would an exponential model or a power 
model provide a better description of the relationship between 
distance and period? Justify your answer. 


b. Here is computer output from a linear regression analysis of the 
transformed data in the graph on the right. Give the equation of the 
least-squares regression line. Be sure to define any variables you use. 
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Predictor Coef SE Coef T P 


Constant 0.0002544 0.0001759 1.45 0.191 


In(distance) 1.49986 0.00008 18598.27 0.000 
S = 0.000393364 R-Sq = 100.0% R-Sq(adj) = 100.0% 


c. Use your model from part (b) to predict the period of revolution for 
Eris, which is about 68.05 AU from the sun. 


d. Here is a residual plot for the linear regression in part (b). Do you 


expect your prediction in part (c) to be too large, too small, or about 
right? Justify your answer. 


0.00075 - 
0.00050 = 


0.00025 — 


Residual 


0.00000 4 


—0.00025 5 


—0,00050 


T T T T T | 
-1 0 1 2 3 4 


In (distance) 


SOLUTION: 


a. The graph of In(period) versus In(distance) is much more linear than 


the graph of In(period) versus distance, so a power model would be 
more appropriate. 


If an association follows an exponential model, a graph of In(y) 


versus xX should be roughly linear. If an association follows a power 
model, a graph of In(y) versus In(x) should be roughly linear. 


b. In(period) = 0.0002544+1.49986 In(distance) 
In (period) = 0.0002544 + 1.49986 In (distance) 


c. In(period)\=0.0002544+1.49986 
In(68.05)=6.330period4=e6.330*561 years 
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In(period) = 0.0002544 + 1.49986 In (68.05) 
6.330 


e330 ~ 


— 
period = 561 years 


To predict the period, we have to undo the natural logarithm by 


raising e to the 6.330 power. 


d. Eris’s value for In(distance) is In(68.05) = 4.22,!n (68.05) = 4.22, 
which would fall at the far right of the residual plot, where all the 
residuals are positive. Because it is likely that residual=actual y 
—predicted y>o,residual = actual y — predicted y > 0, we would 


expect our prediction to be too small. 


FOR PRACTICE, TRY EXERCISE 47 


Here is the scatterplot of the original planetary data with the power 
model added. It seems remarkable that period of revolution is closely 
related to the 1.5 power of distance from the sun. Johannes Kepler made 
this fascinating discovery about 400 years ago without the aid of modern 
technology—a result known as Kepler’s third law. 


Period (yr) 


0 5 10 15 20 25 30 35 40 
Distance (AU) 


What if the scatterplots of (log x,” log y¥) and (x,”> log y¥) both look 
linear? Fit a least-squares regression line to both sets of transformed data. 
Then compare residual plots and look for the one with the most random 
scatter. If the residual plots look roughly the same, use the values of ss and 
12r* to decide whether a power model or an exponential model is a better 
choice. 
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We have used statistical software to do all the transformations and 
linear regression analysis in this section so far. Now let’s look at how the 
process works on a graphing calculator. 


32. Technology Corner | TRANSFORMING TO 
ACHIEVE LINEARITY 


TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/tps6e. 

We'll use the planet data to illustrate a general strategy for performing 
transformations with logarithms on the TI-83/84. A similar approach 
could be used for transforming data with powers and roots. 


e Enter the values of the explanatory variable in L1 and the values of 
the response variable in L2. Make a scatterplot of y¥ versus x® and 
confirm that there is a curved pattern. 


ORMAL FLOAT AUTO REAL RADIAN CL ra] 


Ploti:LasL2 


e Define L3 to be the natural logarithm (In) of L1 and L4 to be the 
natural logarithm of L2. To see whether a power model fits the 
original data, make a plot of In y¥ (L4) versus In x (L3) and look 
for linearity. To see whether an exponential model fits the original 
data, make a plot of In y¥ (L4) versus xz (L1) and look for linearity. 


ORMAL FLOAT AUTO REAL RADIAN CL i INORMAL FLOAT AUTO REAL RADIAN CL i] 
Ploti:LosL4 Ploti:LisL4 
o® o © 
a o 
o o 
a o 
o 
4 4 + , 
a 
# 

X=".9493306 Y="1.422958 X=.387 Y="1.422958 


e Ifa linear pattern is present, calculate the equation of the least- 


1771 


squares regression line. For the planet data, we executed the 
command LinReg(a+bx)L3,L4.LinReg (a + bx) L3, L4. 


NORMAL FLOAT AUTO REAL RADIAN CL fl 


y=atbx 


a=2.5444423E -4 
b=1.499860986 


r2=, 9999999798 
r=.9999999899 


e Construct a residual plot to look for any departures from the linear 
pattern. For Xlist, enter the list you used as the explanatory variable 
in the linear regression calculation. For Ylist, use the RESID list 
stored in the calculator. For the planet data, we used L3 as the Xlist. 


NORMAL FLOAT AUTO REAL RADIAN CL fl 


Ploti:LosRESID 


X=".9493306 ¥=6.5112E-4 


CHECK YOUR UNDERSTANDING 


One sad fact about life is that we’ ll all die someday. Many adults 
plan ahead for their eventual passing by purchasing life insurance. Many 
different types of life insurance policies are available. Some provide 
coverage throughout an individual’s life (whole life), while others last 
only for a specified number of years (term life). The policyholder makes 
regular payments (premiums) to the insurance company in return for the 
coverage. When the insured person dies, a payment is made to 
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designated family members or other beneficiaries. 


How do insurance companies decide how much to charge for life 
insurance? They rely on a staff of highly trained actuaries—people with 
expertise in probability, statistics, and advanced mathematics—to 
establish premiums. For an individual who wants to buy life insurance, 
the premium will depend on the type and amount of the policy as well as 
personal characteristics like age, sex, and health status. 


The table shows monthly premiums for a 10-year term-life insurance 
policy worth $1,000,000.48 


Age (years) Monthly premium 
AO $29 
45 $46 
50 $68 
55 $106 
60 $157 
65 $257 


The output shows three possible models for predicting monthly 
premium from age. Option 1 is based on the original data, while Options 
2 and 3 involve transformations of the original data. Each set of output 
includes a scatterplot with a least-squares regression line added and a 
residual plot. 


(Seater Po) 


(Seater Po) (Seater Pt] ) 


Residual 
~ 


kw 


—— premium =-343 + 8.63 (age) —— In(premium) =—12.98 + 4.416 In(age) —— In(premium) =-0.063 + 0.0859 (age) 


1. Use each model to predict how much a 58-year-old would pay for 
such a policy. 
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2. What type of function—linear, power, or exponential—best describes 
the relationship between age and monthly premium? Explain your 
answer. 


Section 12.2. Summary 


e Curved relationships between two quantitative variables can sometimes 
be changed into linear relationships by transforming one or both of the 
variables. Once we transform the data to achieve linearity, we can fit a 
least-squares regression line to the transformed data and use this linear 
model to make predictions. 


e When theory or experience suggests that the relationship between two 
variables follows a power model of the form y=axp¥ = @2” 
transformations involving powers and roots can linearize a curved 
pattern in a scatterplot. 


. Option 1: Raise the values of the explanatory variable x% to the power 
p, then look at a graph of (xp,y).(2”: y)- 
« Option 2: Take the pth root of the values of the response variable y,¥: 
then look at a graph of (x,yp). (x, 4/9) 
e Another useful strategy for straightening a curved pattern in a scatterplot 
is to take the logarithm of one or both variables. When a power model 


describes the relationship between two variables, a plot of log y¥ versus 
log x (or In y¥ versus In x#) should be linear. 


e For an exponential model of the form y=abx,¥ = 2"; the predicted 
values of the response variable are multiplied by a factor of b for each 
increase of 1 unit in the explanatory variable. When an exponential 
model describes the relationship between two variables, a plot of log y¥ 
versus x& (or In y¥ versus x) should be linear. 


12.2 Technology Corner 
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TI-Nspire and other technology instructions are on the book’s website at 


highschool.bfwpub.com/ps6e. 
32. Transforming to achieve linearity Page 809 


Section 12.2 Exercises 


33. pg 797 |p) The swinging pendulum Mrs. Hanrahan’s precalculus 


class collected data on the length (in centimeters) of a pendulum and 
the time (in seconds) the pendulum took to complete one back-and- 
forth swing (called its period). The theoretical relationship between a 
pendulum’s length and its period is 


 ——— 
period =—— ,/length 


period=2nglength v9 
where g¥% is a constant representing the acceleration due to gravity (in 


this case, g=980 cm/s29 = 980¢m/s") Here is a graph of period 


f(T 
versus length, V !°®8th; along with output from a linear regression 


analysis using these variables. 


2.2 4 
J 

2.0 

1.8 4 
@ 16- 
3 
Ss 14- - 
5 . 
& 12-4 er 

1.0 4 e 

ee 
08+ ° 
0.6 -— T T l l ] T T 
4 5 6 Ff 8 9 10 11 
Viength 


Regression Analysis: Cength \/ length, period) 


Predictor Coef SE Coef T P 
Constant —0.08594 0.05046 -1.70 0.123 
sqrt(length) 0.209999 0.008322 25.23 0.000 
S = 0.0464223 R-Sq=98.6% R-Sq(adj) = 98.5% 
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a. Give the equation of the least-squares regression line. Define any 
variables you use. 


b. Use the model from part (a) to predict the period of a pendulum 
with length 80 cm. 


34. Boyle’s law If you have taken a chemistry or physics class, then you 
are probably familiar with Boyle’s law: for gas in a confined space 
kept at a constant temperature, pressure times volume is a constant (in 
symbols, PV=kPV = k). Students in a chemistry class collected data 
on pressure and volume using a syringe and a pressure probe. If the 


true relationship between the pressure and volume of the gas is PV=k, 
PV =k, then 


P=k= 
P=k 1V y 


] 
Here is a graph of pressure versus 1volume, volume’ along with output 


from a linear regression analysis using these variables: 


3.0 - 


25> 


Pressure (atm) 
N 
oS 
| 


ai a A Te A Pe 
0.050 0.075 0.100 0.125 0.150 0.175 


1/V 


Regression Analysis: 


1 
( volume ’ pressure) 


(1volume,pressure) 

Predictor Coef SE Coef T P 
Constant 0.36774 0.04055 9:07 0.000 
1/V 15.8994 0.4190 37.95 0.000 


S = 0.044205 R-Sq=99.6% R-Sq(adj) = 99.5% 


a. Give the equation of the least-squares regression line. Define any 
variables you use. 
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b. Use the model from part (a) to predict the pressure in the syringe 
when the volume is 17 cubic centimeters. 


35. pg 799 |») The swinging pendulum Refer to Exercise 33. Here is 


a graph of period? versus length, along with output from a linear 
regression analysis using these variables. 


| =f 
4 am 
B34 
5 
Pe 
2 =I o 
1- ad iis 
e* 
T T I I I T T T T T 
20 30 40 SO 60 70 80 90 100 110 
Length (cm) 
Regression Analysis: (length, period?) 
Predictor Coef SE Coef T P 
Constant —0.15465 0.05802 —2.67 0.026 


Length 0.042836 0.001320 32.46 0.000 
S = 0.105469 R-Sq=99.2% R-Sq(adj) = 99.1% 


a. Give the equation of the least-squares regression line. Define any 
variables you use. 


b. Use the model from part (a) to predict the period of a pendulum 
with length 80 centimeters. 


] 
36. Boyle’s law Refer to Exercise 34. Here is a graph of 1Pressure Pressure 


versus volume, along with output from a linear regression analysis 
using these variables: 
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UP 
° 
a 

| 


T T T T T T 
5.0 7.5 10.0 12.5 15.0 17.5 20.0 


Volume (cm*) 


Regression Analysis: 


volume, — ) 
’ Pressure 


(volume,1Pressure) 
Predictor Coef SE Coef T P 
Constant 0.100170 0.003779 26.51 0.000 
Volume  0.0398119 0.0002741 145.23 0.000 
S = 0.003553 R-Sq = 100.0% R-Sq(adj) = 100.0% 


a. Give the equation of the least-squares regression line. Define any 
variables you use. 


b. Use the model from part (a) to predict the pressure in the syringe 


when the volume is 17 cubic centimeters. 


37. pg 800 |») The swinging pendulum Refer to Exercise 33. We took 


the logarithm (base 10) of the values for both length and period. Here 
is some computer output from a linear regression analysis of the 
transformed data. 


1778 


log (period) 
S 
| 


l 
213 14 15 16 ff 18 19 20 21 
log (length) 


Residual 
oS 
= 
+ 


12 13 #14 15 16 17 #18 #19 20 21 
log (length) 


Regression Analysis: log(Period) versus log(Length) 


Predictor Coef SE Coef T 
Constant —0.73675 0.03808 —19.35 0.000 
log(Length) 0.51701 0.02511 20.59 0.000 


S =0.0185568 R-Sq=97.9% R-Sq(adj) = 97.7% 


a. Based on the output, explain why it would be reasonable to use a 
power model to describe the relationship between the length and 
period of a pendulum. 


b. Give the equation of the least-squares regression line. Be sure to 
define any variables you use. 


c. Use the model from part (b) to predict the period of a pendulum 
with length 80 cm. 


38. Boyle’s law Refer to Exercise 34. We took the logarithm (base 10) of 
the values for both volume and pressure. Here is some computer 
output from a linear regression analysis of the transformed data. 
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39. 


log (pressure) 


0.8 0.9 1.0 1.1 1.2 1.3 
log (volume) 
0.0050 
0.0025 - 
z 0.0000 
- 
$ _0.0025 4 
-4 * ~ 
—0.0050 — 
—0.0075 - 
T I I T T 
0.8 0.9 1.0 1.1 1.2 1.3 
log (volume) 


Regression Analysis: log(Pressure) versus log(Volume) 


Predictor Coef SE Coef T P 
Constant 1.11116 0.01118 99.39 0.000 
log(Volume) —0.81344 0.01020 —79.78 0.000 


S = 0.00486926 R-Sq=99.9% R-Sq(adj) = 99.9% 


a. Based on the output, explain why it would be reasonable to use a 
power model to describe the relationship between pressure and 
volume. 


b. Give the equation of the least-squares regression line. Be sure to 


define any variables you use. 


Use the model from part (b) to predict the pressure in the syringe 
when the volume is 17 cubic centimeters. 


Brawn versus brain How is the weight of an animal’s brain related 
to the weight of its body? Researchers collected data on the brain 

weight (in grams) and body weight (in kilograms) for 96 species of 
mammals./2 The following figure is a scatterplot of the logarithm of 
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40. 


brain weight against the logarithm of body weight for all 96 species. 
The least-squares regression line for the transformed data is 


log y\=1.01+0.72 log xlog y = 1.01 + 0.72 log « 


log (brain weight) 


0 1 2 3 
log (body weight) 


Based on footprints and some other sketchy evidence, some people 
believe that a large ape-like animal, called Sasquatch or Bigfoot, lives 
in the Pacific Northwest. Bigfoot’s weight is estimated to be about 
127 kilograms (kg). How big do you expect Bigfoot’s brain to be? 


Determining tree biomass It is easy to measure the diameter at 
breast height (in centimeters) of a tree. It’s hard to measure the total 
aboveground biomass (in kilograms) of a tree, because to do this, you 
must cut and weigh the tree. The biomass is important for studies of 
ecology, so ecologists commonly estimate it using a power model. 
The following figure is a scatterplot of the natural logarithm of 
biomass against the natural logarithm of diameter at breast height 
(DBH) for 378 trees in tropical rain forests.22 The least-squares 
regression line for the transformed data is 


In yA=—2.00+2.42 In xAny =~—2.00 + 2.42Inz 


Use this model to estimate the biomass of a tropical tree 30 cm in 
diameter. 
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41. 


42. 


In (biomass) 


2.5 3 3.5 4 4.5 5 


In(DBH) 


Braking distance How is the braking distance for a motorcycle 
related to the speed at which the motorcycle was traveling when the 
brake was applied? Statistics teacher Aaron Waggoner gathered data 
to answer this question. The table shows the speed (in miles per hour) 
and the distance needed to come to a complete stop when the brake 
was applied (in feet). 


Speed (mph) Distance (ft) Speed (mph) Distance (ft) 
6 1.42 a2 52.08 
9 4.92 40 84.00 
19 18.00 48 110.33 
30 44.75 


a. Transform both variables using logarithms. Then calculate and 
state the least-squares regression line using the transformed 
variables. 


b. Use the model from part (a) to calculate and interpret the residual 


for the trial when the motorcycle was traveling at 48 mph. 


Braking distance, again How is the braking distance for a car related 
to the amount of tread left on the tires? Here are the braking distances 
(measured in car lengths) for a car making a panic stop in standing 
water, along with the tread depth of the tires (in 1/32 inch):24 
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Tread depth Braking distance (car 
(1/32 in.) lengths) 


11 a 
9.8 
10.1 
10.4 
10.8 
11.2 
11.8 
12.4 
13.6 
15.2 
18.3 


—_ 
So 


PNW HKU DN CO OO 


a. Transform both variables using logarithms. Then calculate and 
State the least-squares regression line using the transformed 
variables. 


b. Use the model from part (a) to calculate and interpret the residual 
for the trial when the tread depth was 3/32 inch. 


43. pg 804 |») Killing bacteria Expose marine bacteria to X-rays for 


time periods from 1 to 15 minutes. Here is a scatterplot showing the 


number of surviving bacteria (in hundreds) on a culture plate after 


each exposure time:22 


Count (hundreds of bacteria) 
Ss 4 
ra ran 


0 ze 4 6 8 10 12 14 16 
Time (min) 


a. Below is a scatterplot of the natural logarithm of the number of 
surviving bacteria versus time. Based on this graph, explain why it 
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would be reasonable to use an exponential model to describe the 
relationship between count of bacteria and time. 


In (count) 


6.0 4 


5.5 4 


Time (min) 


b. Here is output from a linear regression analysis of the transformed 
data. Give the equation of the least-squares regression line. Be sure 
to define any variables you use. 


Predictor 
Constant 


Time 


S=0.110016 R-Sq = 98.8% 


Coef SE Coef T P 
5.97316 0.05978 99.92 0.000 
—0.218425 0.006575 -33.22 0.000 


R-Sq(adj) = 98.7% 


c. Use your model to predict the number of surviving bacteria after 


17 minutes. 


44. Light through water Some college students collected data on the 
intensity of light at various depths in a lake. Here is a scatterplot of 


their data: 


Light intensity (lumens) 


528 
fa} | 


100 + 


60 - 
40 - 


T ] T T T 
6 7 8 9 10 11 


Depth (m) 


a. At top right is a scatterplot of the natural logarithm of light 
intensity versus depth. Based on this graph, explain why it would 
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be reasonable to use an exponential model to describe the 


relationship between light intensity and depth. 
5.0 


4.5 - 


In (intensity) 


Depth (m) 


b. Here is computer output from a linear regression analysis of the 
transformed data. Give the equation of the least-squares regression 


line. Be sure to define any variables you use. 


Predictor Coef SE Coef rT 
Constant 6.78910 0.00009 78575.46 
Depth (m) -0.333021 0.000010 -31783.44 


P 
0.000 
0.000 


S = 0.000055 R-Sq=100.0% R-Sq(adj) = 100.0% 


c. Use your model to predict the light intensity at a depth of 12 


meters. 


45. European population growth Many populations grow 


exponentially. Here are the data for the estimated population of 
Europe (in millions) from 1700 to 2012.22 The dates are recorded as 
years since 1700 so that x=312@ = 312 is the year 2012. 


Years since 1700 O 50 100 150 200 


Population (in 125 163 203 276 408 
millions) 


Years since 1700 = 250 299 308 310 312 


Population (in 547 729 732 738 740 
millions) 


a. Use a logarithm to transform population size. Then calculate and 
state the least-squares regression line using the transformed 
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variable. 


b. Use your model from part (a) to predict the population size of 
Europe in 2020. 


46. North American population growth Many populations grow 
exponentially. Here are the data for the estimated population of North 
America (in millions) from 1700 to 2012.24 The dates are recorded as 
years since 1700 so that x=312% = 312 is the year 2012. 


Years since 1700 O 50 100 150 200 


Population (in 2 2 Ff 26 82 
millions) 


Years since 1700 = 250 299 308 310 312 


Population (in 172 307 337 345 351 
millions) 


a. Use a logarithm to transform population size. Then calculate and 
State the least-squares regression line using the transformed 
variable. 


b. Use your model from part (a) to predict the population size of 
North America in 2020. 
47. pg 807 |») Putting success How well do professional golfers putt 


from various distances to the hole? The scatterplot shows various 
distances to the hole (in feet) and the percent of putts made at each 
distance for a sample of golfers.22 


100 5 
80 = 
60 - 


40 4 


Percent made 


T 
2 4 6 8 10 12 14 = 16 18 20 
Distance (ft) 


The graphs show the results of two different transformations of the 
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data. The first graph plots the natural logarithm of percent made 
against distance. The second graph plots the natural logarithm of 
percent made against the natural logarithm of distance. 


46-7 
44- 
4.2 4 
4.0 4 
3.8 4 
3.6 4 
3.4 4 
3.2 - 
3.0 4 
2.8 4 
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In (percent made) 
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Distance (ft) 


In (percent made) 
Ww 
os 
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In (distance) 


a. Based on the scatterplots, would an exponential model or a power 
model provide a better description of the relationship between 
distance and percent made? Justify your answer. 


b. Here is computer output from a linear regression analysis of 
In(percent made) and distance. Give the equation of the least- 
squares regression line. Be sure to define any variables you use. 


Predictor Coef SE Coef T P 
Constant 4.6649 0.0825 56.511 0.000 
Distance —0.1091 0.0067 —16.238 0.000 


S = 0.160385 R-Sq=93.9% R-Sq(adj) = 93.4% 


c. Use your model from part (b) to predict the percent made for putts 
of 21 feet. 


d. Here is a residual plot for the linear regression in part (b). Do you 
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expect your prediction in part (c) to be too large, too small, or 
about right? Justify your answer. 


0.3 4 
0.2 4 


Residual 


, 4 6 8 10 12 14 16 18 20 
Distance (ft) 

48. Counting carnivores Ecologists look at data to learn about nature’s 
patterns. One pattern they have identified relates the size of a 
carnivore (body mass in kilograms) to how many of those carnivores 
exist in an area. A good measure of “how many” is to count 
carnivores per 10,000 kg of their prey in the area. The scatterplot 
shows this relationship between body mass and abundance for 25 


carnivore species.2© 


e 1800 5 
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1200 4 
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= 


0 1 ee meee - 7 ° 
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The following graphs show the results of two different 
transformations of the data. The first graph plots the logarithm (base 
10) of abundance against body mass. The second graph plots the 
logarithm (base 10) of abundance against the logarithm (base 10) of 
body mass. 
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log (abundance) 
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log (abundance) 


-1.0 -0.5 0.0 0.5 1.0 1.5 2.0 2.5 
log (body mass) 


a. Based on the scatterplots, would an exponential model or a power 
model provide a better description of the relationship between 
abundance and body mass? Justify your answer. 


b. Here is computer output from a linear regression analysis of 
log(abundance) and log(body mass). Give the equation of the least- 
squares regression line. Be sure to define any variables you use. 


Predictor Coef SE Coef T P 
Constant 1.9503 0.1342 14.53 0.000 
log(body mass)  -1.04811 0.09802 -10.69 0.000 
S = 0.423352 R-Sq = 83.3% R-Sq(adj) = 82.5% 


c. Use your model from part (b) to predict the abundance of black 
bears, which have a body mass of 92.5 kg. 


d. Here is a residual plot for the linear regression in part (b). Do you 
expect your prediction in part (c) to be too large, too small, or 
about right? Justify your answer. 
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49. Heart weights of mammals Here are some data on the hearts of 
various mammals:22 


Heart 

Length of cavity of weight 
Mammal left ventricle (cm) (g) 
Mouse 0.55 0.13 
Rat 1.00 0.64 
Rabbit 2.20 5.80 
Dog 4.00 102.00 
Sheep 6.50 210.00 
Ox 12.00 2030.00 


Horse 16.00 3900.00 


a. Make an appropriate scatterplot for predicting heart weight from 
length. Describe what you see. 


b. Use transformations to linearize the relationship. Does the 
relationship between heart weight and length seem to follow an 
exponential model or a power model? Justify your answer. 


c. Perform least-squares regression on the transformed data. Give the 
equation of your regression line. Define any variables you use. 


d. Use your model from part (c) to predict the heart weight of a 
human who has a left ventricle 6.8 cm long. 


50. Click-through rates Companies work hard to have their website 
listed at the top of an Internet search. Is there a relationship between a 
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website’s position in the results of an Internet search (1=top 
position,2=2nd position, 1 = top position, 2 = 2nd position, etc.) and 
the percentage of people who click on the link for the website? Here 


are click-through rates for the top 10 positions in searches on a 


mobile device:22 


Click- Click- 
through rate through 
Position (%) Position rate (%) 
1 23.53 6 3.80 
2 14.94 7 279 
3 11.19 8 211 
4 7.47 9 157 
5 5.29 10 1.18 


a. Make an appropriate scatterplot for predicting click-through rate 
from position. Describe what you see. 


b. Use transformations to linearize the relationship. Does the 
relationship between click-through rate and position seem to 
follow an exponential model or a power model? Justify your 
answer. 


c. Perform least-squares regression on the transformed data. Give the 
equation of your regression line. Define any variables you use. 


d. Use your model from part (c) to predict the click-through rate for a 
website in the 11th position. 


Multiple Choice Select the best answer for Exercises 51—54. 


51. 51. Suppose that the relationship between a response variable y¥ and 
an explanatory variable x is modeled by y=2.7(0.316)x. 


y = 2.7(0.316)". which of these scatterplots would approximately 
follow a straight line? 


a. A plot of y¥ against xx 


b. A plot of y¥ against log xlo8 # 
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c. A plot of log yl°8 ¥ against xa’ 
d. A plot of log y!°S ¥ against log xlog # 


e. A plot of yv¥ against xz 


52. Some high school physics students dropped a ball and measured the 
distance fallen (in centimeters) at various times (in seconds) after its 
release. If you have studied physics, you probably know that the 
theoretical relationship between the variables is distance=490(time)2. 
distance = 490(time)°. Which of the following scatterplots would not 

approximately follow a straight line? 


a. A plot of distance versus (time)2(time)° 


b. A plot of distance Vv distance versus time 


c. A plot of distance versus timev time 
d. A plot of In(distance) versus In(time) 


e. A plot of log(distance) versus log(time) 


53. Students in Mr. Handford’s class dropped a kickball beneath a motion 
detector. The detector recorded the height of the ball (in feet) as it 
bounced up and down several times. Here is computer output from a 
linear regression analysis of the transformed data of log(height) 
versus bounce number. Predict the highest point the ball reaches on 
its seventh bounce. 


Predictor Coef SE Coef T P 
Constant 0.45374 0.01385 32.76 0.000 
Bounce —0.117160 0.004176 -28.06 0.000 
S = 0.0132043 R-Sq=99.6% R-Sq(adj) = 99.5% 


a. 0.35 feet 
b. 2.26 feet 
c. 0.37 feet 
d. 2.32 feet 
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34. 


59. 


e. 0.43 feet 


A scatterplot of y¥ versus x® shows a positive, nonlinear association. 
Two different transformations are attempted to try to linearize the 
association: using the logarithm of the y¥-values and using the square 
root of the y¥-values. Two least-squares regression lines are 


calculated, one that uses x2 to predict log(y)!°8 (¥) and the other that 


uses x& to predict y¥¥ Which of the following would be the best 
reason to prefer the least-squares regression line that uses x# to 


predict log(y)?!o8(y)? 

a. The value of r2r” is smaller. 

b. The standard deviation of the residuals is smaller. 
c. The slope is greater. 

d. The residual plot has more random scatter. 


e. The distribution of residuals is more Normal. 
Recycle and Review 


Shower time (1.3, 2.2, 6.3, 7.3) Marcella takes a shower every 
morning when she gets up. Her time in the shower varies according to 
a Normal distribution with mean 4.5 minutes and standard deviation 
0.9 minute. 


a. Find the probability that Marcella’s shower lasts between 3 and 6 
minutes on a randomly selected day. 


b. If Marcella took a 7-minute shower, would it be classified as an 
outlier by the 1.5/QR rule? Justify your answer. 


c. Suppose we choose 10 days at random and record the length of 
Marcella’s shower each day. What’s the probability that her 
shower time is 7 minutes or greater on at least 2 of the days? 


d. Find the probability that the mean length of her shower times on 
these 10 days exceeds 5 minutes. 
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56. Tattoos (8.2) What percent of U.S. adults have one or more tattoos? 
The Harris Poll conducted an online survey of 2302 adults and found 
29% of respondents had at least 1 tattoo. According to the published 
report, “Respondents for this survey were selected from among those 
who have agreed to participate in Harris Interactive surveys.”22 

Explain why it would not be appropriate to use these data to construct 
a 95% confidence interval for the proportion of all U.S. adults who 
have tattoos. 

Exercises 57 and 58 refer to the following setting. About 1100 high school 

teachers attended a weeklong summer institute for teaching AP® Statistics 

classes. After learning of the survey described in Exercise 56, the teachers 
in the AP® Statistics class wondered whether the results of the tattoo 
survey would be similar for teachers. They designed a survey to find out. 

The class opted to take a random sample of 100 teachers at the institute. 

One of the questions on the survey was: Do you have any tattoos on your 

body? 


(Circle one) YES NO 


57. Tattoos (8.2, 9.2) Of the 98 teachers who responded, 23.5% said that 
they had one or more tattoos. 


a. Construct and interpret a 95% confidence interval for the true 
proportion of all teachers at the AP® institute who would say they 
have tattoos. 


b. Does the interval in part (a) provide convincing evidence that the 
proportion of all teachers at the institute who would say they have 
tattoos is different from 0.29 (the value cited in the Harris Poll 
report)? Justify your answer. 


c. Two of the selected teachers refused to respond to the survey. If 
both of these teachers had responded, could your answer to part (b) 
have changed? Justify your answer. 


58. Tattoos (4.1) One of the first decisions the class had to make was 
what kind of sampling method to use. 


a. They knew that a simple random sample was the “preferred” 
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method. With 1100 teachers in 40 different sessions, the class 
decided not to use an SRS. Give at least two reasons why you think 
they made this decision. 


. The AP® Statistics class believed that there might be systematic 
differences in the proportions of teachers who had tattoos based on 
the subject areas that they taught. What sampling method would 
you recommend to account for this possibility? Explain a statistical 
advantage of this method over an SRS. 
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MM — FRAPPY! FREE RESPONSE 


AP® PROBLEM, 
YAY! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, concise 
response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods you 
use, because you will be scored on the correctness of your methods 
as well as on the accuracy and completeness of your results and 
explanations. 


A random sample of 14 golfers was selected from the 147 players on 
the Ladies Professional Golf Association (LPGA) tour in a recent 
year. The total amount of money won during the year (in dollars) and 
the scoring average for each player in the sample was recorded. 
Lower scoring averages are better in golf. 


The scatterplot below displays the relationship between money 
and scoring average for these 14 players. 
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04 e 6 «© e 


70 71 72 73 74 75 

Scoring average 
a. Explain why it would not be appropriate to construct a confidence 
interval for the slope of the least-squares regression line relating 
money to scoring average. 
A scatterplot of the natural logarithm of money versus scoring 
average is shown, along with some computer output for a least- 
squares regression using the transformed data. 


z 
& 
Scoring average 
Predictor Coef SE Coef T P 
Constant 77.5937 7.035 11.02 0.000 


Scoring average ~—0.90470 0.09679 -9.35 0.000 
S = 0.475059 R-Sq = 87.9% R-Sq(adj) = 86.9% 


b. Predict the amount of money won for an LPGA golfer with a 
scoring average of 70. 


c. Calculate and interpret a 95% confidence interval for the slope of 
the least-squares regression line relating In(money) to scoring 
average. Assume that the conditions for inference have been met. 


After you finish, you can view two example solutions on the book’s 
website (highschool.bfwpub.com/tps6e). Determine whether you 
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think each solution is “complete,” “substantial,” “developing,” or 
“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide you with a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own 


score. 
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Chapter 12 Review 


Section 12.1: Inference for Linear Regression 


In this section, you learned that the sample regression line yA=b0+b1x 
y = bo + bi estimates the population (true) regression line Ly=B0+B1x. 
Hy = 80 + Pi®- The sampling distribution of the sample slope b11 is the 
foundation for doing inference about the population (true) slope B1.1- 


When the conditions are met, the sampling distribution of b151 has an 
approximately Normal distribution with mean pb1=61/" = 1 and 


b orn 
ry 


Co 
standard deviation ob1=ooxn. 


There are five conditions for performing inference about the slope of a 
population (true) least-squares regression line. Remember them with the 
acronym LINER. 


e The linear condition says that the mean value of the response variable 
uy» falls on the population (true) regression line py=B0+B1x. 
Hy = Bo + Pie. To check the linear condition, verify that there are no 
leftover curved patterns in the residual plot. 


e The independent condition says that individual observations are 
independent of each other. To check the independent condition, verify 
that the sample size is less than 10% of the population size when 
sampling without replacement from a population. Also, convince 
yourself that knowing the response for one individual won’t help you 
predict the response for another individual. 


e The Normal condition says that the distribution of y¥-values is 
approximately Normal for each value of x.£- To check the Normal 
condition, graph a dotplot, histogram, stemplot, boxplot, or Normal 
probability plot of the residuals and verify that there are no outliers or 
strong skewness. 


e The equal SD condition says that for each value of x,”> the distribution 
of y¥ should have the same standard deviation. To check the equal SD 
condition, verify that the residuals have roughly the same amount of 
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scatter around the residual=Oresidual = 0 line for each value of xz on 
the residual plot. 


e The random condition says that the data are from a random sample or a 
randomized experiment. To check the random condition, verify that 
randomness was properly used in the data collection process. 

To construct and interpret a confidence interval for the slope of the 
population (true) least-squares regression line, follow the familiar four- 
step process. The formula for the confidence interval is b1+t*SEb1, 

b) + SE», ; where t#t* is the tt critical value with df=n-2.4f = n — 2. 

The standard error of the slope SEb1°2h describes how far the sample 

slope typically varies from the population (true) slope in repeated random 

samples or random assignments. The formula for the standard error of the 


slope is SEb1=ssxn-1. ’\ ~ seVi-1" The standard error of the slope is 


typically provided with standard computer output for least-squares 
regression. 
When you conduct a significance test for the slope of the population 


(true) least-squares regression line, use the standardized test statistic 

b, —hypothesized slope 
t=b1—-hypothesized slopeSEb1 SEp, with df=n-2. 
df = n — 2. In most cases, the null hypothesis is H0:81=0.40 : 81 = 0. 
This hypothesis says that a straight-line relationship between x# and y¥Y is 
no better at predicting y¥Y than using the mean value y .Y: The value of the 
standardized test statistic for a test of HO:81=0,/0 : 61 = 9; along with a 
two-sided PP-value, is typically provided with standard computer output 


for least-squares regression. 


Section 12.2: Transforming to Achieve Linearity 


When the association between two variables is nonlinear, transforming one 
or both of the variables can result in a linear association. 


If the association between two variables follows a power model in the 
form y=axp,¥ = @2”, there are several transformations that will result in a 
linear association. 


e Raise the values of xz to the power of p? and plot y¥ versus xp.2”. 


e Calculate the pP?th root of the y¥-values and plot ypv¥ versus X.@. 
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e Calculate the logarithms of the x%-values and the y¥-values, and plot 
log(y)los(y) versus log(x).l08(# )- You can use base-10 logarithms (log) 
or base-ee logarithms (In). 

If the association between two variables follows an exponential model 
in the form y=abx,¥ = 25"; transform the data by computing the 
logarithms of the y¥-values and plot log(y)os(y ) versus xa (or In(y¥) 
versus xX), 

Once you have achieved linearity, calculate the equation of the least- 
squares regression line using the transformed data. Remember to include 
the transformed variables when you are writing the equation of the line. 
Likewise, when using the line to make predictions, make sure that the 
prediction is in the original units of y.¥: If you transformed the y¥ 
variable, you will need to undo the transformation after using the least- 
squares regression line. 


To decide between two or more transformations, choose the one that 
produces the most linear association and whose residual plot has the most 
random scatter. 


What Did You Learn? 


Related 
Example on Relevant Chapter 
Learning Target Section Page(s) Review Exercise(s) 
Check the conditions 124 775 Higa 


for performing 
inference about the 
slope 811 of the 
population (true) 
regression line. 
Interpret the values of 12.1 Vid R12.1 
b0,b1,s0, 61, § and SE 
b 1 Lb in context, 
and determine these 
values from computer 
output. 
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Construct and interpret 
a confidence interval 
for the slope B11 of 
the population (true) 
regression line. 


Perform a significance 
test about the slope 
B11 of the population 
(true) regression line. 


Use transformations 
involving powers and 
roots to find a power 
model that describes 
the relationship 
between two 
quantitative variables, 
and use the model to 
make predictions. 


Use transformations 
involving logarithms 
to find a power model 
that describes the 
relationship between 
two quantitative 
variables, and use the 
model to make 
predictions. 


Use transformations 
involving logarithms 
to find an exponential 
model that describes 
the relationship 
between two 
quantitative variables, 
and use the model to 
make predictions. 


Determine which of 
several 
transformations does a 
better job of producing 


12.1 779 
12.1 783 
2 797, 799 
12.2 800 
12,2 804 
12.2 807 
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a linear relationship. 
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Chapter 12 Review Exercises 


These exercises are designed to help you review the important ideas and 
methods of the chapter. 


Exercises R12.1—R12.4 refer to the following setting. Do taller students 
require fewer steps to walk a fixed distance? The scatterplot shows the 
relationship between x=height® = height (jn inches) and y=number 

y = number of steps required to walk the length of a school hallway for a 
random sample of 36 students at a high school. 


Number of steps 


60 62 64 66 68 70 72 74 
Height (in.) 


A least-squares regression analysis was performed on the data. Here is 
some computer output from the analysis: 
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Residual 


Height (in.) 


Frequency 


Residual 
Predictor Coef SE Coef T P 
Constant 113.57 13.085 8.679 0.000 
Height —0.9211 0.1938 ates saci 


S = 3.50429 R-Sq= 39.9% R-Sq(adj) = 38.1% 


R12.1 |») Long legs 


a. Describe what the scatterplot tells you about the relationship 
between height and number of steps. 


b. What is the equation of the least-squares regression line? Define 
any variables you use. 


c. Identify the value of each of the following from the computer 
output. Then provide an interpretation of each value. 


i. bOPo 
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ii, b1O1 
iii. ss 
iv. SEbISE 
R12.2 |») Long legs Verify that the conditions for inference about the 
slope of the least-squares regression line are met in this context. 


R12.3 |p) Long legs Do these data provide convincing evidence at the 


a=0.05a = 0.05 level that taller students at this school require fewer 
steps to walk a fixed distance? Assume that the conditions for 
inference are met. 


R12.4 |») Long legs Construct and interpret a 95% confidence interval 


for the slope of the population regression line. Assume that the 
conditions for inference are met. Explain how the interval provides 
more information than the test in R12.3. 


R12.5 |») Light intensity In a physics class, the intensity of a 100-watt 


lightbulb was measured by a sensor at various distances from the 
light source. Here is a scatterplot of the data. Note that a candela is a 
unit of luminous intensity in the International System of Units. 


0.304 « 
0.25 4 


0.20 4 


Intensity (candelas) 


10 1.2 14 ‘e 1.8 2.0 
Distance (m) 
Physics textbooks suggest that the relationship between light 
intensity y¥ and distance x® should follow an “inverse square law,” 
that is, a power law model of the form y=ax-—2=a1x2. 


—2 ] 
y=ar “=a-. ; 
z* We transformed the distance measurements by 


squaring them and then taking their reciprocals. Here is some 
computer output and a residual plot from a least-squares regression 
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analysis of the transformed data. Note that the horizontal axis on the 
residual plot displays predicted light intensity. 


0.006 + 
0.005 4 
0.004 = 
— 0.003 4 
E 0.002 4 
E 0.001 4 
0.000 
-0.001 4 * 
0.002 4 
ss T ae 
0.10 0.15 0.20 0.25 0.30 
Predicted intensity (candelas) 
Predictor Coef SE Coef T P 
Constant —-0.000595 0.001821 -0.33 0.751 


Distance\(-2) 0.299624 0.003237 92.56 0.000 
S=0.0024837  R-Sq=99.9% — R-Sq(adj)=99.9% 


a. Did this transformation achieve linearity? Give appropriate 
evidence to justify your answer. 


b. What is the equation of the least-squares regression line? Define 
any variables you use. 


c. Predict the intensity of a 100-watt bulb at a distance of 2.1 
meters. 


R12.6 |») Pricey diamonds Here is a scatterplot showing the 


relationship between the weight (in carats) and price (in dollars) of 
round, clear, internally flawless diamonds with excellent cuts:22 


We 


100 - Pa 


ae 
0 wn ome oe 
T T T T T T T T T T 
0.0 0.5 1.0 iS 20 25 30 35 40 45 


Weight (carats) 


Price ($ 1000s) 


a. Explain why a linear model is not appropriate for describing the 
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relationship between price and weight of diamonds. 


b. We used software to transform the data in hopes of achieving 
linearity. The output shows the results of two different 
transformations. Would an exponential model or a power model 
describe the relationship better? Justify your answer. 
Transformation 1: 


13 4 te 
= so] Re a 
g u- A 
=| 
= 10 oe 
a, m 
o %S Z 
8 | 
7 | ca 
l T T T T a T 
-1.5 -1.0 -0.5 0.0 0.5 1.0 tI | 
In (weight) 
Term Coef SECoef T-Value P-Value 


Constant 9.7062 0.0209 465.10 0.000 
InWeight 2.2913 (0.0332. 68.91 —_—0.000 
S=0.171328  R-sq=98.10% — R-sq(adj)=98.08% 


Transformation 2: 


In (price) 


Weight (carats) 


Term Coef SECoef T-Value P-Value 
Constant 8.2709 0.0988 83.72 0.000 
Weight 1.3791 0.0549 25.12 0.000 


S=0.443287 R-sq=87.28% — R-sq(adj)=87.14% 
c. Use each model to predict the price for a diamond of this type 


that weighs 2 carats. Which prediction do you think will be 
better? Explain your reasoning. 
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Chapter 12 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T12.1 Which of the following is not one of the conditions that must be 
satisfied in order to perform inference about the slope of a least- 
Squares regression line? 


a. For each value of x,”> the population of y-values is Normally 
distributed. 


b. The standard deviation o¢ of the population of y-values 
corresponding to a given value of x# is always the same, 
regardless of the specific value of x.@. 


c. The sample size—that is, the number of paired observations (x, 
y)—exceeds 30. 


d. There exists a straight line such that, for each value of x,”> the 
mean py" of the corresponding population of y¥-values lies 
on that straight line. 


e. The data come from a random sample or a randomized 
experiment. 


T12.2 Students in a statistics class drew circles of varying diameters and 
counted how many Cheerios could be placed in the circle. The 
scatterplot shows the results. 

The students want to determine an appropriate equation for the 
relationship between diameter and the number of Cheerios. The 
students decide to transform the data to make it appear more linear 
before computing a least-squares regression line. Which of the 
following transformations would be reasonable for them to try? 
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Cheerios 
& 


of T T 4 T T 
0 2 4 6 8 10 12 14 16 
Diameter 


I. Plot the square root of the number of Cheerios against 
diameter. 


II. Plot the cube of the number of Cheerios against diameter. 


III. Plot the log of the number of Cheerios against the log of the 
diameter. 


IV. Plot the number of Cheerios against the log of the diameter. 
a. Iand II 

b. Tand II 

c. Il and Ill 

d. Il and IV 

e. Land IV 


T12.3 Inference about the slope 81/1 of a least-squares regression line 
is based on which of the following distributions? 


a. The té distribution with n—1” — | degrees of freedom 

b. The standard Normal distribution 

c. The chi-square distribution with n—1” ~ | degrees of freedom 
d. The t¢ distribution with n-2” — 2 degrees of freedom 

e. The Normal distribution with mean yp and standard deviation 


oo 


Exercises T12.4—T12.8 refer to the following setting. An old 
saying in golf is “You drive for show and you putt for dough.” The 
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point is that good putting is more important than long driving for 
shooting low scores and hence winning money. To see if this is the 
case, data from a random sample of 69 of the nearly 1000 players 
on the PGA Tour’s world money list are examined. The average 
number of putts per hole (fewer is better) and the player’s total 
winnings for the previous season are recorded and a least-squares 
regression line was fitted to the data. Assume the conditions for 
inference about the slope are met. Here is computer output from 
the regression analysis: 


Predictor Coef SE Coef T P 
Constant 7897179 3023782 6.86 0.000 
Avg. putts -4139198 1698371. “Fee es 
S = 281777 R-Sq = 8.1% R-Sq(adj) = 7.8% 


T12.4 By about how much does the sample slope typically vary from 
the population slope in repeated random samples of n=69n = 69 
golfers? 


a. 7,897,179 

b. 1,698,371 

c. 3,023,782 

d. 281,777 

e. —4,139,198~4, 139, 198 


T12.5 Suppose that the researchers test the hypotheses HO: 81=0 
Ho : 8 = 0 versus Ha:B1<0.40:1 &lt; 0. Which of the following 
is the value of the t¢ statistic for this test? 


a. 2.61 

b. -2.44~2.44 
c. 2.44 

d. —20.24~—20.24 
e. 0.081 


T12.6 The PP-value for the test in Exercise T12.5 is 0.0087. Which of 
the following is a correct interpretation of this result? 
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a. The probability there is no linear relationship between average 
number of putts per hole and total winnings for these 69 players 
is 0.0087. 


b. The probability there is no linear relationship between average 
number of putts per hole and total winnings for all players on 
the PGA Tour’s world money list is 0.0087. 


c. If there is no linear relationship between average number of 
putts per hole and total winnings for the players in the sample, 
the probability of getting a random sample of 69 players that 
yields a least-squares regression line with a slope of 
-4,139,198~4; 139, 198 oy less is 0.0087. 


d. If there is no linear relationship between average number of 
putts per hole and total winnings for the players on the PGA 
Tour’s world money list, the probability of getting a random 
sample of 69 players that yields a least-squares regression line 
with a slope of —4,139,198 4,139,198 oF less is 0.0087. 


e. The probability of making a Type I error is 0.0087. 


T12.7 Which of the following is the 95% confidence interval for the 
slope B11 of the population regression line? 


a. 7,897,179+3,023,782",897,179 + 3,023,782 

b. 7,897,179+2.000(3,023,782)!:897,179 + 2.000 (3,023,782) 

c. —4,139,198+1,698,371—4,139,198 = 1,698,371 

d. —4,139,198+1.960(1,698,371)~4:139,198 + 1.960 (1,698,371) 
e. —4,139,198+2.000(1,698,371)~4:139,198 + 2.000 (1,698,371) 


T12.8 Which of the following would make the calculation in Exercise 
T12.7 invalid? 


a. If the scatterplot of the sample data wasn’t perfectly linear 
b. If the distribution of earnings has an outlier 


c. If the distribution of earnings wasn’t approximately Normal 


a 


If the earnings for golfers with small putting averages was 
much more variable than the earnings for golfers with large 
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putting averages 


e. If the standard deviation of earnings is much larger than the 
standard deviation of putting average 


T12.9 Which of the following would provide evidence that a power 
model of the form y=axp,¥ = @#”; where p~0P * 9 and p#1,P * 1; 
describes the relationship between a response variable y¥ and an 
explanatory variable x? 


a. A scatterplot of y¥ versus x looks approximately linear. 

b. A scatterplot of In y¥ versus x® looks approximately linear. 

c. A scatterplot of y¥ versus In x% looks approximately linear. 
d. A scatterplot of In y¥ versus In x# looks approximately linear. 
e. None of these 


T12.10 We record data on the population of a particular country from 
1960 to 2010. A scatterplot reveals a clear curved relationship 
between population and year. However, a different scatterplot 
reveals a strong linear relationship between the logarithm (base 10) 
of the population and the year. The least-squares regression line 
for the transformed data is 


log (population)A=—13.5+0.01(year) 
log (population) =—13.5 + 0.01 (year) 


Based on this equation, which of the following is the best estimate 
for the population of the country in the year 2020? 

a. 6.7 

b. 812 

c. 5,000,000 

d. 6,700,000 

e. 8,120,000 


Section II: Free Response Show all your work. Indicate clearly the 
methods you use, because you will be graded on the correctness of your 
methods as well as on the accuracy and completeness of your results and 
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explanations. 


T12.11 Growth hormones are often used to increase the weight gain of 
chickens. In an experiment using 15 chickens, 3 chickens were 
randomly assigned to each of 5 different doses of growth hormone 
(0, 0.2, 0.4, 0.8, and 1.0 milligrams). The subsequent weight gain 
(in ounces) was recorded for each chicken. A researcher plots the 
data and finds that a linear relationship appears to hold. Here is 
computer output from a least-squares regression analysis of these 
data. Assume that the conditions for performing inference about the 
slope B1°1 of the true regression line are met. 


Predictor Coef SE Coef T P 
Constant 45459 0.6166 7.37  <0.0001 
Dose 4.8323 1.0164 4.75 0.0004 


S=3.135 R-Sq=38.4% R-Sq(adj) = 37.7% 


a. Interpret each of the following in context: 
i. The slope 
ii. The y¥ intercept 
iii. The standard deviation of the residuals 
iv. The standard error of the slope 


b. Do the data provide convincing evidence of a linear relationship 
between dose and weight gain? Carry out a significance test at 
the a=0.05a = 0.05 level. 


c. Construct and interpret a 95% confidence interval for the slope 
parameter. 


T12.12 Foresters are interested in predicting the amount of usable lumber 
they can harvest from various tree species. They collect data on the 
diameter at breast height (DBH) in inches and the yield in board 
feet of arandom sample of 20 Ponderosa pine trees that have been 
harvested. (Note that a board foot is defined as a piece of lumber 
12 inches by 12 inches by 1 inch.) Here is a scatterplot of the data. 
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a. Here is some computer output and a residual plot from a least- 
Squares regression on these data. Explain why a linear model 
may not be appropriate in this case. 


Predictor Coef SE Coef T P 
Constant —191.12 16.98  —11.25 0.000 
DBH (inches) 11.0413 0.5752 19.19 0.000 
S = 20.3290 R-Sq=95.3% R-Sq(adj) = 95.1% 


40 4 
30 4 a ~ 
e os 
20 4 
8 10 ; ‘i 
; eo 
g 0 — = 
=" -10 +4 e as 2 
20 4 
—30 bd . 
40 4 * 
a es | ea | (ama) aman (aun 
20 25 30 35 40 
DBH (in.) 


The foresters are considering two possible transformations of 
the original data: (1) cubing the diameter values or (2) taking 
the natural logarithm of the yield measurements. After 
transforming the data, a least-squares regression analysis is 
performed. Here is some computer output and a residual plot for 
each of the two possible regression models: 


Option 1: Cubing the diameter values 
Predictor Coef SE Coef T P 
Constant 2.078 5.444 0.38 0.707 
DBHA3 0.0042597 0.0001549 27.50 0.000 
S = 14.3601 R-Sq=97.7% R-Sq(adj) = 97.5% 
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Residual 


100 150 200 
Predicted yield 


T 
300 


250 


Option 2: Taking natural logarithm of yield 


measurements 
Predictor Coef SE Coef r P 
Constant 1.2319 0.1795 6.86 0.000 
DBH 0.113417 0.006081 18.65 0.000 
S = 0.214894 R-Sq=95.1% R-Sq(adj) = 94.8% 
0.4 7 e 
03 - . : 
0.2 4 ee . 
2 ‘01 z 
= 0.0 a oe so 
2 0.14 lias : eS 
02 4 = a 
-03 4 . 
0.4 4 ° 
a6 35 40 ie Ba 55 6.0 
Predicted In (yield) 


b. Use both models to predict the amount of usable lumber from a 
Ponderosa pine with diameter 30 inches. 


c. Which of the predictions in part (b) seems more reliable? Give 
appropriate evidence to support your choice. 


1816 


Final Project 


In this project, your team will formulate a statistical question, design a 
study to answer the question, conduct the study, collect the data, analyze 
the data, and use statistical inference to answer the question. You may do 
your study on any topic, but you must be able to include the six steps listed 
above. 


. Write a proposal describing the design of your study. Make sure to 
include the following: 


e Describe the statistical question you are trying to answer. 


e List the explanatory and response variables (or just the response 
variable when appropriate). 


e State the null and alternative hypothesis you will be testing, along 
with the name of the test you will use to analyze the results. 


e Describe how you will collect the data, so the conditions for inference 
will be satisfied. 


e Explain how your study will be safe and ethical if you are using 
human subjects. 


’. Once your teacher has approved your proposal, carry out the study. 


}. Create a poster to summarize your project. Make sure to include the 
following: 


e Title (in the form of a question). 


e Introduction. The introduction should discuss what question you are 
trying to answer, why you chose this topic, what your hypotheses are, 
and how you will analyze your data. 


e Data Collection. In this section, you will describe how you obtained 
your data. Be specific. 


e Graphs and Summary Statistics, Including Raw Data. Begin by 
providing the raw data. If the data are quantitative, list them in a 
table. If the data are categorical, summarize them in a two-way table. 
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Then make graphs that are well labeled and easy to compare, and list 
appropriate summary statistics. Use the graphs and summary statistics 
to describe the evidence for the alternative hypothesis. 


e Analysis and Conclusion. Identify the inference procedure you used 
and discuss the conditions for inference. Give the (standardized) test 
statistic and PP-value (with interpretation), along with the 
appropriate conclusion. Then provide the corresponding confidence 
interval (with interpretation) or follow-up analysis (for chi-square 
tests). 


e Reflections. In this section, you should also discuss any possible 
errors (e.g., Type I or Type II), limitations to your conclusion, what 
you could do to improve the study the next time, and any other 
critical reflections. 


e The key to a good statistical poster is communication and 
organization. Make sure all components of the poster are focused on 
answering the question of interest and that statistical vocabulary is 
used correctly. Include live action pictures of your data collection in 
progress. 
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Cumulative AP® Practice Test 4 


Section I: Multiple Choice Choose the best answer for Questions AP4. 1— 


AP4.40. 


AP4.1 A major agricultural company is testing a new variety of wheat 
to determine whether it is more resistant to certain insects than the 
current wheat variety. The proportion of a current wheat crop lost 
to insects is 0.04. Thus, the company wishes to test the following 
hypotheses: 


Ho : p = 0.04 
H0:p=0.04Ha:p<0.04 Ha : p < 0.04 


Which of the following significance levels and sample sizes would 
lead to the highest power for this test? 


a. n=200n = 200 and a=0.01e = 0.01 


b. 
C. 
d. 


e. 


n=400n = 400 and a=0.05e = 0.05 
n=400n = 400 and a=0.01a = 0.01 
n=500n = 500 and a=0.01e = 0.01 
n=500n = 500 and a=0.05a = 0.05 


AP4.2 If P(A)=0.24P(A) = 9.24 and p(B)=0.52P(B) = 9-52 and events 
A and B are independent, what is P(A or B)? 


a. 0.1248 
b. 

C. 
d. 


e. 


0.28 
0.6352 
0.76 


The answer cannot be determined from the given information. 


AP4.3 Sam has determined that the weights of unpeeled bananas from 
his local store have a mean of 116 grams with a standard deviation 
of 9 grams. Assuming that the distribution of weight is 
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approximately Normal, to the nearest gram, the heaviest 30% of 
these bananas weigh at least how much? 


a. 107 ¢ 
b. 121 ¢ 
c. 1llg 
d. 125¢ 
e. 116g 


AP4.4 The school board in a certain school district obtained a random 
sample of 200 residents and asked if they were in favor of raising 
property taxes to fund the hiring of more statistics teachers. The 
resulting confidence interval for the true proportion of residents in 
favor of raising taxes was (0.183, 0.257). Which of the following 
is the margin of error for this confidence interval? 


a. 0.037 
b. 0.074 
c. 0.183 
d. 0.220 
e. 0.257 


AP4.5 After a name-brand drug has been sold for several years, the 
Food and Drug Administration (FDA) will allow other companies 
to produce a generic equivalent. The FDA will permit the generic 
drug to be sold as long as there isn’t convincing evidence that it is 
less effective than the name-brand drug. For a proposed generic 
drug intended to lower blood pressure, the following hypotheses 
will be used: 


HO:G=uN versus Ha:G<pN/40:He = bn Versus Ha:c &lt; 1 


where 


G=true mean reduction in blood pressureusing the generic 
drugy1G=true mean reduction in blood pressureusing the 
name@brand drug 
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fig = true mean reduction in blood pressure 
using the generic drug 
fig = true mean reduction in blood pressure 


using the name@brand drug 


In the context of this situation, which of the following describes a 
Type I error? 


a. The FDA finds convincing evidence that the generic drug is 
less effective, when in reality it is less effective. 


b. The FDA finds convincing evidence that the generic drug is 
less effective, when in reality it is equally effective. 


c. The FDA finds convincing evidence that the generic drug is 
equally effective, when in reality it is less effective. 


d. The FDA fails to find convincing evidence that the generic 
drug is less effective, when in reality it is less effective. 


e. The FDA fails to find convincing evidence that the generic 
drug is less effective, when in reality it is equally effective. 


AP4.6 The town council wants to estimate the proportion of all adults in 
their medium-sized town who favor a tax increase to support the 
local school system. Which of the following sampling plans is 
most appropriate for estimating this proportion? 


a. A random sample of 250 names from the local phone book 


b. A random sample of 200 parents whose children attend one of 
the local schools 


c. A sample consisting of 500 people from the city who take an 
online survey about the issue 


d. A random sample of 300 homeowners in the town 


e. A random sample of 100 people from an alphabetical list of all 
adults who live in the town 


AP4.7 Which of the following is a categorical variable? 
a. The weight of an automobile 


b. The time required to complete the Olympic marathon 
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c. The fuel efficiency (in miles per gallon) of a hybrid car 
d. The brand of shampoo purchased by shoppers in a grocery store 


e. The closing price of a particular stock on the New York Stock 
Exchange 


AP4.8 A large machine is filled with thousands of small pieces of 
candy, 40% of which are orange. When money is deposited, the 
machine dispenses 60 randomly selected pieces of candy. The 
machine will be recalibrated if a group of 60 candies contains 
fewer than 18 that are orange. What is the approximate probability 
that this will happen if the machine is working correctly? 


Plz &it- 03204 


a. P(z<0.3-0.4(0.4)(0.6)60) Vw 


Piz &lit: 0.3—0.4 


/(0.3)(0.7) 


. P(z<0.3-0.4(0.3)(0.7)60) yo 


oO 


| 0.3-0.4 
P | z &it; 


\ (0.4)(0.6) 


. P(z<0.3-0.4(0.4)(0.6)60) 60 


io) 


P (: &It: 0.3—0.4 
. P(z<0.3-0.4(0.4)(0.6)60) JO 


jar 


P| 2 &lt. 04223 


\ (0.3)(0.7) 
e. P(z<0.4—0.3(0.3)(0.7)60) 60 
AP4.9 A random sample of 900 students at a very large university was 
asked which social networking site they used most often during a 
typical week. Their responses are shown in the table. 


Gender 
Male Female Total 
Facebook 504 
Twitter 80 
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Networking LinkedIn 195 
site Pinterest 49 
Snapchat 72 
Total 900 


Assuming that gender and preferred networking site are 
independent, how many females do you expect to choose 
LinkedIn? 


a. 87.00 
b. 90.00 
c. 95.40 
d. 97.50 
e. 103.35 


AP4.10 Insurance adjusters are always concerned about being 
overcharged for accident repairs. The adjusters suspect that Repair 
Shop 1 quotes higher estimates than Repair Shop 2. To check their 
suspicion, the adjusters randomly select 12 cars that were recently 
involved in an accident and then take each of the cars to both 
repair shops to obtain separate estimates of the cost to fix the 
vehicle. The estimates are given in hundreds of dollars. 


Car 1 2 3 4 fs) 6 
Shop 1 21.2 20.2 39.0 11.3 15.0 18.1 
Shop 2 pa es 24.1 36.8 11.5 13.7 17.6 
Car 7 8 9 10 11 12 
Shop 1 2008 2.2 12.4 42.6 27.6 123 
Shop 2 24.8 21.3 12.1 42.0 26.7 12.5 


Assuming that the conditions for inference are met, which of the 
following significance tests should be used to determine whether 
the adjusters’ suspicion is correct? 


a. A paired tt? test 
b. A two-sample tf? test 


c. A té test to see if the slope of the population regression line is 0 
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d. A chi-square test for homogeneity 
e. A chi-square test for goodness of fit 


AP4.11 A survey firm wants to ask a random sample of adults in Ohio if 
they support an increase in the state sales tax from 5.75% to 6%, 
with the additional revenue going to education. Let p\P denote the 
proportion in the sample who say that they support the increase. 
Suppose that 40% of all adults in Ohio support the increase. If the 
survey firm wants the standard deviation of the sampling 
distribution of p/P to equal 0.01, how large a sample size is 
needed? 


a. 1500 
b. 2400 
c. 2401 
d. 2500 
e. 9220 


AP4.12 A set of 10 cards consists of 5 red cards and 5 black cards. The 
cards are shuffled thoroughly, and you choose one at random, 
observe its color, and replace it in the set. The cards are thoroughly 
reshuffled, and you again choose a card at random, observe its 
color, and replace it in the set. This is done a total of four times. 
Let XX be the number of red cards observed in these four trials. 
The random variable XX has which of the following probability 
distributions? 


a. The Normal distribution with mean 2 and standard deviation 1 
b. The binomial distribution with n=10n = 10 and p=0.5? = 9-5 
c. The binomial distribution with n=5n = 5 and p=0.5? = 9-5 

d. The binomial distribution with n=4n = 4 and p=0.5P = 9.5 

e. The geometric distribution with p=0.5? = 9-5 


AP4.13 A study of road rage asked random samples of 596 men and 523 
women about their behavior while driving. Based on their answers, 
each respondent was assigned a road rage score on a scale of 0 to 
20. The respondents were chosen by random-digit dialing of 
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telephone numbers. Are the conditions for inference about a 
difference in means satisfied? 


a. Maybe; the data came from independent random samples, but 
we should examine the data to check for Normality. 


b. No; road rage scores on a scale of 0 to 20 can’t be Normal. 
c. No; a paired té test should be used in this case. 


d. Yes; the large sample sizes guarantee that the corresponding 
population distributions will be Normal. 


e. Yes; we have two independent random samples and large 
sample sizes, and the 10% condition is met. 


AP4.14 Do hummingbirds prefer store-bought food made from 
concentrate or a simple mixture of sugar and water? To find out, a 
researcher obtains 10 identical hummingbird feeders and fills 5, 
chosen at random, with store-bought food from concentrate and 
the other 5 with a mixture of sugar and water. The feeders are then 
randomly assigned to 10 possible hanging locations in the 
researcher’s yard. Which inference procedure should you use to 
test whether hummingbirds show a preference for store-bought 
food based on amount consumed? 


a. A one-sample z2 test for a proportion 

b. A two-sample z2 test for a difference in proportions 
c. A chi-square test for independence 

d. A two-sample tt? test 

e. A paired tt? test 


AP4.15 A Harris poll found that 54% of American adults don’t think 
that human beings developed from earlier species. The poll’s 
margin of error for 95% confidence was 3%. This means that 


a. there is a 95% chance the interval (51%, 57%) contains the true 
percent of American adults who do not think that human beings 
developed from earlier species. 


b. the poll used a method that provides an estimate within 3% of 
the truth about the population in 95% of samples. 
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c. if Harris conducts another poll using the same method, the 
results of the second poll will lie between 51% and 57%. 


d. there is a 3% chance that the interval is incorrect. 


e. the poll used a method that would result in an interval that 
contains 54% in 95% of all possible samples of the same size 
from this population. 


AP4.16 Two six-sided dice are rolled and the sum of the faces showing 
is recorded after each roll. Let X=theX = the number of rolls 
required to obtain a sum greater than 7. If 100 trials are conducted, 
which of the following is most likely to be the result of the 


simulation? 
a. 
Number of | Number of _ 
rolls XX trails 

1 34 
2 20 
3 16 
4 10 
5 6 
6 6 
7 3 
8 2 
9 1 
10 0 
11 1 
12 0 
13 1 

b. 

Number of | Number of _ 
rolls XX trails 

0 34 
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1 20 
2 16 
3 10 
4 6 
S) 6 
5 3 
7 2 
8 1 
9 0 
10 1 
11 0 
12 1 
Number of Number of 
rolls XX trails 
1 18 
2 23 
3 26 
4 15 
5 9 
6 6 
7 1 
8 0 
9 1 
10 0 
11 0 
12 0 
13 1 
Number of Number of 
rolls XX trails 
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1 10 
2 9 
3 10 
4 i 
S) 7 
6 13 
7 10 
8 7 
9 

10 10 
11 2 
12 1 

e, 
Number of Number of 
rolls XX trails 

1 2 
2 2 
3 S) 
4 10 
5 11 
6 15 
7 22 
8 17 
9 9 
10 4 
d bal 2 
12 0 
13 1 


AP4.17 Women who are severely overweight suffer economic 
consequences, a study has shown. They have household incomes 
that are $6710 less than other women, on average. The findings are 
from an eight-year observational study of 10,039 randomly 
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selected women who were 16 to 24 years old when the research 
began. If the difference in average incomes is statistically 
significant, does this study give convincing evidence that being 
severely overweight causes a woman to have a lower income? 


a. Yes; the study included both women who were severely 
overweight and women who were not. 


b. Yes; the subjects in the study were selected at random. 


c. Yes, because the difference in average incomes is larger than 
would be expected by chance alone. 


d. No; the study showed that there is no connection between 
income and being severely overweight. 


e. No; the study suggests an association between income and 
being severely overweight, but we can’t draw a cause-and- 
effect conclusion. 


Questions AP 4.18 and 4.19 refer to the following situation. Could 
mud wrestling be the cause of a rash contracted by University of 
Washington students? Two physicians at the university’s student 
health center wondered about this when one male and six female 
students complained of rashes after participating in a mud- 
wrestling event. Questionnaires were sent to a random sample of 
students who participated in the event. The results, by gender, are 
summarized in the following table. 


Gender 
Male Female Total 
Yes 24 
Developed 
rash? Ne oe 
Total 74 


Here is some computer output for the preceding table. The output 
includes the observed counts, the expected counts, and the chi- 
Square statistic. 


Expected counts are printed below observed counts 
MALE FEMALE Total 
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Yes 12 12 24 


16.22 7.78 
No 38 12 50 
33.78 16.22 
Total 50 24 74 
ChiSgq = 5.002 


AP4.18 The cell that contributes most to the chi-square statistic is 


a. 
b. 
‘a 
d. 


e. 


men who developed a rash. 

men who did not develop a rash. 
women who developed a rash. 
women who did not develop a rash. 


both (a) and (d). 


AP4.19 From the chi-square test performed in this study, we may 
conclude that 


a. 


there is convincing evidence of an association between the 
gender of an individual participating in the event and 
development of a rash. 


mud wrestling causes a rash, especially for women. 


there is absolutely no evidence of any relationship between the 
gender of an individual participating in the event and the 
subsequent development of a rash. 


development of a rash is a real possibility if you participate in 
mud wrestling, especially if you do so regularly. 


the gender of the individual participating in the event and the 
development of a rash are independent. 


AP4.20 Random assignment is part of a well-designed comparative 
experiment because 


a. 


b. 


Cc. 


it is more fair to the subjects. 


it helps create roughly equivalent groups before treatments are 
imposed on the subjects. 


it allows researchers to generalize the results of their 
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experiment to a larger population. 
d. it helps eliminate any possibility of bias in the experiment. 
e. it prevents the placebo effect from occurring. 


AP4.21 The following back-to-back stemplots compare the ages of 


players from two minor-league hockey teams (1|7=17 years). 
(1|7 = 17 years) . 


Team A Team B 
98777 1 788889 
44333221 2) 00123444 
7766555 2 956679 
521 3 023 
86 3 55 


Which of the following cannot be justified from the plots? 


a. Team A has the same number of players in their 30s as does 
Team B. 


b. The median age of both teams is the same. 

c. Both age distributions are skewed to the right. 

d. The range of age is greater for Team A 

e. There are no outliers by the 1.5/QR rule in either distribution. 
AP4.22 A distribution that represents the number of cars XX parked in 

a randomly selected residential driveway on any night is given by 

piPi 0.10 0.20 0.35 0.20 0.15 


Given that there is at least 1 car parked on a randomly selected 
residential driveway on a particular night, which of the following 
is closest to the probability that exactly 4 cars are parked on that 
driveway? 


a. 0.10 
b. 0.15 
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C. 
d. 


e. 


0.17 
0.75 
0.90 


AP4.23 Which sampling method was used in each of the following 
settings, in order from I to IV? 


I. 


Il. 


IIL 


I. A student chooses to survey the first 20 students to arrive at 
school. 


II. The name of each student in a school is written on a card, 
the cards are well mixed, and 10 names are drawn. 


III. A state agency randomly selects 50 people from each of 
the state’s senatorial districts. 


. IV. A city council randomly selects eight city blocks and then 


surveys all the voting-age residents on those blocks. 
Voluntary response, SRS, stratified, cluster 
Convenience, SRS, stratified, cluster 

Convenience, cluster, SRS, stratified 

Convenience, SRS, cluster, stratified 


Cluster, SRS, stratified, convenience 


AP4.24 Western lowland gorillas, whose main habitat is in central 
Africa, have a mean weight of 275 pounds with a standard 
deviation of 40 pounds. Capuchin monkeys, whose main habitat is 
Brazil and other parts of Latin America, have a mean weight of 6 
pounds with a standard deviation of 1.1 pounds. Both distributions 
of weight are approximately Normally distributed. If a particular 
western lowland gorilla is known to weigh 345 pounds, 
approximately how much would a capuchin monkey have to 
weigh, in pounds, to have the same standardized weight as the 
gorilla? 


a. 
b. 


Cc. 


4.08 
Tid 
7.93 
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d. 


e. 


8.20 


There is not enough information to determine the weight of a 
capuchin monkey. 


AP4.25 Suppose that the mean weight of a certain breed of pig is 280 
pounds with a standard deviation of 80 pounds. The distribution of 
weight for these pigs tends to be somewhat skewed to the right. A 
random sample of 100 pigs is taken. Which of the following 
statements about the sampling distribution of the sample mean 
weight x @ is true? 


a. 


It will be Normally distributed with a mean of 280 pounds and 
a standard deviation of 80 pounds. 


. It will be Normally distributed with a mean of 280 pounds and 


a standard deviation of 8 pounds. 


It will be approximately Normally distributed with a mean of 
280 pounds and a standard deviation of 80 pounds. 


It will be approximately Normally distributed with a mean of 
280 pounds and a standard deviation of 8 pounds. 


There is not enough information to determine the mean and 
standard deviation of the sampling distribution. 


AP4.26 Which of the following statements about the t¢ distribution with 
degrees of freedom df is (are) true? 


L. 
Il. 


IIL 


I. It is symmetric. 


II. It has more variability than the té distribution with df+1 
df +1 degrees of freedom. 


III. As df increases, the t¢ distribution approaches the standard 
Normal distribution. 


I only 

II only 

III only 

I and II 

I, Il, and HI 
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Questions AP4.27—AP4.29 refer to the following situation. Park 
rangers are interested in estimating the weight of the bears that 
inhabit their state. The rangers have data on weight (in pounds) 
and neck girth (distance around the neck in inches) for 10 
randomly selected bears. Here is some regression output for these 


data: 

450 

400 y 

350 ee 
2 300-4 wo 
= 2505 i : 
2 200 e ea 

150 - ie 

100 4 - oa 

50 + a T T T T T 

15.0 175 20.0 22.5 25.0 275 30.0 32.5 
Neck girth (in.) 
Predictor Coef SE T P 
Coef 

Constant —241.70 38.57 -6.270.000 
Neck 20.230 1.695 11.930.000 
girth 
S= R-Sq = 94.7% 
26.7565 


AP4.27 Which of the following is the correct value of the correlation 
and its corresponding interpretation? 


a. The correlation is 0.947, and 94.7% of the variability in a 
bear’s weight can be accounted for by the least-squares 
regression line using neck girth as the explanatory variable. 


b. The correlation is 0.947. The linear association between a 
bear’s neck girth and its weight is strong and positive. 


c. The correlation is 0.973, and 97.3% of the variability in a 
bear’s weight can be accounted for by the least-squares 
regression line using neck girth as the explanatory variable. 


d. The correlation is 0.973. The linear association between a 
bear’s neck girth and its weight is strong and positive. 


e. The correlation cannot be calculated without the data. 


AP4.28 Which of the following represents a 95% confidence interval 
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for the slope of the population least-squares regression line relating 
the weight of a bear and its neck girth? 


a. 20.230+1.69520.230 + 1.695 
b. 20.230+3.8320.230 + 3.83 
c. 20.230+3.9120.230 + 3.91 
d. 20.230+20.2220.230 + 20.22 
e. 26.7565+3.8326.7565 + 3.83 


AP4.29 A bear was recently captured whose neck girth was 35 inches 
and whose weight was 466.35 pounds. If this bear were added to 
the data set, what would be the effect on the value of s?s? 


a. It would decrease the value of ss because the added point is an 
outlier. 


b. It would decrease the value of ss because the added point lies 
on the least-squares regression line. 


c. It would increase the value of ss because the added point is an 
outlier. 


d. It would increase the value of ss because the added point lies 
on the least-squares regression line. 


e. It would have no effect on the value of ss because the added 
point lies on the least-squares regression line. 


AP4.30 An experimenter wishes to test if one of two types of fish food 
(a standard fish food and a new product) is better for producing 
fish of equal weight after a two-month feeding program. The 
experimenter has two identical fish tanks (1 and 2) and is 
considering how to assign 40 fish, each of which has a numbered 
tag, to the tanks. The best way to do this would be to 


a. put all the odd-numbered fish in Tank 1 and the even-numbered 
fish in Tank 2. Give the standard food to Tank 1 and the new 
product to Tank 2. 


b. obtain pairs of fish whose weights are roughly equal at the start 
of the experiment and randomly assign one of the pair to Tank 
1 and the other to Tank 2. Give the standard food to Tank 1 and 
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the new product to Tank 2. 


proceed as in option (b), but put the heavier of each pair into 
Tank 2. Give the standard food to Tank 1 and the new product 
to Tank 2. 


assign the fish completely at random to the two tanks using a 
coin flip: heads means Tank 1 and tails means Tank 2. Give the 
standard food to Tank 1 and the new product to Tank 2. 


divide the 40 fish into two groups, with the 20 heaviest fish in 
one group. Randomly choose which tank to assign the heaviest 
fish and assign the lightest fish to the other tank. Give the 
standard food to Tank 1 and the new product to Tank 2. 


AP4.31 A city wants to conduct a poll of taxpayers to determine the 
level of support for constructing a new city-owned baseball 
stadium. Which of the following is the main reason for using a 
large sample size in constructing a confidence interval to estimate 
the proportion of city taxpayers who would support such a project? 


a. 
b. 
C. 
d. 


e. 


To increase the confidence level 

To eliminate any confounding variables 
To reduce nonresponse bias 

To increase the precision of the estimate 


To reduce undercoverage 


AP4.32 A standard deck of playing cards contains 52 cards, of which 4 
are aces and 13 are hearts. You are offered a choice of the 
following two wagers: 


I. 


Il. 


Draw one card at random from the deck. You win $10 if the 
card drawn is an ace. Otherwise, you lose $1. 


Draw one card at random from the deck. If the card drawn is a 
heart, you win $2. Otherwise, you lose $1. 


Which of the two wagers should you prefer? 


a. Wager 1, because it has a greater expected value 


b. Wager 2, because it has a greater expected value 
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c. Wager 1, because it has a greater probability of winning 
d. Wager 2, because it has a greater probability of winning 
e. Both wagers are equally favorable. 


AP4.33 Here are boxplots of SAT Critical Reading and Math scores for 
a randomly selected group of female juniors at a highly 
competitive suburban school: 


| 
400 500 600 700 800 
Score 


Which of the following cannot be justified by the plots? 


a. The maximum Critical Reading score is greater than the 
maximum Math score. 


b. Critical Reading scores are skewed to the right, whereas Math 
scores are somewhat skewed to the left. 


c. The median Critical Reading score and the median Math score 
for females are about the same. 


d. There appear to be no outliers in the distributions of SAT 
Critical Reading score. 


e. The mean Critical Reading score and the mean Math score for 
females are about the same. 


AP4.34 A distribution of exam scores has mean 60 and standard 
deviation 18. If each score is doubled, and then 5 is subtracted 
from that result, what will the mean and standard deviation of the 
new scores be? 


a. mean=115mean = 115 and standard deviation=31 
standard deviation = 31 


b. mean=115mean = 115 and standard deviation=36 
standard deviation = 36 
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c. mean=120mean = 120 and standard deviation=6 
standard deviation = 6 


d. mean=120mean = 120 and standard deviation=31 
standard deviation = 31 


e. mean=120mean = 120 and standard deviation=36 
standard deviation = 36 


AP4.35 In a clinical trial, 30 patients with a certain blood disease are 
randomly assigned to two groups. One group is then randomly 
assigned the currently marketed medicine, and the other group 
receives the experimental medicine. Every week, patients report to 
the clinic where blood tests are conducted. The clinic technician is 
unaware of the kind of medicine each patient is taking, and the 
patient is also unaware of which medicine he or she has been 
given. This design can be described as 


a. a double-blind, completely randomized experiment, with the 
currently marketed medicine and the experimental medicine as 
the two treatments. 


b. asingle-blind, completely randomized experiment, with the 
currently marketed medicine and the experimental medicine as 
the two treatments. 


c. a double-blind, matched pairs design, with the currently 
marketed medicine and the experimental medicine forming a 
pair. 

d. a double-blind, block design that is not a matched pairs design, 
with the currently marketed medicine and the experimental 
medicine as the two blocks. 


e. a double-blind, randomized observational study. 


AP4.36 A local investment club that meets monthly has 200 members 
ranging in age from 27 to 81. A cumulative relative frequency 
graph of age is shown. Approximately how many members of the 
club are more than 60 years of age? 
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a f 


C. 
d. 


e. 


Cumulative relative frequency (%) 


Age of member (years) 


20 
44 
78 
90 
110 


AP4.37 A manufacturer of electronic components is testing the 
durability of a newly designed integrated circuit to determine 
whether its life span is longer than that of the earlier model, which 
has a mean life span of 58 months. The company takes a simple 
random sample of 120 integrated circuits and simulates typical use 
until they stop working. The null and alternative hypotheses used 
for the significance test are HO: p=5840:4 = 58 and Ha: ue&et;58. 


A, 


‘/ > 58. The PP-value for the resulting one-sample t? test is 


0.035. Which of the following best describes what the PP-value 
measures? 


a. 


The probability that the new integrated circuit has the same life 
span as the current model is 0.035. 


The probability that the test correctly rejects the null hypothesis 
in favor of the alternative hypothesis is 0.035. 


The probability that a single new integrated circuit will not last 
as long as one of the earlier circuits is 0.035. 


The probability of getting a sample mean as far or farther above 
08 if there really is no difference between the new and the old 
circuits is 0.035. 
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e. The probability of getting a sample mean for the new integrated 
circuit that is less than the mean for the earlier model is 0.035. 


Questions AP4.38 and AP4.39 refer to the following situation. Do 
children’s fear levels change over time and, if so, in what ways? 
Little research has been done on the prevalence and persistence of 
fears in children. Several years ago, two researchers surveyed a 
randomly selected group of 94 third- and fourth-grade children, 
asking them to rate their level of fearfulness about a variety of 
situations. Two years later, the children again completed the same 
survey. The researchers computed the overall fear rating for each 
child in both years and were interested in the relationship between 
these ratings. They then assumed that the true regression line was 

later rating=$0+(1(initial rating) 

Hiater rating = So + f; (initial rating) 


and that the assumptions for regression inference were satisfied. 
This model was fitted to the data using least-squares regression. 
The following results were obtained from statistical software. 


Predictor Coefficient St. Dev. 
Constant 0.877917 0.1184 
Initial rating 0.397911 0.0676 
S = 0.2374 R-Sq = 0.274 


Here is a scatterplot of the later ratings versus the initial ratings 
and a plot of the residuals versus the initial ratings: 
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AP4.38 Which of the following statements is supported by these plots? 


a. The abundance of outliers and influential observations in the 
plots means that the assumptions for regression are clearly 
violated. 


b. These plots contain dramatic evidence that the standard 
deviation of the response about the true regression line is not 
approximately the same for each x#-value. 


c. These plots call into question the validity of the assumption that 
the later ratings vary Normally about the least-squares line for 
each value of the initial ratings. 


d. A linear model isn’t appropriate here because the residual plot 
shows no association. 


e. There is no striking evidence that the assumptions for 
regression inference are violated. 


AP4.39 George’s initial fear rating was 0.2 higher than Jonny’s. What 
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does the model predict about their later fear ratings? 
a. George’s will be about 0.96 higher than Jonny’s. 
b. George’s will be about 0.40 higher than Jonny’s. 
c. George’s will be about 0.20 higher than Jonny’s 

d. George’s will be about 0.08 higher than Jonny’s. 
e. George’s will be about the same as Jonny’s. 


AP4.40 The table provides data on the political affiliation and opinion 
about the death penalty of 850 randomly selected voters from a 
congressional district. 


Opinion about death penalty 


Favor Oppose Total 
Republican 397 
Political Democrat 248 
party Other 205 
Total 850 


Which of the following does not support the conclusion that being 
a Republican and favoring the death penalty are not independent? 


299 , O8 
a. 299494498356 494 7 356 
299 _, 397 


b. 2994944397850 494 7 850 
494 _, 299 


c. 4948504299397 850 * 397 


494 _,» 397 
d. 4948504397850 850 7 850 
(397)(494) 


e. (397)(494)8504299 850 


+ 


# 299 


Section II: Free Response Show all your work. Indicate clearly the 
methods you use, because you will be graded on the correctness of your 
methods as well as on the accuracy and completeness of your results and 
explanations. 


AP4.41 The body’s natural electrical field helps wounds heal. If diabetes 
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changes this field, it might explain why people with diabetes heal 
more slowly. A study of this idea compared randomly selected 
normal mice and randomly selected mice bred to spontaneously 
develop diabetes. The investigators attached sensors to the right hip 
and front feet of the mice and measured the difference in electrical 
potential (in millivolts) between these locations. Graphs of the data 
for each group reveal no outliers or strong skewness. The following 
computer output provides numerical summaries of the data.2! 


Variable N Mean StDev 
Diabetic mice 2413.090 4.839 
Normal mice 1810.022 2.915 


Mininum Qi Median Q3 #£Maximum 
1.050 10.038 12.650 17.038 22.600 
4.950 8.238 9.250 12.375 16.100 


Is there convincing evidence at the a=0.05a@ = 0.05 level that the 
mean electrical potential differs for normal mice and mice with 
diabetes? 


AP4.42 Can physical activity in youth lead to mental sharpness in old 
age? A 2010 study investigating this question involved 9344 
randomly selected, mostly white women over age 65 from four 
U.S. states. These women were asked about their levels of physical 
activity during their teenage years, 30s, 50s, and later years. Those 
who reported being physically active as teens enjoyed the lowest 
level of cognitive decline—only 8.5% had cognitive impairment— 
compared with 16.7% of women who reported not being physically 
active at that time. 


a. State an appropriate pair of hypotheses that the researchers 
could use to test whether the proportion of women who suffered 
a cognitive decline was significantly smaller for women who 
were physically active in their youth than for women who were 
not physically active at that time. Be sure to define any 
parameters you use. 


b. Assuming the conditions for performing inference are met, what 
inference method would you use to test the hypotheses you 
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identified in part (a)? Do not carry out the test. 


c. Suppose the test in part (b) shows that the proportion of women 
who suffered a cognitive decline was significantly smaller for 
women who were physically active in their youth than for 
women who were not physically active at that time. Can we 
generalize the results of this study to all women aged 65 and 
older? Justify your answer. 


d. We cannot conclude that being physically active as a teen 
causes a lower level of cognitive decline for women over 65, 
due to possible confounding with other variables. Explain the 
concept of confounding and give an example of a potential 
confounding variable in this study. 


AP4.43 In a recent poll, randomly selected New York State residents at 
various fast-food restaurants were asked if they supported or 
opposed a “fat tax” on sugared soda. Thirty-one percent said that 
they were in favor of such a tax and 66% were opposed. But when 
asked if they would support such a tax if the money raised were 
used to fund health care given the high incidence of obesity in the 
United States, 48% said that they were in favor and 49% were 
opposed. 


a. In this situation, explain how bias may have been introduced 
based on the way the questions were worded and suggest a way 
that the questions could have been worded differently in order 
to avoid this bias. 


b. In this situation, explain how bias may have been introduced 
based on the way the sample was taken and suggest a way that 
the sample could have been obtained in order to avoid this bias. 


c. This poll was conducted only in New York State. Suppose the 
pollsters wanted to ensure that estimates for the proportion of 
people who would support a tax on sugared soda were available 
for each state as well as an overall estimate for the nation as a 
whole. Identify a sampling method that would achieve this goal 
and briefly describe how the sample would be taken. 


AP4.44 Each morning, coffee is brewed in the school workroom by one of 
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three faculty members, depending on who arrives first at work. Mr. 
Worcester arrives first 10% of the time, Dr. Currier arrives first 
50% of the time, and Mr. Legacy arrives first on the remaining 
mornings. The probability that the coffee is strong when brewed by 
Dr. Currier is 0.1, while the corresponding probabilities when it is 
brewed by Mr. Legacy and Mr. Worcester are 0.2 and 0.3, 
respectively. Mr. Worcester likes strong coffee! 


a. What is the probability that on a randomly selected morning the 
coffee will be strong? 


b. If the coffee is strong on a randomly selected morning, what is 
the probability that it was brewed by Dr. Currier? 


AP4.45 The following table gives data on the mean number of seeds 
produced in a year by several common tree species and the mean 
weight (in milligrams) of the seeds produced. Two species appear 
twice because their seeds were counted in two locations. We might 
expect that trees with heavy seeds produce fewer of them, but what 
mathematical model best describes the relationship?=2 


Tree species Seed count Seed weight (mg) 
Paper birch 27,239 0.6 
Yellow birch 12,158 1.6 
White spruce 7202 2.0 
Engelmann spruce 3671 3.3 
Red spruce 5051 3.4 
Tulip tree 13,509 9.1 
Ponderosa pine 2667 a) 
White fir 5196 40.0 
Sugar maple 1751 48.0 
Sugar pine 1159 216.0 
American beech 463 247.0 
American beech 1892 247.0 
Black oak 93 1851.0 
Scarlet oak 525 1930.0 
Red oak A11 2475.0 
Red oak 253 2475.0 
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Pignut hickory 40 3423.0 
White oak 184 3669.0 
Chestnut oak 107 4535.0 


a. Describe the association between seed count and seed weight 
shown in the scatterplot. 
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b. Two alternative models based on transforming the original data 
are proposed to predict the seed weight from the seed count. 
Here are graphs and computer output from a least-squares 
regression analysis of the transformed data. 


Model A: 
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Seed count 
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3 
FE e 
—2 0 2 4 6 
Fitted value 
Predictor Coef SE Coef T P 
Constant 6.1394 0.5726 10.72 0.000 
Seed —0.00033869 0.00007187 —-4.71 0.000 
count 
S= R-Sq = R-Sq(adj) = 
2.08100 56.6% 54.1% 
Model B: 
E 
£ 


In (count) 
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Residual 


. 
-2 T T T 


Fitted value 


Predictor Coef SE T P 
Coef 

Constant 15.491 1.081 14.33 0.000 

In(count) =] 5222 0.1470 -10.35 0.000 

S= R-  R-Sq(adj) = 85.5% 

1.16932 Sq=86.3% 


Which model, A or B, is more appropriate for predicting seed 
weight from seed count? Justify your answer. 


c. Using the model you chose in part (b), predict the seed weight if 
the seed count is 3700. 


AP4.46 A company manufactures plastic lids for disposable coffee cups. 
When the manufacturing process is working correctly, the 
diameters of the lids are approximately Normally distributed with a 
mean diameter of 4 inches and a standard deviation of 0.02 inch. 
To make sure the machine is not producing lids that are too big or 
too small, each hour a random sample of 25 lids is selected and the 
sample mean x @ is calculated. 


a. Describe the shape, center, and variability of the sampling 
distribution of the sample mean diameter, assuming the machine 
is working properly. 


The company decides that it will shut down the machine if the 
sample mean diameter is less than 3.99 inches or greater than 
4.01 inches, because this indicates that some lids will be too 
small or too large for the cups. If the sample mean is less than 
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3.99 or greater than 4.01, all the lids manufactured that hour are 
thrown away because the company does not want to sell bad 
products. 


b. Assuming that the machine is working properly, what is the 
probability that a random sample of 25 lids will have a mean 
diameter less than 3.99 inches or greater than 4.01 inches? 


Also, to identify any trends, each hour the company records the 
value of the sample mean on a chart, like the one given here. 


4.01 


4.00 ts 


3.99 t ' —- 


Sample mean 
diameter (in.) 


Hour 


One benefit of using this type of chart is that out-of-control 
production trends can be noticed before it is too late and lids 
have to be thrown away. For example, if the sample mean is 
consistently greater than 4 (but less than 4.01), this would 
suggest that something might be wrong with the machine. If 
such a trend is noticed before the sample mean gets larger than 
4.01, then the machine can be fixed without having to throw 
away any lids. 


c. Assuming that the manufacturing process is working correctly, 
what is the probability that the sample mean diameter will be 
above the desired mean of 4.00 but below the upper boundary 
of 4.01? 


d. Assuming that the manufacturing process is working correctly, 
what is the probability that in 5 consecutive samples, 4 or 5 of 
the sample means will be above the desired mean of 4.00 but 
below the upper boundary of 4.01? 


e. Which of the following results gives more convincing evidence 
that the machine needs to be shut down? Explain your answer. 


|. Getting a single sample mean below 3.99 or above 4.01 
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or 


’, Taking 5 consecutive samples and having at least 4 of the sample 
means be between 4.00 and 4.01 
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“Estimating the mean annual seed production of trees,” Ecology, 75 
(1994), pp. 642-647. 


1882 


Chapter 4 Notes 


i This is an important topic, but it is not required for the AP® Statistics 
exam. 


ii This is an important topic, but it is not required for the AP® Statistics 
exam. 


iii This is an important topic, but it is not required for the AP® Statistics 
exam. 


Chapter 6 Notes 


i This topic is not required for the AP® Statistics exam. Some teachers 
prefer to discuss this topic when presenting the sampling distribution of p/ 
P (Chapter 7). 
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Chapter 1 


Introduction 


Answers to Check Your Understanding 

page 5: 1. The individuals are the cars in the student parking lot. 

2. He recorded the car’s license plate to identify the individuals. The variables 
recorded for each individual are the model (categorical), year (quantitative), color 
(categorical), highway gas mileage (quantitative), weight (quantitative), and 
whether or not it has a navigation system (categorical). 


Answers to Odd-Numbered Introduction Exercises 


I.1 (a) The AP® Statistics students who completed a questionnaire on the first day 
of class. (b) gender (categorical), grade level (categorical), GPA (quantitative), 
children in family (quantitative), homework last night (min) (categorical), and type 
of phone (categorical). 

1.3 The individuals are movies. The variables are year (quantitative), rating 
(categorical), time (min) (quantitative), genre (categorical), and box office ($) 
(quantitative). Note: Year might be considered categorical if we want to know how 
many of these movies were made each year, rather than the average year. 


1.5 The categorical variables are type of wood, type of water repellent, and paint 
color. The quantitative variables are paint thickness and weathering time. 

1.7 Student answers will vary. Examples of categorical variables could include 
region of the country and type of institution (2-year college, 4-year college, 
university). Examples of quantitative variables could include retention rate, 
graduation rate, class sizes, and faculty salaries. 

L9b 


Section 1.1 
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Answers to Check Your Understanding 


page 12: 


Percent 
s 
4 


Famous Rich Healthy Happy 


Preference 


A slight majority (52.5%) of students in the sample said they would prefer to be 
Happy. Rich and Famous were preferred about equally (20% and 17.5%). The least 
popular choice was Healthy (10%). 

page 16: 1. 113/338 = 0.334. This value makes sense because there were three 
treatments, so we would expect about one-third of the subjects to be assigned to 
this treatment. 

2. The distribution of change in depression is: Full response: 91/338 = 0.269, 
Partial response: 55/338 = 0.163, and No response: 192/338 = 0.568. 

3. 27/338 = 0.08 5 8% 

page 22: 1. 27/91 = 0.297 

2. 70/113 = 0.619 = 61.9% 


3. The distribution of change in depression for the subjects receiving each of the 
three treatments is: 


St. John’s wort Full response: 27/113 = 0.239; Partial response: 16/113 = 0.142; 
No response: 70/113 = 0.619 


Zoloft Full response: 27/109 = 0.248; Partial response: 26/109 = 0.239; No 
response: 56/109 = 0.514 


Placebo Full response: 37/116 = 0.319; Partial response: 13/116 = 0.112; No 
response: 66/116 = 0.569 


BB No response {J Partial response 
["] Full response 


100 
80 
60 
40 
20 
0 1 


St.John'’s Zoloft Placebo 
wort treatment 


Heights (cm) 


Percent 


4. There does not appear to be a strong association between treatment and change 
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in depression for these subjects because the distribution of response status is very 
similar for the three different treatments. Also, note that the treatment with the 
highest rate of “full response” was the placebo. 


Answers to Odd-Numbered Section 1.1 Exercises 


1.11 (a) The individuals are the babies. (b) It appears that births occurred with 
similar frequencies on weekdays (Monday through Friday), but with noticeably 
smaller frequencies on the weekend days (Saturday and Sunday). 


14,000 4 
12,000 + = 
10,000 > 
8000 5 
6000 4 
4000 + 
2000 
= i T L, T 


Births 


Day 


1.13 First, a relative frequency table must be constructed. 


Camera brand Relative frequency 
Canon 23/45 = 51.1% 
Sony 6/45 = 13.3% 
Nikon 11/45 = 24.4% 
Fujifilm 3/45 = 6.7% 
Olympus 2/45 = 4.4% 


The relative frequency bar graph is given here. 
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Camera brand 


The most popular brand of camera among the 45 most recent purchases on the 
Internet auction site is Canon, followed by Nikon, Sony, Fujifilm, and Olympus. 
Canon is the overwhelming favorite with over 50% of the customers purchasing 
this brand. Also noteworthy is that almost 25% of the customers purchased a Nikon 
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camera. 


1.15 (a) 2% (b) The most popular color of vehicles sold that year was white, 
followed by black, silver, and gray. It appears that a majority of car buyers that 
year preferred vehicles that were shades of black and white. 


35 
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Vehicle color 


(c) It would be appropriate to make a pie chart of these data (including the other 
category) because the numbers in the table refer to parts of a single whole. 


1.17 Estimates will vary, but should be close to 63% Mexican and 9% Puerto 
Rican. 


1.19 The areas of the pictures should be proportional to the numbers of students 
they represent. As drawn, it appears that most of the students arrived by car but in 
reality, most came by bus (14 took the bus, 9 came in cars). 

1.21 By starting the vertical scale at 12 instead of 0, it looks like the percent of 
binge-watchers who think that 5 to 6 episodes is too many to watch in one viewing 
session is almost 20 times higher than the percent of binge-watchers who think that 
3 to 4 episodes is too many to watch in one viewing session. In truth, the percent of 
binge-watchers who think that 5 to 6 episodes is too many to watch in one viewing 
session (31%) is less than 3 times higher than the percent of binge-watchers who 
think that 3 to 4 episodes is too many to watch in one viewing session (13%). 
Similar arguments can be made for the relative sizes of the other categories 
represented in the bar graph. 


1.23 (a) 50/150 = 0.333 (b) 10.7% said they saw broken glass at the accident; 
89.3% said they did not. (c) 10.67% 


1.25 (a) 71.25% (b) 0.633 (c) 12.1% 
1.27 (a) 0.241 (b) 68% 
1.29 (a) The distributions of responses for the three treatment groups are: 


“Smashed into” “Hit” Control 
Yes: 16/50 = 32% Yes: 7/50 = 14% Yes: 6/50 = 12% 
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No: 34/50 = 68% No: 43/50 = 86% No: 44/50 = 88% 


The segmented bar graph is shown here. 


Mi No & Yes 
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(b) The segmented bar graph reveals that there is an association between opinion 
about broken glass at the accident and treatment received for subjects in the study. 
Knowing which treatment a subject received helps us predict whether or not that 
person will respond that he or she saw broken glass at the accident. 

1.31 (a) 0.655 (b) 73.7% (c) Based on the segmented bar graph, there is an 
association between perceived body image and gender. Males are about 4 times as 
likely as females to perceive that they are underweight. Males are also less likely to 
perceive that they are overweight. However, the overwhelming majority of both 
genders perceive that their body image is about right. 


1.33 Answers may vary. Regardless of whether a student went to a private or 
public college, most students chose a school that was at least 11 miles from home. 
Those who went to a public university were most likely to choose a school that was 
11 to 50 miles from home (about 30%), while those who went to a private 
university were most likely to choose a school that was 101 to 500 miles from 
home (about 29%). 

1.35 (a) The graph reveals that as age increases, the percent that use smartphones 
for navigation decreases. (b) It would not be appropriate to make a pie chart of 
these data because the category percentages are not parts of the same whole. 


1.37 Answers will vary. Two possible tables are given here. 


10 40 30 20 
30 0 30 20 


1.39 (a) 64 of the 200 patients transported by helicopter, or 32% of patients died. 
260 of the 1100 patients transported by ambulance, or 23.6% of patients died. (b) 
Of the patients who were in serious accidents, 48% who were transported by 
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helicopter died and 60% who were transported by ambulance died. Of the patients 
who were in less serious accidents, 16% who were transported by helicopter died 
and 20% who were transported by ambulance died. Whether the patients were in a 
serious accident or less serious accident, the percentage who died was greater for 
those who were transported by ambulance. (c) Overall, a greater percentage of 
patients who were transported by helicopter died, but when broken down by 
seriousness of the accident, in both instances, a greater percentage of patients who 
were transported by ambulance died. This is because people in more serious 
accidents were also more likely to be transported by helicopter and were more 
likely to die. People who were involved in less serious accidents were less likely to 
be transported by helicopter and were less likely to die. Overall, this makes it 
appear that those who are transported by helicopters are more likely to die; but 
when the patients are broken out by seriousness of the accident, we can see that the 
driving factor for the overall death rates is the seriousness of the accident, not the 
method of transportation. 


1.41 d 
1.43 ¢ 


Section 1.2 
Answers to Check Your Understanding 


page 34: 1. 


Cost ($) 


2. The distribution of cost is skewed to the right, with a single peak at $1.50. There 
are two small gaps at $2.25 and $2.75. 

page 37: 1. Shape: The distribution of time to eat ice cream (in seconds) for the 
female sample is skewed to the right with no clear peak. The distribution of time to 
eat ice cream (in seconds) for the male sample is skewed to the right, with a single 
peak at about 17.5 seconds. Outliers: The female distribution appears to have one 
outlier: the female who took approximately 105 seconds to eat the ice cream. The 
male distribution does not appear to contain any outliers. Center: The time it took 
female students to eat ice cream was generally longer (median * 45 seconds) than 
for male students (median * 20 seconds). Variability: The ice cream eating times 
for female students varied more (from about 13 seconds to about 107 seconds) than 
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for the male students (from about 5 seconds to about 50 seconds). 


page 40: 1. Shape: The distribution of resting pulse rates is skewed to the right, 
with a single peak on the 70s stem. The distribution of after-exercise pulse rates is 
also skewed to the right, with a single peak on the 90s stem. Outliers: The 
distribution of resting pulse rates appears to have one outlier: the student whose 
resting pulse rate was120 bpm. The distribution of after-exercise pulse rates 
appears to have one outlier as well: the student whose after-exercise pulse rate was 
146 bpm. Center: The students’ resting pulse rates tended to be lower (median = 
76 bpm) than their “after exercise” pulse rates (median = 98 bpm). Variability: The 
“after exercise” pulse rates vary more (from 86 bpm to 146 bpm) than the resting 
pulse rates (which vary from 68 bpm to 120 bpm). 

2.b 

3.e 

4.c 


page 44: 1. One possible histogram is displayed here. 


Frequency 


80 90 100 110 120 130 140 150 
1Q 
2. The distribution of IQ scores is roughly symmetric and bell shaped with a single 
peak. There are no obvious outliers. The typical IQ score appears to be between 
110 and 120 (median = 114). The IQ scores vary from about 80 to 150. 


page 46: 1. The distribution of word length for both the journal and the magazine 
have shapes that are skewed to the right and single-peaked. Neither distribution 
appears to have any outliers. The centers for both distributions are about the same 
at approximately 5-6 letters per word. Both distributions have similar variability as 
the length of words in the journal varies from 1 letter to 14 letters and the length of 
words in the magazine varies from 2 letters to 14 letters. 

2. This is a bar graph. It displays categorical data about first-year students’ planned 
field of study. 


3. No, because the variable is categorical and the categories could be listed in any 
order on the horizontal axis. 
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Answers to Odd-Numbered Section 1.2 Exercises 
1.45 (a) The graph is shown here. 


nttea (ul des 


3 4 = 6 7 8 9 10 11 
Amount of sleep (h) 


(b) 0.179 


1.47 (a) The dot above 3 indicates that there was one game in which the U.S 
women’s soccer team scored 3 more goals than their opponent. (b) All 20 of the 
values are zero or more, which indicates that the U.S. women’s soccer team had a 
very good season. They won 17/20 = 85% of their games, tied the other team in 
3/20 = 15% of their games, and never lost. 

1.49 The shape of the distribution is left-skewed with a peak between 90 and 100 
years. There is a small gap around 70 years. 

1.51 The shape of the distribution is roughly symmetric with a single peak at 7. 
1.53 The distribution of difference (U.S. — Opponent) in goals scored is skewed to 
the right, with two potential outliers: when the U.S. team outscored their opponents 
by 9 and 10 goals. The median difference was 2 goals and the differences varied 
from 0 to 10 goals. 

1.55 The distribution of total family income for Indiana is roughly symmetric, 
while the distribution of total family income for New Jersey is slightly skewed 
right. The value of $125,000 may be an outlier in the Indiana distribution. There 
are no obvious outliers in the New Jersey distribution. The median for both 
distributions is about the same, approximately $49,000. The distribution of total 
family income in Indiana is less variable than the New Jersey distribution. The 
incomes in Indiana vary from $0 to about $125,000. The incomes in New Jersey 
vary from $0 to about $170,000. 

1.57 (a) Both distributions have about the same amount of variability. The 
“external reward” distribution varies from 5 to about 24. And the “internal reward” 
distribution varies from about 12 to 30. (b) The center of the internal distribution is 
greater than the center of the external distribution, indicating that external rewards 
do not promote creativity. 


1.59 (a) The stemplot is shown here. 
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Key: 15 | 9 = 15.9 grams 


(b) The graph reveals that there was one Fun Size Snickers” bar that is “gigantic”! 
It weighs 19.2 grams. (c) 0.412 


1.61 (a) The area of the largest South Carolina county is 1,220 square miles 
(rounded to the nearest 10 mi*). (b) The distribution of the area for the 46 South 
Carolina counties is right-skewed with distinct peaks on the 500 mi? and 700 mi? 
stems. There are no clear outliers. A typical South Carolina county has an area of 
about 655 square miles. The area of the counties varies from about 390 square 
miles to 1,220 square miles. 


1.63 (a) If we had not split the stems, most of the data would appear on just a few 
stems, making it hard to identify the shape of the distribution. (b) Key: 16 | 0 
means that 16.0% of that state’s residents are aged 25 to 34. (c) The distribution of 
percent of residents aged 25—34 is roughly symmetric with a possible outlier at 
16.0%. 


1.65 The distribution of acorn volume for the Atlantic coast is skewed to the right. 
The distribution of acorn volume for California is roughly symmetric with one high 
outlier of 17.1 cubic centimeters. The distribution of volume of acorn for the 
Atlantic coast has 3 potential outliers: 8.1, 9.1, and 10.5 cubic centimeters. The 
typical acorn volume for Atlantic coast oak tree species (median = 1.7 cubic 
centimeters) is less than the typical acorn volume for California oak tree species 
(median = 4.1 cubic centimeters). The Atlantic coast distribution (with acorn 
volumes from 0.3 to 10.5 cubic centimeters) varies less than the California 
distribution (with acorn volumes from 0.4 to 17.1 cubic centimeters.) 


1.67 (a) The histogram is shown here. 
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(b) The distribution of amount of CO, emissions per person in these 48 countries is 


right-skewed. Visually, none of the countries appear to be outliers. 


1.69 The data vary from 14 to 54. We chose intervals of width 6, beginning at 12. 
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DRP scores 


The distribution of DRP scores is roughly symmetric. There do not appear to be 
any outliers. The center of the DRP score distribution is between 30 and 36 (with 
median = 35). The DRP scores vary from 14 to 54. 

1.71 (a) The shape of the distribution is slightly skewed to the left with a single 
peak. (b) The center is between 0% and 2.5% return on common stocks. (c) The 
exact value for the minimum return cannot be identified because we only have a 
histogram of the returns, not the actual data. The lowest return was in the interval 
—22.5% to —25%. (d) About 37% of these months (102 out of 273) had negative 
returns. 


1.73 It is difficult to effectively compare the salaries of the two teams with these 
two histograms because the scale on the horizontal axis is very different from one 
graph to the other. It also does not help that the scales on the y-axis differ as well. 
1.75 (a) No, it would not be appropriate to use frequency histograms instead of 
relative frequency histograms in this setting because there were many more 
graduates surveyed (314) than non-graduates (57). (b) The distribution of total 
personal income for each group is skewed to the right and single peaked. There are 
some possible high outliers in the graduate distribution. There do not appear to be 
any outliers in the non-graduate distribution. The center of the personal income 
distribution is larger for graduates than non-graduates, indicating that graduates 
typically have higher incomes than non-graduates in this sample. The incomes for 
graduates vary a lot more (from $0 to $150,000) than non-graduates (from $0 to 
$60,000). 

1.77 A bar graph should be used because birth month is a categorical variable. A 
possible bar graph is given here. 
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1.79 Score earned on the AP® Statistics exam is quantitative, so the histograms 
shown here use relative frequencies because there are many more students who 
take the AP® Calculus AB exam. 
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The shapes of the two distributions are very different. The distribution of scores on 
the AP® Calculus AB exam has a peak at 1 and another slightly lower peak at 5. 
The distribution of scores on the AP® Statistics exam, however, is more uniform 
with scores of 1, 3, and 4 being the most frequent and scores of 5 being the least 
frequent. Neither distribution has any outliers. The center of both distributions is 3. 
Although scores on both exams vary from 1 to 5, there are more scores close to the 


center on the AP® Statistics exam and more scores at the extremes on the AP® 
Calculus exam. 


1.8la 
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Section 1.3 
Answers to Check Your Understanding 
page 59: 1. The mean weight of the pumpkins is 


xX =3.6+4.0+9.6+...+5.4+31+3323=9.935 pounds. 


3.6+4.0+-9.6+...+5.4+31+3 
zr — eee 9.935 pounds. 


2. First we must put the weights in order: 2.0, 2.8, 3.4, 3.6, 4.0, 4.0, 5.4, 5.4, 6.0, 
6.0, 6.1, 6.6, 9.6, 9.6, 11.0, 11.9, 12.4, 12.7, 13.0, 14.0, 15.0, 31.0, and 33.0. 
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Because there are 23 weights, the median is the 12th weight in this ordered list. 
The median weight of the pumpkins is 6.6 pounds. 


3. I would use the median to summarize the typical weight of a pumpkin in this 
contest because the distribution of pumpkin weights is skewed to the right with two 
possible upper outliers: 31.0 and 33.0 pounds. 

page 65: 1. Range = Max — Min = 23.3 — 21.5 = 1.8 mpg 

2. The standard deviation of 0.363 mpg tells us that the highway fuel economy of 
these 25 model year 2018 Toyota 4 Runners typically varies by about 0.363 mpg 
from the mean of 22.404 mpg. 

3. Q1=22.2+22.22=22.2; Q3=22.6+22.62=22.6; IQR=Q3-Q1=22.6-22.2=0.4 
mpg 


22.2+22.2 22.6+2: 


Q, = SH = 22.2; Qs = SRE = 22.6; IQR = Q3 — Qi = 22.6 — 22.2 = 0.4mpg 


4. I would use the interquartile range (IQR) to describe variability because there 
appears to be one upper outlier and one lower outlier. The [QR is resistant to 
outliers, but the range and standard deviation are not. 

page 70: 1. M=166+1672=166.5 grams; Q1=163 grams; Q3=170 grams. 


M= wa = 166.5 grams; @, = 163 grams; @3 = 170 grams. The IQR = 


170 - 163 = 7 grams. An outlier is any value below Q, — 1.5(/QR) = 163 - 1.5(7) 
= 152.5 grams or above Q3 + 1.5(/QR) = 170 1 1.5(7) = 180.5 grams. This means 
that the value 152 grams is an outlier. The boxplot is displayed here. 


150 155 160 165 170)=«=175 
Weight (g) 
2. No, the graph does not support their suspicion. The first quartile of the 
distribution of weight is 163 grams, which means that at least 75% of the large 
fries that they purchased weighed more than the advertised weight of 160 grams. 


Answers to Odd-Numbered Section 1.3 Exercises 

1.87 (a) The mean of Joey’s first 14 quiz scores is 85. (b) Including a 15th quiz 
score of 0, Joey’s mean would be 79.3. This illustrates the property of 
nonresistance. The mean is not resistant. It is sensitive to extreme values. 
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1.89 (a) 85 (b) Since there are now 15 quiz scores, the median is 84. Notice that 
the median did not change much. This shows that the median is resistant to 
outliers. 


1.91 (a) 8 electoral votes (b) Since the distribution of number of electoral votes is 
skewed to the right, the mean of this distribution is greater than its median. 

1.93 The mean house price is $276,200 and the median is $234,200. The 
distribution of house prices is likely to be quite skewed to the right because of a 
few very expensive homes, some of which may be outliers. When a distribution is 
skewed to the right, the mean is bigger than the median. 


1.95 (a) The median is 2 servings of fruit per day. 
— _ 194 


(b) x =19474=2.62' 74 ~~ “"* servings of fruit per day 

1.97 (a) Range = Max — Min = 98 — 74 = 24. The range of Joey’s quiz grades after 
his unexcused absence is 98 — 0 = 98. (b) The range may not be the best way to 
describe variability for a distribution of quantitative data because the range can be 
heavily affected by outliers. 


1.99 The standard deviation is 2.52 cm. The foot lengths of these 14-year-olds 
from the United Kingdom typically vary by about 2.52 cm from the mean of 24 
cm. 

1.101 (a) The size of these 18 files typically varies by about 1.9 megabytes from 
the mean of 3.2 megabytes. (b) If the music file that takes up 7.5 megabytes of 
storage space is replaced with another version of the file that only takes up 4 
megabytes, the mean would decrease slightly. The standard deviation would 
decrease as well because a file of size 4 megabytes will be closer to the new mean 
than the file of 7.5 megabytes was to the former mean. 

1.103 Variable B has a smaller standard deviation because more of the 
observations have values closer to the mean than in Variable A’s distribution. That 
is, the typical distance from the mean is smaller for Variable B than for Variable A. 
1.105 Q; = 1.9 megabytes; Q3 = 4.7; IQR = Q3 - Q) = 4.7 — 1.9 = 2.8 megabytes 
1.107 An outlier is any value below Q, — 1.5(7/QR) = 1.9 — 1.5(2.8) = 22.3 
megabytes or above Q3 + 1.5UQR) = 4.7 + 1.5(2.8) = 8.9 megabytes. There are no 
files in the data set that have fewer than —2.3 megabytes or more than 8.9 
megabytes, so there are no outliers in the distribution. 

1.109 (a) The distribution is skewed to the right because the mean is much larger 
than the median. Also, Q3 is much further from the median than Q). (b) The 
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amount of money spent typically varies by about $21.70 from the mean of $34.70. 
(c) The first quartile is 19.27 and the third quartile is 45.40, so the JQR is 45.40 — 
19.27 = 26.13. Any points below 19.27 — 1.5(26.13) = —-19.925 or above 45.40 + 
1.5(26.13) = 84.595 are outliers. Because the maximum of 93.34 is greater than 
84.595, there is at least one outlier. 

1.111 (a) The median is 9, the first quartile is 3, and the third quartile is 43. The 
IQR is 43 — 3 = 40. An outlier would be any value below 3 — 1.5(40) = —57 or 
above 43 + 1.5(40) = 103. This means the value of 118 is an outlier. The boxplot is 
shown here. 
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(b) The article claims that teens send 1742 texts a month, which works out to be 
about 58 texts a day (assuming a 30-day month). Nearly all of the members of the 
class (21 of 25) sent fewer than 58 texts per day, which seems to contradict the 
claim in the article. 

1.113 (a) the median and IQR would be a better choice for summarizing the center 
and variability of the distribution of electoral votes than the mean and standard 
deviation because the boxplot reveals that there are three outliers in the data set. 
The mean and standard deviation are not resistant measures of center and 
variability, so their values are sensitive to these extreme values. (b) The stemplot 
reveals that the distribution has a single peak, which cannot be discerned from the 
boxplot. Also, the stemplot reveals that there are actually four upper outliers rather 
than three. The value of 29, which is an outlier, gives the number of electoral votes 
for two states. In the boxplot, this appears as one asterisk. However, there are two 
states that have that many electoral votes, not one. 


1.115 Shape: The distribution of energy cost (in dollars) for top freezers looks 
roughly symmetric. The distribution of energy cost (in dollars) for side freezers 
looks roughly symmetric. The distribution of energy cost (in dollars) for bottom 
freezers looks skewed to the right. Outliers: There are no outliers for the top or 
side freezers. There are at least two bottom freezers with unusually high energy 
costs (over $140 per year). Center: The typical energy cost for the side freezers 
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(median * $75) is greater than the typical cost for the bottom freezers (median ~ 
$69), which is greater than the typical cost for the top freezers (median * $56). 
Variability: There is much more variability in the energy costs for bottom freezers 
(IQR © $20) than for side freezers (IQR ~ $12) or for top freezers (IQR ~ $8). 


1.117 (a) 
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(b) Numerical summaries are given here. 


nit x & S, Minimum Q, Median Q3;  Maximun 
Male 15 62.4 71.4 0 6 28 83 214 
Female 16 128.3 116.0 0 34 107 191 379 


(c) The data from this survey project give very strong evidence that male and 
female texting habits differ considerably at the school. A typical female sends and 
receives about 79 more text messages in a 2-day period than a typical male. The 
males as a group are also much more consistent in their texting frequency than the 
females. 

1.119 (a) All five income distributions are skewed to the right. This tells us that 
within each level of education, there is much more variability in incomes for the 
upper 50% of the distribution than for the lower 50% of the distribution. (b) Even 
though the boxplots are not modified (visually showing outliers), it is clear that the 
upper whisker for the advanced degree boxplot is much longer than 1.5 times its 
IQR, indicating that there is at least one upper outlier in the group that earned an 
advanced degree. (c) As education level rises, the median, quartiles, and extremes 
rise as well—that is, every value in the 5-number summary gets larger. This makes 
sense because one would expect that individuals with more education tend to attain 
jobs that yield more income. (d) The variability in income increases as the 
education level increases. Both the width of the box (the QR) and the distance 
from one extreme to the other increase as education levels increase. 


1.121 (a) One possible answer is 1, 1, 1, and 1. (b) 0, 0, 10, 10 (c) For part (a), any 
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set of four identical numbers will have s, = 0. For part (b), however, there is only 
one possible answer. We want the values to be as far from the mean as possible, so 
the squared deviations from the mean can be as big as possible. Our best choice is 
two values at each extreme, which makes all four squared deviations equal to 52. 
1.123d 

1125 ¢ 


1.127 A histogram is given here. 


Frequency 


o Lal nel 
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Height (cm) 

This distribution is roughly symmetric with a single peak at 170 cm. There do not 
appear to be any outliers. The center of the distribution of heights can be described 
by the mean of 169.88 cm or the median of 169.5. The heights vary from 145.5 cm 
to 191 cm, so the range is 45.5 cm. The standard deviation of heights is 9.687 cm 
and the IQR is 177 — 163 = 14cm. 


Answers to Chapter 1 Review Exercises 


R1.1 (a) The individuals are buyers. (b) The variables are zip code (categorical), 
gender (categorical), buyer’s distance from the dealer (in miles) (quantitative), car 
model (categorical), model year (categorical), and price (quantitative). 


R1.2 First, a relative frequency table must be constructed. 


Candy selected Relative frequency 
Snickers® 8/30 = 26.7% 
Milky Way® 3/30 = 10% 
Butterfinger® 7/30 = 23.3% 
Twix® 10/30 = 33.3% 

3 Musketeers® 2/30 = 6.7% 


The relative frequency bar graph is given here. 
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Students preferred Twix the most (one-third of the students chose this candy), 
followed by Snickers (27% relative frequency), Butterfinger (23% relative 
frequency), Milky Way (10% relative frequency), and lastly 3 Musketeers (7% 
relative frequency). 

R1.3 (a) The graph is misleading because the “bars” are different widths. For 
example, the bar for “Send/receive text messages” should be roughly twice the size 
of the bar for “Camera,” but it is actually much more than twice as large in area. 
(b) It would not be appropriate to make a pie chart for these data because they do 
not describe parts of the same whole. Students were free to answer in more than 
one category. 

R1.4 (a) 148/219 = 0.676 = 67.6% were Facebook users. (b) 67/219 = 0.306 = 
30.6% were aged 28 and over. (c) 21/219 = 0.096 = 9.6% were older Facebook 
users. (d) 78/148 = 0.527 = 52.7% of the Facebook users were younger students. 
R1.5 (a) 


[i Not a Facebook user 
(1) Is a Facebook user 
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(b) From both the table and the graph, we can see that there is an association 
between age and Facebook use. As age increases, the percent of Facebook users 
decreases. For younger students, about 95% use Facebook. That drops to 70% for 
23- to 27-year-old students and drops even further to 31.3% for older students. 
R1.6 (a) A stemplot is shown here. 
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Key: 48 | 8 = 4.88 


(b) The distribution is roughly symmetric with one possible outlier at 4.88. The 
center of the distribution is between 5.4 and 5.5. The densities vary from 4.88 to 
5.85. (c) The mean of the distribution of Cavendish’s 29 measurements is 5.45. So 
these estimates suggest that the Earth’s density is about 5.45 times the density of 
water. The currently accepted value for the density of the earth (5.51 times the 
density of water) is slightly larger than the mean of the distribution of density 


measurements. 
R1.7 (a) 
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The survival times are right-skewed, as expected. 


(b) 
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(c) While both graphs clearly show that the distribution is strongly skewed to the 
right, the boxplot shows that there are several high outliers. 
R1.8 (a) About 11% of low-income and 40% of high-income households consisted 
of four or more people. (b) The shapes of both distributions are skewed to the 
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right. However, the skewness is much stronger in the distribution for low income 
households. On average, household size is larger for high income households. In 
fact, the majority of low-income households consist of only one person. Only about 
7% of high-income households consist of one person. One-person households 
might have less income because they would include many young single people 
who have no job or retired single people with a fixed income. 

R1.9 (a) The amount of mercury per can of tuna will typically vary by about 0.3 
ppm from the mean of 0.285 ppm. (b) The JQR = 0.380 — 0.071 = 0.309, so any 
point below 0.071 — 1.5(0.309) = —0.393 or above 0.38 + 1.5(0.309) = 0.8435 
would be considered an outlier. Because the smallest value is 0.012, there are no 
low outliers. According to the histogram, there are values above 0.8435, so there 
are several high outliers. (c) The mean is much larger than the median of the 
distribution because the distribution is strongly skewed to the right and there are 
several high outliers. 

R1.10 The distribution for light tuna is skewed to the right with several high 
outliers. The distribution for albacore tuna is more symmetric with just a couple of 
high outliers. The albacore tuna generally has more mercury. Its minimum, first 
quartile, median, and third quartile are all greater than the respective values for 
light tuna. But that doesn’t mean that light tuna is always better. It has much larger 
variation in mercury concentration, with some cans having as much as twice the 
amount of mercury as the largest amount in the albacore tuna. 


Answers to Chapter 1 AP® Practice Test 
T1l.1d 
T1.2d 
T1.3 b 
T1.4b 
TLS 
T1.6e 
Thc 
T1.8e 
T1.9 b 
T1.10d 
T1.11 (a) 


1902 


Lot 


i 


i 


Frequency 
Sr NWHUD~ISBOCOP” 


i 


20 40 60 80 100 120 140 160 


— 


Number of contacts 

(b) The first quartile is 30 contacts. The third quartile is 77. IQR = 77 — 30 = 47. 
Any value below 30 - 1.5(47) = —40.5 or above 77 + 1.5(47) = 147.5 is an outlier. 
So the observation of 151 contacts is an outlier. (c) It would be better to use the 
median and JQR to describe the center and variability because the distribution of 
number of contacts is skewed to the right and has a high outlier. 

T1.12 (a) 53/1207 = 0.044 (b) 22/1207 = 1.8% (c) The conditional probabilities are 
shown in the table. 


Diabetic status 


Number of birth 

defects Nondiabetic Prediabetic Diabetic 
None 96.1% 96.5% 80.9% 
One or more 3.9% 3.5% 19.1% 


fH One or more birth defects 
[_] No birth defects 


100 


~ 80 
~| 
S 6 
= 
& 40 
20 
0 — 
- ee . Pd 
Ss i ms 
Ss 


Diabetic status 


(d) There is an association between diabetic status and number of birth defects for 
the women in this study. Nondiabetics and prediabetics appear to have babies with 
birth defects at about the same rate. However, those with diabetes have a much 
higher rate of babies with birth defects. 


T1.13 (a) The longest that any battery lasted was between 550 and 559 hours. (b) 
Someone might prefer to use Brand X because it has a higher minimum lifetime or 
because its lifetimes are more consistent (less variable). (c) Someone might prefer 
Brand Y because it has a higher median lifetime. 
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T1.14 The distribution of reaction time for the Athlete group is slightly skewed to 
the right. The distribution of reaction time for the Other group is roughly 
symmetric with two high outliers. It appears that the Athlete distribution has one 
high outlier while the Other distribution has two high outliers. The reaction times 
for the students who have not been varsity athletes tended to be slower (median = 
292.0 milliseconds) than for the athletes (median = 261.0 milliseconds). The 
distribution of reaction time for the Other group also has more variability, as their 
reaction times have an JQR of 70 milliseconds and the athletes’ reaction times have 
an IQR of 64 milliseconds. 


Chapter 2 

Section 2.1 

Answers to Check Your Understanding 

page 95: 1.c 

2. Her daughter weighs more than 87% of girls her age and she is taller than 67% 
of girls her age. 

3. About 65% of calls lasted less than 30 minutes. This means that about 35% of 
calls lasted 30 minutes or longer. 

4. Q; = 13 minutes; Q3 = 32 minutes; JQR = 32 — 13 = 19 minutes. 


page 97: 1. 2=62-674,29=-1,166,7 ~ 429 —7*F©8- 1  ttes height is 1.166 


standard deviations below the mean height of the class. 


—0.85 = 


2. Because Brent’s z-score is —0.85, we know that —0.85=74-760. ad 
Solving for o we find that o = 2.35 inches. 

page 103: 1. Converting the cost of the rides from dollars to cents will not change 
the shape. However, it will multiply the mean and standard deviation by 100. 


2. Adding 25 cents to the cost of each ride will not change the shape of the 
distribution, nor will it change the variability. It will, however, add 25 cents to the 
measures of center (mean, median). 


3. Converting the costs to z-scores will not change the shape of the distribution. It 
will change the mean to 0 and the standard deviation to 1. 


Answers to Odd-Numbered Section 2.1 Exercises 


2.1 (a) Because 17 of the 20 students (85%) own fewer pairs of shoes than Jackson 
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(who owns 22 pairs of shoes), Jackson is at the 85th percentile in the number of 
pairs of shoes distribution. (b) 45% of the boys had fewer pairs of shoes than Raul 
did. Raul was at the 45th percentile. This means that 45% of the 20 boys, or 9 
boys, have fewer pairs of shoes. Therefore, Raul’s response is the 10th value in the 
ordered list. Raul owns 8 pairs of shoes. 

2.3 (a) Because 10 of the 30 observations (33.3%) are below Antawn’s head 
circumference (22.4 inches), Antawn is at the 33.3rd percentile in the head 
circumference distribution. (b) The player with a head circumference of 24 inches 
is at the 90th percentile of the distribution. 

2.5 This means that the speed limit is set at such a speed that 85% of the vehicle 
speeds are slower than the posted speed. 

2.7 The girl in question weighs more than 48% of girls her age, but is taller than 
78% of the girls her age. Because she is taller than 78% of girls, but only weighs 
more than 48% of girls, she is probably fairly thin. 

2.9 (a) No; a sprint time of 8 seconds is not unusually slow. A student with an 8- 
second sprint is at the 75th percentile, so 25% of the students took that long or 
longer. (b) The 20th percentile of the distribution is approximately 6.7 seconds. 
20% of the students completed the 50-yard sprint in less than 6.7 seconds. 

2.11 (a) The first quartile is the 25th percentile. Find 25 on the y-axis, read over to 
the line and then down to the x-axis to get about Q; = 4%. The 3rd quartile is the 
75th percentile. Find 75 on the y-axis, read over to the line and then down to the x- 
axis to get about Q3 = 14%. The IQR is approximately 14 — 4 = 10%. (b) Arizona, 
which had 15.1% foreign-born residents that year, is at the 90th percentile. (c) The 
graph is fairly flat between 20% and 27.5% foreign-born residents because there 
were very few states that had 20% to 27.5% foreign-born residents that year. 


(d) 


Relative frequency % 


Percent foreign-born residents 


2.13 (a) The z-score for Montana is 2=1.9-8.736.12=-1.12.” 6.12 
Montana’s percent of foreign-born residents is 1.12 standard deviations below the 
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mean percent of foreign-born residents for all states. (b) If we let x denote the 
percent of foreign-born residents in New York at that time, then we can solve for x 


_ z-8.73 
in the equation 2.10=x-8.736.12,2" 19 612° Thus, x = 21.582% foreign-born 


residents. 


2.15 (a) The number of pairs of shoes owned by Jackson is 1.10 standard 
deviations above the average number of pairs of shoes owned by the students in the 
sample. (b) If we let x © denote the mean number of pairs of shoes owned by 
students in the sample, then we can solve for x © in the equation 1.10=22-x 9.42. 
110= Fa Thus, x =11.64% = 11.64 is the mean number of pairs of shoes 


owned by the students in the sample. 

2.17 (a) The fact that your standardized score is negative indicates that your bone 
density is below the average for your peer group. In fact, your bone density is 
about 1.5 standard deviations below average among 25-year-old women. (b) If we 
let o denote the standard deviation of the bone density in Judy’s reference 


population, then we can solve for o in the equation —1.45=948-9560. 
948—956 


—1.45 = —F—- Thus, o = 5.52g/em2. 
, — 880-500 _ 1 
2.19 Eleanor’s standardized score of z=680-500100=1.8 100 ~~ is 
29-21 
= — = 1.6. 


higher than Gerald’s standardized score of 2=29-215=1.6." 5 

2.21 (a) The shape of the distribution of corrected long-jump distance will be the 
same as the original distribution of long-jump distance: roughly symmetric with a 
single peak. (b) The mean of the distribution of corrected long-jump distance is 
577.3 — 20 = 557.3. The median of the distribution of corrected long-jump distance 
is 577 — 20 = 557 centimeters. (c) The standard deviation of the distribution of 
corrected long-jump distance is the same as the standard deviation of the 
distribution of long-jump distance, 4.713 centimeters. The JQR of the distribution 
of corrected long-jump distance is the same as the JQR of the distribution of long- 
jump distance, 7 centimeters. 

2.23 (a) The shape of the new salary distribution will be the same as the shape of 
the original salary distribution. (b) The mean and median salaries will each 
increase by $1000. (c) The standard deviation and JQR of the new salary 
distribution will each be the same as they were for the original salary distribution. 
2.25 (a) The shape of the distribution of corrected long-jump distance in meters 
will be the same as the distribution of corrected long-jump distance in centimeters: 
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roughly symmetric with a single peak. (b) The mean of the distribution of 
corrected long-jump distance, in meters, is 577.3 — 20 = 557.3 cm + 100 cm/m = 
5.973 meters. (c) The standard deviation of the distribution of corrected long-jump 
distance, in meters, is 4.713 centimeters + 100 = 0.04713 meters. 

2.27 (a) The shape of the resulting salary distribution will be the same as the 
original distribution of salaries. (b) The median will increase by 5% because each 
value in the distribution is being multiplied by 1.05. (b) The JQR will increase by 
5% because each value in the distribution is being multiplied by 1.05. 


£(77) — = = 25 
2.29 (a) The mean temperature reading is 59(77)—1609=25 9 9 


degrees Celsius. (b) The standard deviation of the temperature reading is 


59(3)=1.667 9 (3) aaa Ls degrees Celsius. 


2.31 We are told that Mean(Fare) = 2.85 + 2.7 Mean(miles): 15.45 = 2.85 + 2.7 
Mean(miles) — Mean(miles) = 4.667 miles. 


To determine the standard deviation of the lengths of his cab rides in miles, we use 
the following equation: 


SD(Fare) = 2.7 SD(miles) = 10.20 = 2.7 SD(miles) — SD(miles) = 3.778 miles 
The mean and standard deviation of the lengths of his cab rides, in miles, are 4.667 
miles and 3.778 miles, respectively. 

2500 

285d 

Z2.37-¢ 


2.39 The distribution is skewed to the right. The two largest values appear to be 
outliers. The data are centered roughly around a median of 15 minutes and the 
interquartile range is approximately 10 minutes. 


Section 2.2 
Answers to Check Your Understanding 


page 119: 1. The graph is shown here. 


——E 
57.0 59.5 62.0 64.5 67.0 69.5 72.0 
Height (in.) 


2. Because 69.5 inches is 2 standard deviations above the mean, approximately 
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100%—95% _ 9 & 
100%-95%2=2.5% 2 2.5% of young women have heights greater than 


69.5 inches. Therefore, 97.5% of young women have heights less than 69.5 inches. 


3. Because 62 is 1 standard deviation below the mean, approximately 100% 


100%—-68% _ ey 
—68%2=16% 2 7 ” of young women have heights below 62 inches. 
This is not unusually short. 

» — 240-170 _ 5 99 
page 127: 1. (i) z=240-17030=2.33" 30 = ~**”. Table A: The proportion 


of z-scores above 2.33 is 1 — 0.9901 = 0.0099. Tech: normalcdf(lower: 2.33, upper: 
1000, mean: 0, SD: 1) = 0.0099 (ii) normalcdf(lower: 240, upper: 1000, mean: 170, 
SD: 30) = 0.0098 About 1% of 14-year-old boys have cholesterol above 240 mg/dl. 
2. (i) z=200-17030=1 and z=240-17030=2.33. 

200-170 240-170 _ 9 99° 


z= —— =1 z= 
30 and 30 Table A: The proportion of z-scores 


below z = 1.00 is 0.8413 and the proportion of z-scores below 2.33 is 0.9901. Thus, 
the proportion of z-scores between 1 and 2.33 is 0.9901 — 0.8413 = 0.1488. Tech: 
normalcdf(lower: 1, upper: 2.33, mean: 0, SD: 1) = 0.1488 (ii) normalcdf(lower: 
200, upper: 240, mean: 170, SD: 30) = 0.1488. About 15% of 14-year-old boys 
have cholesterol between 200 and 240 mg/dl. 

page 133: 1. (i) Table A: Look in the body of Table A for the value closest to 0.10. 
A z-score of —1.28 gives the closest value (0.1003). Tech: invNorm(area: 0.10, 
mean: 0, SD: 1) = -1.28 

—1.28=x-17030 |= -38.4=x-170 | x=131.6 


—1.28 = —-— — —38.4= 2-170 > z= 131.6 


(ii) invNorm(area: 0.10, mean: 170, SD: 30) = 131.6 About 10% of 14-year old 
boys have cholesterol levels that are less than 131.6, so a 14-year-old boy who has 
a cholesterol level of 131.6 would be at the 10th percentile of the distribution. 


Answers to Odd-Numbered Section 2.2 Exercises 


2.41 (a) The density curve is shown here. 


Cat ei 
01234567 8 9 10 
Time spent waiting for 

the bus (min) 
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(b) Area 5 (base)(height) = (5.3 — 2.5)(0.1) = 0.28 = 28%. On about 28% of days, 
Sally waits between 2.5 and 5.3 minutes for the bus. (c) The 70th percentile will 
have 70% of the wait times to the left of it, so the 70th percentile of Sally’s wait 
times is 7 minutes. 


2.43 (a) The density curve must have the height 0.25 because the area must equal 
1. (base)(height) = (4)(0.25) = 1. (b) Area = (base) (height) = (5 — 3.75)(0.25) = 
0.3125 = 31.25%. About 31.25% of the time, the light will flash more than 3.75 
seconds after the subject clicks “Start.” (c) The 38th percentile of this distribution 
is the time for which 38% of the observations are below it. 

Area = (base)(height) — 0.38 = (x — 1)(0.25) = 1.552 =x-1 = x=2.52 
Therefore, the 38th percentile of this distribution is 2.52 seconds. Thirty-eight 
percent of the time the light will flash within 2.52 seconds after the subject clicks 
“Start.” 

2.45 (a) Mean is C, median is B (the right skew pulls the mean to the right of the 
median). (b) Mean is B, median is B (this distribution is symmetric, so mean = 
median). 

2.47 The Normal density curve with mean 9.12 and standard deviation 0.05 is 
shown here. 


8.97 9.02 9.07 9.12 9.17 9.22 9.27 
Weight (07) 

2.49 The mean is at the balance point of the distribution, which appears to be 10. 
We know that the curve gets very close to the horizontal axis around 3 standard 
deviations from the mean. Observing the positive side of the curve, the curve 
shown seems to approach the horizontal axis around the value 16, so 16 should be 
approximately 3 standard deviations above the mean. Because 16 — 10 = 6, we 
estimate 3 standard deviations to be 6 and therefore 1 standard deviation would be 
about 2 units. 


2.51 (a) The value 9.02 is 2 standard deviations below the mean, so approximately 


100 %—95 % . 
100%-95%2=2.5% 2 = 2-9 of bags weigh less than 9.02 ounces. (b) 


The value 9.07 is 1 standard deviation below the mean. About 100%-68%2=16% 


100 %—68 % 16% 
2 a ° of the bags weigh less than 9.07 ounces. In other words, 9.07 is 
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approximately the 16th percentile of the weights of these potato chip bags. 

z= 22 = -2.40; 
2.53 (i) z=9-9.120.05=-2.40;" 0.05 =" ”* ‘the proportion of z-scores 
below —2.4 is 0.0082. (ii) normalcdf(lower: —1000, upper: 9, mean: 9.12, SD: 0.05) 
= 0.0082 


About 0.82% of 9-ounce bags of this brand of potato chips weigh less than the 


advertised 9 ounces. This is not likely to pose a problem because the percentage of 


bags that weigh less than the advertised amount is very small. 


, — 2400-2000 _ 9 go. 
2.55 (i) z=2400-2000500=0.80;5 500 = ~*~ ”* the proportion of z-scores 


above 0.80 is 1 — 0.7881 = 0.2119. (ii) normalcdf(lower: 2400, upper: 1000000, 
mean: 2000, SD: 500) = 0.2119. About 21.19% of the meals ordered exceeded the 
recommended daily allowance of 2400 mg of sodium. 


2.57 (i) 2=1200-2000500=-1.60, z=1800-—2000500=—0.40; 


1200—2000 1800—2000 
= —_— = - 1.60, z = ——— = |-0.40; , 
500 500 the proportion of z-scores 


between —1.60 and —0.40 is 0.3446 — 0.0548 = 0.2898. (ii) normalcdf(lower: 1200, 
upper: 1800, mean: 2000, SD: 500) = 0.2898. About 28.98% of meals ordered had 
between 1200 mg and 1800 mg of sodium. 

2.59 (a) z = —-1.66 

Table A: The proportion of z-scores above z = —1.66 is 1 — 0.0485 = 0.9515. Tech: 

normalcdf(lower: —1.66, upper: 1000, mean: 0, SD: 1) = 0.9515. The proportion of 
observations in a standard Normal distribution that satisfy z > —-1.66 is 0.9515. (b) 

z = —1.66 and z = 2.85 Table A: The proportion of z-scores between —1.66 and 2.85 
is 0.9978 — 0.0485 = 0.9493. Tech: normalcdf(lower: —1.66, upper: 2.85, mean: 0, 

SD: 1) = 0.9494. The proportion of observations in a standard Normal distribution 
that satisfy -1.66 < z < 2.85 is 0.9494. 


A 3-5.3 — —*% 456@- 
2.61 (a) (i) z=3-5.30.9=—2.56;~ 0.9 si the proportion of z-scores 
below —2.56 is 0.0052. (ii) normalcdf(lower: —1000, upper: 3, mean: 5.3, SD: 0.9) 
= 0.0053. About 53 out of every 10,000 times Mrs. Starnes completes an easy 


& 


Sudoku puzzle, she does so in less than 3 minutes. This is a proportion of 0.0053. 


— £53 — 9.78. 
09 “*""* the proportion of z-scores above 0.78 


(b) (i) z=6-5.30.9=0.78;7 
is 1 — 0.7823 = 0.2177. 
(ii) normalcdf(lower: 6, upper: 1000, mean: 5.3, SD: 0.9) = 0.2183 


About 21.83% of the time, it takes Mrs. Starnes more than 6 minutes to complete 
an easy puzzle. (c) (i) z=6-5.30.9=0.78, z=8-5.30.9=3; 
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653 _ 9.78, z= £53 = 3. 
09 ~~? “~~ 0.9“? the proportion of z-scores between 0.78 and 


3.00 is 0.9987 — 0.7823 = 0.2164. (ii) normalcdf(lower: 6, upper: 8, mean: 5.3, SD: 
0.9) = 0.2170. About 21.7% of easy puzzles take Mrs. Starnes between 6 and 8 
minutes to complete. 


2.63 (i) 0.20 area to the left of zz=-0.84.2 > 2 = —0.84. solving -0.84=x 


4 _. 2-53 
5309 °°" — 09° gives x = 4.544. (ii) invNorm(area: 0.2, mean: 5.3, SD: 


0.9) = 4.543 minutes. The 20th percentile of Mrs. Starnes’s Sudoku times for easy 


ra 
“a” 


problems is about 4.54 minutes. 


2.65 (i) 0.10 area to the left of z — z = —1.28 and 0.90 area to the left of z = z= 
1.28. (ii) invNorm(area: 0.10, mean: 0, SD: 1) = —1.282 and invNorm(area: 0.90, 
mean: 0, SD: 1) = 1.282. 


The first and last deciles of the standard Normal distribution are z = —1.282 and z = 
1.282, respectively. 
2.67 (a) (i) z2=125-11025=0.6, z=150-11025=1.6; 
125-110 _qg , — 150-110 _ 1 ¢. 
25 7) “25? the proportion of z-scores between 0.6 


and 1.6 is 0.9452 — 0.7257 = 0.2195. (ii) normalcdf(lower: 125, upper: 150, mean: 
110, SD: 25) = 0.2195. About 22% of 20- to 34-year-olds have IQ scores between 
125 and 150. (b) (i) 98% area to the left of z = z = 2.05. Solving 2.05=x-11025 


_ z-110 
ol a gives X = 161.25. (ii) invNorm(area: 0.98, mean: 110, SD: 25) = 


161.34. Scores greater than 161.34 qualify for MENSA. 
3.95-3.98 _ 


A 
a 


y 


2.69 (a) (i) z=3.95-3.980.02=-1.5;~ “Tia the proportion of z- 
scores below —1.5 is 0.0668. (ii) normalcdf(lower: —1000, upper: 3.95, mean: 3.98, 
SD: 0.02) = 0.0668. About 7% of the large lids are too small to fit. (b) (i) 


_ 4.05-3.98 _ og. 


z7=4,05-3.980.02=3.5;, 002. ~”*”? the proportion of z-scores above 3.50 
is approximately 0. (ii) normalcdf(lower: 4.05, upper: 1000, mean: 3.98, SD: 0.02) 
= 0.0002. Approximately 0% of the large lids are too big to fit. (c) It makes more 
sense to have a larger proportion of lids too small rather than too big. If lids are too 
small, customers will just try another lid. But if lids are too large, the customer 
may not notice and then spill the drink. 


2.71 We are looking for the value of z with an area of 0.15 to the right and 0.85 to 
the left, and the value of z with an area of 0.03 to the right and 0.97 to the left. We 
get the values z = 1.04 and z = 1.88, respectively. Now we need to solve the 
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following system of equations for p and o:1.04=60-p~o and 1.88=75-p0. 

F _ 60-p a 75—u 
panda : 1.04 = ~~ and 1.88 = ——. Multiplying both sides of the equations 
by o and subtracting yields 0.840 = 15 or o = 17.86 minutes. Substituting this value 
back into the first equation, we obtain 1.04=60-117.86, or 


60— jp 
1=60-1,04(17.86)=41.43!°04 = Tap? OF pf = 60 — 1.04(17.86) = 41.43 


minutes. 

2.73 The distribution of highway gas mileage is not approximately Normal because 
the distribution is skewed to the right. 

2.75 The histogram of these data is roughly symmetric and bell-shaped. The mean 
and standard deviation of these data are x _=15.825% = 15.825 cubic feet and s, = 
1.217 cubic feet. 


e x +1sx=(14.608, 17.042);2 + 182 = (14.608, 17.042); 94 of 36 
observations, or 66.7% of the observations, are within 1 standard 
deviation of the mean. 


© x +2sx=(13.391, 18.259);2 + 282 = (13.391, 18.259); 34 of 36 
observations, or 94.4% of the observations, are within 2 standard 
deviations of the mean. 


e x +3sx=(12.174, 19.476);2 = 382 = (12.174, 19.476); 36 of 36 
observations, or 100% of the observations, are within 3 standard 
deviations of the mean. 

These percentages are quite close to what we would expect based on the 

68-95-99.7 rule. Combined with the graph, this provides good evidence that this 

distribution is approximately Normal. 

2.77 The distribution of tuitions in Michigan is not approximately Normal. If it 

were Normal, then the minimum value would be around 3 standard deviations 

below the mean. However, the actual minimum has a z-score of just 
1873—10,614 


2=1873~-10,6148049=-1.09.” - 8049 iiiane Also, if the distribution 


were Normal, the minimum and maximum would be about the same distance from 
the mean. However, the mean is much farther from the maximum (30,823 — 10,614 
= 20,209) than from the minimum (10,614 — 1873 = 8741). 

2.79 The distribution is approximately Normal because the Normal probability plot 
is nearly linear. 
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2.81 The sharp curve in the Normal probability plot suggests that the data are right- 
skewed. This can be seen in the steep, nearly vertical section in the lower left. 
These numbers were much closer to the mean than would be expected in a Normal 
distribution, meaning that the values that would be in the left tail are piled up close 
to the center of the distribution. 

2.83 (a) A Normal probability plot is shown. 


> ul 


Expected z-score 


Usable capacity (ft5) 


(b) The plot is fairly linear, indicating that the distribution of usable capacity is 
approximately Normal. 

2.85 b 

2.87 b 

2.89 a 

2.91 36 of the 38 family incomes in Indiana are below $95,000. Because 36/38 = 
0.95, or 95%, this individual’s income is at the 95th percentile. 35 of the 44 family 
incomes in New Jersey are below $95,000. Because 35/44 = 0.80, or 80%, this 


individual’s income is at the 80th percentile. 

x = 25:000-47,400 _ 4 go 
Indiana: z=95,000-47,40029,400=1.62 29,400 

x = 25:000-58,100 _ 4 9g 
New Jersey: z=95,000-58,10014,900=0.88 14,900 


The individual from Indiana has an income that is 1.62 standard deviations 
above the mean of $47,400. The individual from New Jersey has an income that is 
0.88 standard deviation above the mean of $58,100. 


> 


> 


The individual from Indiana has a higher income, relative to others in his or her 
state because he or she had a higher percentile (95th versus 80th) and had a higher 
z-score (1.62 versus 0.88) than the individual from New Jersey. 


Answers to Chapter 2 Review Exercises 


, — 179-170 _ j 9. 
R2.1 (a) z=179-1707.5=1.20;~ 75 ~*""* Paul’s height is 1.20 standard 
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deviations above the average male height for his age. (b) 85% of boys Paul’s age 
are shorter than Paul. 


R2.2 (a) Reading up from 7 hours on the x-axis to the graphed line and then across 
to the y-axis, we see that 7 hours corresponds to about the 58th percentile. (b) To 
find Q, start at 25 on the y-axis, move across to the line and down to the x-axis. 
Q, is approximately 2.5 hours. To find Q3, start at 75 on the y-axis, move across to 
the line and down to the x-axis. Q3 is approximately 11 hours. Thus, [QR = 11 - 
2.5 = 8.5 hours per week. 


R2.3 (a) If we converted the guesses from feet to meters, the shape of the 
distribution would not change. The new mean would be 43.73.28=13.32 

7 _ y) aE 

3.28 aia meters, the median would be 423.28=12.80 3.28 12.80 meters, the 


12.5 


standard deviation would be 12.53.28=3.81 3.28 81 meters, and the IQR would 
12.5 _ « 
be 12.53.28=3.81 3.28 — ot meters. (b) The mean error would be 43.7 — 42.6 = 


1.1 feet. The standard deviation of the errors would be the same as the standard 
deviation of the guesses, 12.5 feet. (c) A total of 61 of the 66 students estimated the 
width of their classroom to be less than 63 feet, so the student who estimated the 
classroom width as 63 feet is at the 61/66 = 0.924 = 92.4th percentile. 


R2.4 (a) The percent of observations that have values less than 13 is 1 — 0.08 = 
0.92 = 92%. (b) Answers will vary, but the line indicating the median (line A in the 
graph) should be slightly to the right of the main peak, with half of the area to the 
left and half to the right. (c) Answers will vary, but the line indicating the mean 
(line B in the graph) should be slightly to the right of the line for the median at the 
balance point. 


AB 


R2.5 (a) About 99.7% of the observations will fall within 3 standard deviations of 
the mean. About 97.7% of the babies had birth weights between 2135 and 5201 
2500-3668 


grams. (b) (i) 2=2500-3668511=-2.29;7— 811 — "3 the proportion of 


z-scores below —2.29 is 0.0110. (ii) normalcdf(lower: —1000, upper: 2500, mean: 
3668, SD: 511) = 0.0111. About 1% of babies will be identified as low birth 
weight. (c) (i) The first quartile is the boundary value with 25% of the area to its 
left — z = -0.67. The third quartile is the boundary value with 75% of the area to 
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_0.67 — 223668 


its left — z = 0.67. Solving -0.67=x-3668511 511 gives Q3 = 
0.67 — 723668 
3325.63. Solving 0.67=x-3668511 °° = 511_—s gives Q3 = 4010.37. (ii) 


invNorm(area: 0.25, mean: 3668, SD: 511) gives Q; = 3323.34 invNorm(area: 
0.75, mean: 3668, SD: 511) gives Q3 = 4012.66. The quartiles are Q; = 3323.34 
grams and Q3 = 4012.66 grams. 


R2.6 (a) (i) z=500-694112=-1.73, z=900-694112=1.84; 
= aN = -1.73, 2= SO = 1.84; 


vA 
“~ 


the proportion of z-scores below 
—1.73 is 0.0418. The proportion of z-scores above 1.84 is 1 — 0.9671 = 0.0329. (ii) 
normalcdf(lower: —1000, upper: 500, mean: 694, SD: 112) = 0.0418 and 
normalcdf(lower: 900, upper: 100000, mean: 694, SD: 112) = 0.0329. The percent 
of test-takers who earn a score less than 500 or greater than 900 on the GRE 
Chemistry test is 4.18% + 3.29% = 7.47%. 


_ z—694 
(b) (i) 99% area to left of z + z = 2.33. Solving 2.33=x-6941122°99 = Ta 


gives x = 954.96. (ii) invNorm(area: 0.99, mean: 694, SD: 112) = 954.55 
The 99th percentile score on the GRE Chemistry test is 954.55. 
R2.7 (a) (i) z=1.2-1.050.08=1.88, z=1-1.050.08=—-0.063; 


1,2—1.05 1—1.05 
—— = 1.88, z= = —0.063; 
0.08 0.08 the proportion of z-scores between 


-0.63 and 1.88 is 0.9699 — 0.2643 = 0.7056. (ii) normalcdf(lower: 1, upper: 1.2, 
mean: 1.05, SD: 0.08) = 0.7036. About 70% of the time the dispenser will put 
between 1 and 1.2 ounces of ketchup on a burger. (b) Because the mean of 1.1 is in 
the middle of the interval from 1 to 1.2, we are looking for the middle 99% of the 
distribution. This leaves 0.5% in each tail. 


= 
~~ 


AE 
(i) 0.005 area to the left of z ~ z = -2.58. Solving -2.58=1-1.10 7°28 = —s 


gives o = 0.039. A standard deviation of at most 0.039 ounce will result in at least 


99% of burgers getting between 1 and 1.2 ounces of ketchup. 

R2.8 The distribution of percent of residents aged 65 and older in the 50 states and 
the District of Columbia is roughly symmetric and somewhat bell-shaped. The 
mean and standard deviation of these data are x _=13.255%@ = 13.255 % ands, = 
1.668%. 


© x +1sx=(11.587, 14.923);2 = 182 = (11.587, 14.923); 49 of 51 
observations, or 78.4% of the observations, are within 1 standard 
deviation of the mean. 
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° x +2sx=(9.919, 16.591);2 + 282 = (9.919, 16.591); 4g of 51 
observations, or 94.1% of the observations, are within 2 standard 
deviations of the mean. 


© x +3sx=(8.251, 18.259);2 + 382 = (8.251, 18.259); 59 of 51 
observations, or 98.0% of the observations, are within 3 standard 
deviations of the mean. 

These percentages are close to what we would expect based on the 68-95-99.7 

rule. Combined with the graph, this provides good evidence that this distribution is 

approximately Normal. 

R2.9 The curve in the Normal probability plot suggests that the data are slightly 

right-skewed. This can be seen in the steep, nearly vertical section in the lower left. 

These numbers were much closer to the mean than would be expected in a Normal 

distribution, meaning that the values that would be in the left tail are piled up close 

to the center of the distribution. 


Answers to Chapter 2 AP® Practice Test 
T2Z1¢ 
T2.2d 
T2366 
T2.4b 
T2.5 a 
T2.6e 
T2.7 c 
T2.8d 
T2.9 b 
T2.10 c 


T2.11 (a) A total of 26 of the 40 sale prices were less than the house indicated in 


red on the dotplot, so that home is at the 26/40 = 0.65 = 65th percentile. (b) 
234000—203388 


Z=234000-20338887609=0.35. 87609 ici Interpretation: The sale 
price for this home is 0.35 standard deviation above the average sale price for the 
homes in the sample. 


6—7.11 


T2.12 (a) (i) z=6-7.110.74=-1.5;" en =e the proportion of z-scores 


1916 


below —1.5 is 0.0668. (ii) normalcdf(lower: —1000, upper: 6, mean: 7.11, SD: 0.74) 
= 0.0668. About 6.68% of the students ran the mile in less than 6 minutes, which 
means that (0.0668)(12000) = 801.6, or about 802, students ran the mile in less 
than 6 minutes. (b) (i) 0.90 area to the left of z = z = 1.28. Solving 1.28=x 


_ z-7.11 
-7,110.741'78 0.74 gives x = 8.06 minutes. (ii) invNorm(area: 0.90, mean: 


7.11, SD: 0.74) = 8.06 minutes. It took about 8.06 minutes for the slowest 10% of 
students to run the mile. (c) If the mile run times were converted from minutes to 

seconds, the mean would be (0.74)(60) = 44.4 seconds and the standard deviation 
would be (0.74)(60) = 44.4 seconds. (i) z=400-426.644.4=—0.60, 


_ 400-4266 , — 500-4266 _ ; 
2=500-426.644.4=1.65;7 — — aaa — 9-60, 2= “GF = 1.69; ine 


difference of z-scores between —0.60 and 1.65 is 0.9505 — 0.2743 = 0.6762. (ii) 
normalcdf(lower: 400, upper: 500, mean: 426.6, SD: 44.4) = 0.6763. About 67.6% 
of students who ran the mile had times between 400 and 500 seconds. 


T2.13 No, these data do not seem to follow a Normal distribution. First, there is a 
large difference between the mean and the median. In a Normal distribution, the 
mean and median are the same, but in this distribution the mean is 48.25 and the 
median is 37.80. Second, the distance between the minimum and the median is 
37.80 — 2 = 35.80, but the distance between the median and the maximum is 
204.90 — 37.80 = 167.10. In a Normal distribution, these distances should be the 
same. Because the mean is larger than the median and the distance from the 
median to the maximum is larger than the distance from the minimum to the 
median, the distribution of oil recovered appears to be skewed to the right. 


Chapter 3 

Section 3.1 

Answers to Check Your Understanding 

page 159: 1. The explanatory variable is the amount of sugar (in grams). The 
response variable is the number of calories. The amount of sugar helps to explain, 
or predict, the number of calories in movie-theater candy. 


2. A scatterplot is shown here. 
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3. There is a moderately strong, positive, linear relationship between the amount of 
sugar contained in movie-theater candy and the number of calories in the candy. 
The point for peanut M&M’S® is a potential outlier with 79 grams of sugar and 
790 calories. 


page 162: The correlation of r = —0.838 confirms that the linear association 
between dash time and long-jump distance is strong and negative. 


Answers to Odd-Numbered Section 3.1 Exercises 

3.1 (a) Water temperature is the explanatory variable and weight gain is the 
response variable. Water temperature may help predict or explain changes in the 
response variable, weight gain. Weight gain measures the outcome of the study. (b) 
Either variable could be the explanatory variable because each one could be used 
to predict or explain the other. 


3.3 A scatterplot is shown here. 
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3.5 Other than several athletes who weigh much more than other athletes of the 
same height, there is a moderately strong, positive, linear relationship between 
height and weight for these athletes. 

3.7 There is a moderately strong, positive, linear association between backpack 
weight and body weight for these students. There is one possible outlier in the 
graph—the hiker with body weight 187 pounds and pack weight 30 pounds. This 
hiker makes the form appear to be nonlinear for weights above 140 pounds. 


3.9 (a) A scatterplot with speed as the explanatory variable is shown. 
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Fuel used (liters/100 km) 
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Speed (km/h) 
(b) There is a strong, nonlinear relationship between speed and amount of fuel 
used. The relationship is negative for speeds up to 60 km/h and positive for speeds 
beyond 60 km/h. There are no clear outliers. 


3.11 For both groups of athletes, there is a moderately strong, positive, linear 
association between height and weight; however, athletes who participate in the 
shot put, discus throw, and hammer throw tend to weigh more than other track and 
field athletes of the same height. 

3.13 The relationship is positive, so r > 0. Also, r is closer to 1 than to 0 because 
the relationship is strong. 


3.15 The correlation of 0.92 indicates that the linear relationship between number 
of turnovers and number of points scored for players in the recent NBA season is 
strong and positive. 

3.17 Probably not. Although there is a strong, positive association, an increase in 
turnovers is not likely to cause an increase in points for NBA players. It is likely 
that both of these variables are changing due to several other variables, such as 
time played. 

3.19 (a) The correlation of 0.87 indicates that the linear relationship between 
amount of sodium and number of calories is strong and positive. (b) The hot dog 
with the lowest calorie content increases the correlation. It falls in the linear pattern 
of the rest of the data. 


3.21 (a) The scatterplot shows a strong, positive, linear relationship between the 
femur lengths and humerus lengths. It appears that all 5 specimens come from the 
same species. 
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Humerus length (cm) 
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Femur length (cm) 


(b) The femur measurements have mean of 58.2 cm and a standard deviation of 
13.2 cm. The humerus measurements have a mean of 66 cm and a standard 

deviation of 15.89 cm. The table shows the standardized measurements (labeled 
zfemur and zhumerus) and the product (zfemur x zhumerus) of the standardized 


measurements. 
Femur Humerus z femur z humerus Product 
38 41 —1.53030 =1,57332 2.40765 
56 63 —1.16667 —0.18880 0.03147 
59 70 0.06061 0.25173 0.01526 
64 72 0.43939 0.37760 0.16591 
74 84 1.19697 1.13279 1.35591 


The sum of the products is 3.97620, so the correlation coefficient is 


_ Mewes. ste 
'=14(3.97620)=0,9941." = ¢(3-97620) = 0.9941. 


correlation confirms the strong, positive linear association between femur length 


The very high value of the 


and humerus length in the scatterplot from part (a). 


3.23 (a) Correlation is unitless. (b) The correlation would stay the same. 
Correlation makes no distinction between explanatory and response variables. (c) 
If sodium was measured in grams instead of milligrams, the correlation would still 
be 0.87. Because r uses the standardized values of the observations, r does not 
change when we change the units of measurement of x, or y, or both. 


3.25 We would expect the height of women at age 4 and their height as women at 
age 18 to be the highest correlation because it is reasonable to expect taller children 
to become taller adults and shorter children to become shorter adults. The next 
highest would be the correlation between the heights of male parents and their 
adult children because they share genes. Tall fathers tend to have relatively tall 
sons, and short fathers tend to have relatively short sons. The lowest correlation 
would be between husbands and their wives. Some tall men may prefer to marry 
tall women, but this isn’t always the case. 
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3.27 Answers will vary. Here is one possibility. 


Rob’s score 


T 
50 60 70 80 90 100 


Marc’s score 
3.29 a 
3.31d 
3.33 b 
3.35 One possible histogram is shown here. 
25 4 
20 4 
g 15 a 
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Weight (mg) 
The distribution of weight is skewed to the right with several possible high outliers. 
The median weight is 5.4 mg and the JQR is 5.5 mg. 


Section 3.2 

Answers to Check Your Understanding 

page 181: 1. y\=100+40(16)=740 grams¥ = 100 + 40(16) = 740 grams 

2. The residual = actual y — predicted y = 700 — 740 = —40. Interpretation: This rat 
weighed 40 grams less than the weight predicted by the regression line with x = 16 
weeks old. 

3. The time is measured in weeks for this equation, so 2 years converts to 104 
weeks. We then predict the rat’s weight to be yA=100+40(104)=4260 grams, 

y = 100 + 40(104) = 4260 grams, \ nich is equivalent to 9.4 pounds (about the 
weight of a large newborn human). This is unreasonable and is the result of 
extrapolation. 

page 182: 1. The slope is 40. Interpretation: The predicted weight goes up by 40 
grams for each increase of 1 week in the rat’s age. 2. Yes, the y intercept has 
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meaning in this context. The y intercept, 100, is the predicted weight (in grams) for 
a rat at birth (x = 0 weeks). 

page 188: 1. The equation of the least-squares regression line is 
y\=16.2649+0.0908x,Y = 16.2649 + 0.09082, where x = body weight and y\¥ = 
the predicted backpack weight. 


2. A residual plot is given. 


Residual (Ib) 
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3. Because there appears to be a negative—positive—negative pattern in the residual 
plot, a linear model is not appropriate for these data. 

page 194: 1. yA=33.347+13.2854x,Y = 33.347 + 13.2854a, where x = duration of 
the most recent eruption (min) and yy = predicted interval of time until the next 
eruption (min). 

2. The predicted interval of time until the next eruption goes up by 13.2854 
minutes for each increase of 1 minute in the duration of the most recent eruption. 

3. The actual interval of time until the next eruption (min) is typically about 6.49 
minutes away from the time predicted by the least-squares regression line with x = 


duration of the most recent eruption (min). 


4. r? = 85.4% of the variability in interval is accounted for by the least-squares 
regression line with x = duration. 


Answers to Odd-Numbered Section 3.2 Exercises 

3.37 (a) y\=60.7+0.139(200)=88.59 = 60.7 + 0.139(200) = 88.5 (p) 
y=60.7+0.139(400)=116.3(c)Y = 60.7 + 0.139(400) = 116.3(¢) (¢y Tam more 
confident in the prediction in part (a) than the prediction in part (b). The payroll for 
the teams varied from about $75 million to about $275 million. $200 million is in 


this interval of payrolls, but $400 million is not, which makes the prediction in part 
(b) an extrapolation. 


3.39 The predicted number of wins for the Chicago Cubs, who spent $182 million 
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on payroll, is y\=60.7+0.139(182)=85.998 wins. 

y = 60.7 + 0.139(182) = 85.998 wins. 10 residual = actual y — predicted y = 
103 — 85.998 = 17.002. Interpretation: The Chicago Cubs won 17.002 more games 
than the number of games predicted by the regression line with x = $182 million. 
3.41 (a) The slope is 0.139. Interpretation: The predicted number of wins goes up 
by 0.139 for each increase of $1 million in payroll. (b) The y intercept does not 
have meaning in this context. It is not reasonable for a team to have a payroll of $0. 
3.43 (a) The predicted number of steps for Kiana, who is 67 inches tall, is 
y\=113.6-0.921(67)=51.898 steps.J = 113.6 — 0.921(67) = 51.893 steps. The 
residual = actual y — predicted y = 49 — 51.893 = —2.893. Interpretation: Kiana 
took about 2.893 fewer steps than the number of steps predicted by the regression 
line with x = 67 inches. (b) Since Matthew is 10 inches taller than Samantha, I 
expect Matthew to take (10)(—0.921) = 9.21 fewer steps than Samantha. 

3.45 (a) The regression lines are nearly parallel, but the y intercept is much greater 
for the throwers. (b) Discus thrower: yA=—-115+5.13(72)=254.36 pounds; 

= —115 + 5.13(72) = 254.36 pounds; 79 inch sprinter: yA= 
-297+6.41(72)=164.52 pounds.Y = —297 + 6.41(72) = 164.52 pounds. Baseq 
on the least-squares regression lines computed from the data, we expect a 72-inch 
discus thrower to weigh about 89.84 pounds more than a 72-inch sprinter. 

3.47 No; there is an obvious negative—positive—negative pattern in the residual plot 
so a linear model is not appropriate for these data. A curved model would be better. 
3.49 The predicted mean weight of infants in Nahya who are 1 month old is 
y\=4,88+0.267(1)=5.147 kg.Y = 4-88 + 0.267(1) = 5.147 kg. From the residual 
plot, the mean weight of 1-month-old infants is about 0.85 kg less than predicted. 
So the actual mean weight of the infants when they were 1 month old is about 
5.147 — 0.85 = 4.297 kg. 

3.51 (a) See part (b). (b) y\=300.04+2.829x,Y = 300.04 + 2.8292, where yrY = 
the predicted number of calories and x = the amount of sugar (in grams). 


Number of calories 
| 


40 60 80 100 120 140 160 
Sugar (g) 
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(c) The line calculated in part (b) is called the “least-squares” regression line 
because this is the line that makes the sum of the squares of the residuals as small 
as possible. 


3.53 A residual plot is shown. 


Residual 


40 60 Ai io 120 ae 
Amount of sugar (g) 

The linear model relating the amount of sugar to the number of calories is 
appropriate because there is no leftover pattern in the residual plot. The residuals 
look randomly scattered around the residual = 0 line. 
3.55 (a) The actual number of strides required to walk the length of a school 
hallway is typically about 3.50 away from the number predicted by the least- 
squares regression line with x = height of a student (in inches). (b) About 39.9% of 
the variability in number of steps required to walk the length of a school hallway is 
accounted for by the least-squares regression line with x = height of a student (in 
inches). 
3.57 (a) The predicted free skate score is y\=—-16.2+2.07(78.5)=146.295. 
y = —16.2 + 2.07(78.5) = 146.295. The residual is y 


—yA=150.06-146.295=3.765.¥ ~ ¥ = 150.06 — 146.295 = 3.765. Interpretation: 
Yu-Na Kim’s free skate score was 3.765 points higher than predicted based on her 
short program score. (b) The slope is 2.07. Interpretation: The predicted free skate 
score increases by 2.07 points for each additional 1-point increase in the short 
program score. (c) The actual free skate score is typically about 10.2 points away 
from the score predicted by the least-squares regression line with x = short program 
score. (d) About 73.6% of the variability in free skate score is accounted for by the 
least-squares regression line with x = short program score. 

3.59 (a) Yes; there is no leftover pattern in the residual plot, so a linear model is 
appropriate for these data. (b) r = 60.21% and the slope is positive, so the 
correlation is r=0.6021=0.776." = V0.6021 = 0.776. (c) yA=1,0021+0.0708x, 

y = 1.0021 + 0.07082, where x is the number of Mentos and yA is the predicted 
amount expelled. (d) The value of s = 0.067 ml. Interpretation: The actual amount 
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expelled is typically about 0.067 ml away from the amount predicted by the least- 
squares regression line with x = number of Mentos. The value of r is 60.21%. 
Interpretation: About 60.21% of the variability in amount expelled is accounted 
for by the least-squares regression line with x = number of Mentos. 


3.61 (a) y\=11.898-0.041(42)=10.176 mph. 


y = 11.898 — 0.041(42) = 10.176 mph. The residual = actual y _ predicted y = 


2.2 — 10.176 = —-7.976 mph. Interpretation: The actual average wind speed was 
7.976 mph less than the average wind speed predicted by the regression line with x 
= 42°F, (b) The slope is —0.041. Interpretation: The predicted average wind speed 
decreases by 0.041 mph for each additional 1-degree increase in average 
temperature (in degrees Fahrenheit). (c) The actual average wind speeds typically 
vary Ss = 3.66 mph from the values predicted by the least-squares regression line 
using x = average temperature. (d) r* = 4.8% of the variability in average wind 
speed is accounted for by the least-squares regression line using x = average 
temperature. 

| b, = 0.5 (32) = 0.54. | 
3.63 (a) The slope is b1=0.5(2.72.5)=0.54. _ The y intercept 
is bg = 68.5 — 0.54(64.5) = 33.67. So the equation for predicting y = husband’s 
height from x = wife’s height is y\=33.67+0.5x,Y = 33.67 + 0.52, (b) If the value 
of x is one standard deviation below x @, the predicted value of y will be r 
standard deviations of y below y. So, the predicted value for the husband is 68.5 — 
0.5(2.7) = 67.15 inches. 


3.65 (a) yA=x,Y = ©, where yy = predicted grade on final and x = grade on 
midterm. (b) A student with a score of 50 on the midterm is predicted to score 
y\=46.6+0.41(50)=67.19 = 46-6 + 0.41(50) = 67.1 on the final. A student with 


a score of 100 on the midterm is predicted to score y\=46.6+0.41(100)=87.6 


y = 46.6 + 0.41(100) = 87.6 oy the final. (c) These predictions illustrate 


regression to the mean because the student who did poorly on the midterm (50) is 
predicted to do better on the final (closer to the mean), whereas the student who did 
very well on the midterm (100) is predicted to do worse on the final (closer to the 
mean). 

3.67 (a) Because Jacob has an above-average height and an above-average vertical 
jump, his point increases the positive slope of the least-squares regression line and 
decreases the y intercept. (b) Jacob’s vertical jump is farther from the least-squares 
regression line than the other students’ vertical jumps. Because Jacob’s point has 
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such a large residual, it increases the standard deviation of the residuals. Also, 
because Jacob’s vertical jump is farther from the least-squares regression line than 
the other students’ vertical jumps, the value of r* decreases. The linear association 
is weaker because of the presence of this point. 

3.69 (a) A scatterplot of this relationship is shown here. There is a moderate, 
positive, linear association between HbA and FBG. There are possible outliers to 
the far right (subject 18) and near the top of the plot (subject 15). 
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(b) Because the point for subject 18 is in the positive, linear pattern formed by 
most of the data values, it will make the correlation closer to 1. Also, because the 
point is likely to be below the least-squares regression line, it will “pull down” the 
line on the right side, making the slope closer to 0. Without the outlier, the 
correlation decreases from r = 0.4819 to r = 0.3837, as expected. Likewise, without 
the outlier, the equation of the line changes from y\=66.4+10.4x to y4=52.34+12.1x. 
y = 66.4+ 10.42 toy = 52.3 + 12.12. (c) The point for subject 15 makes the 
correlation closer to 0 because it decreases the strength of what would otherwise be 
a moderately strong positive association. Because this point’s x coordinate is very 
close to x ©, it won’t influence the slope very much. However, it will make the y 
intercept increase because its y coordinate is so large compared to the rest of the 
values. Without the outlier, the correlation increases from r = 0.4819 to r = 0.5684, 
the slope changes from 10.4 to 8.92, and the y intercept increases from 66.4 to 
69.5. 

aia 

3.73 C 

3.75 d 

3.77 b 


» — 25-187 _ 
3.79 (a) (i) z=25-18.74.3=1.47;~ 4.3 Bet 0.9292. (ii) normalcdf(lower: 


—1000, upper: 25, mean: 18.7, SD: 4.3) = 0.9286. About 93% percent of vehicles 
get worse combined mileage than the Chevrolet Malibu. (b) Table A: Look in the 
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body of Table A for the value closest to 0.90. A z-score of 1.28 gives the closest 


1.28 — r—18.7 
value (0.8997). Solving 1.28=x-18.74.3 43° gives x = 24.2. Tech: 


invNorm(area: 0.9, mean: 18.7, SD: 4.3) = 24.2. The top 10% of all vehicles get at 
least 24.2 mpg. 


Answers to Chapter 3 Review Exercises 

R3.1 (a) There is a moderate, positive, linear association between gestation and life 
span. Without the outliers at the top and in the upper right, the association appears 
moderately strong, positive, and curved. (b) The hippopotamus makes the 
correlation closer to 0 because it decreases the strength of what would otherwise be 
a moderately strong positive association. Because this point’s x coordinate is very 
close to x ©, it won’t influence the slope very much. However, it makes the y 
intercept higher because its y coordinate is so large compared to the rest of the 
values. Because it has such a large residual, it increases the standard deviation of 
the residuals. (c) Because the Asian elephant is in the positive, linear pattern 
formed by most of the data values, it will make the correlation closer to 1. Also, 
because the point is likely to be above the least-squares regression line, it will “pull 
up” the line on the right side, making the slope larger and the y intercept smaller. 
Because this point is likely to have a small residual, it decreases the standard 
deviation of the residuals. 

R3.2 (a) A positive relationship means that dives with larger values of depth also 
tend to have larger values of duration. (b) A linear relationship means that when 
depth increases by 1 meter, dive duration tends to change by a constant amount, on 
average. (c) A strong relationship means that the (dive depth, dive duration) data 
points fall close to a line. (d) If the variables are reversed, the correlation will 
remain the same. However, the slope and y intercept will be different. 

R3.3 (a) A linear model is appropriate for these data because there is no leftover 
pattern in the residual plot. (b) Because r* = 0.837 and the slope is positive, the 
correlation r=+0.837=0.915." = +V0.837 = 0.915. Interpretation: The 
correlation of r = 0.915 confirms that the linear association between the age of cars 
and their mileage is strong and positive. (c) y\=3704+12,188x, 

y = 3704 + 12, 1882, where x represents the age and yA represents the predicted 
mileage of the cars. (d) For a 6-year-old car, the predicted mileage is 
y\=3704+12,188(6)=76,832.Y = 3704 + 12, 188(6) = 76, 832. The residual for 
this particular car is y—y\=65,000-—76,832=-11,832. 
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y—y = 65,000 — 76,832 = —11,832. Interpretation: The actual number of 
miles this teacher has driven was 11,832 less than the number of miles predicted by 
the regression line with x = 6 years. (e) The value of s = 20,870.5 miles. 
Interpretation: The actual number of miles is typically about 20,870.5 miles away 
from the number of miles predicted by the least-squares regression line with x = 
age (in years). The value of r* = 83.7%. Interpretation: About 83.7% of the 
variability in mileage is accounted for by the least-squares regression line with x = 
age (in years). 

R3.4 (a) Average March temperature is the explanatory variable because changes 
in March temperature probably have an effect on the date of first blossom. Also, 
we are predicting the date of first blossom from temperature. 


30 4 


23 = 


Days in April to first blossom 


Temperature in Celsius 


(b) The correlation is r = —0.85; y\=33.12-4.69x,Y = 33.12 — 4.692, where x 
represents the average March temperature and yA represents the predicted number 
of days. The slope is —4.69. Interpretation: The predicted number of days in April 
to the first blossom decreases by 4.69 days for each additional 1-degree increase in 
average March temperature (in degrees Celsius). The y intercept tells us that if the 
average March temperature was 0 degrees Celsius, the predicted number of days in 
April to first blossom is 33.12 (May 3). However, x = 0 is outside of the range of 
data, so this prediction is an extrapolation and may not be trustworthy. (c) No, x = 
8.2 is well beyond the values of x we have in the data set (1.5 to 6.2). This 
prediction would be an extrapolation. (d) The predicted number of days until first 
blossom when the average March temperature was 4.5°C is 
y\=33.12—4.69(4.5)=12.015.9 = 33-12 — 4.69(4.5) = 12.015. The residual is y 
~yA=10-12.015=-2.015.¥ ~ ¥ = 10 —12.015 = —2.015. Inrerpretation: The 
actual number of days until first blossom was 2.015 days less than the number of 
days predicted by the regression line with x = 4.5°C. 
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Temperature in Celsius 
There is no leftover pattern in the residuals, indicating that a linear model is 
appropriate. 
R3.5 (a) b1=0.6(830)=0.16; boO=75-0.16(280)=30.2; yA=30.2+0.16x, 


po (3) = 0.16; bp = 75 — 0.16(280) = 30.2; 7 = 30.2 + 0.162, 
, where 


yy = the predicted final exam score and x = total score before the final 
examination. (b) y\=30.2+0.16(300)=78.29 = 30.2 + 0.16(300) = 78.2 (.) The 
least-squares regression line is the line that minimizes the sum of squared distances 
between the actual exam scores and predicted exam scores. (d) Because r = 0.36, 
only 36% of the variability in the final exam scores is accounted for by the linear 
model relating final exam scores to total score before the final exam. More than 
half (64%) of the variation in final exam scores is not accounted for by the least 
squares regression line, so Julie has a good reason to think this is not a good 
estimate. 

R3.6 Even though there is a high correlation between number of calculators and 
math achievement, we shouldn’t conclude that increasing the number of calculators 
will cause an increase in math achievement. It is possible that students who are 
more serious about school have better math achievement and also have more 
calculators. 


Answers to Chapter 3 AP® Practice Test 
T3408 
Ta.2d 
T3.3¢ 
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T3.4a 
T35c 
T3.6 b 
17 e 
T3.8 b 
13,9 
Ta10.d 


T3.11 (a) There is a strong, positive linear association between Sarah’s age and her 
height. 


90.0 e 


Height (cm) 


35 40 45 50 55 60 
Age (months) 


(b) The regression line for predicting y = height from x = age is 
y\=71.95+0.3833x.Y = 71.95 + 0.38332. (c) At age 48 months, we predict 


Sarah’s height to be y\=71.95+0.3833(48)=90.348 cm. 
§ = 71.95 + 0.3833(48) = 90.348 cm. The residual for Sarah is y 


~y=90-90.348=-0.348.Y — ¥ = 90 — 90.348 = —0.348. Interpretation: Sarah’s 
actual height was 0.348 cm less than the height predicted by the regression line 
with x = 48 months. (d) No; obviously, the linear trend will not continue until she 
is 40 years old. Our data were based only on the first 5 years of life and predictions 
should only be made for ages 0-5. 


T3.12 (a) The unusual point is the one in the upper righthand corner with isotope 
value about —19.3 and silicon value about 345. This point is unusual in that it has 
such a high silicon value for the given isotope value. (b) (i) If the point were 
removed, the correlation would get closer to —1, because it does not follow the 
linear pattern of the other points. (ii) Because this point is “pulling up” the line on 
the right side of the plot, removing it will make the slope steeper (more negative) 
and make the y intercept smaller. Note that the y-axis is to the right of the points in 
the scatterplot. (iii) Because this point has a large residual, removing it will make 
the size of the typical residual (s) a little smaller. 


T3.13 (a) Yes; because there is no obvious leftover pattern in the residual plot, a 
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linear model is appropriate for describing the relationship between wildebeest 
abundance and percent of grass area burned. (b) yA=92.29-0.05762x, 

y = 92.29 — 0.057622, where x represents the number of wildebeest and yA 
represents the predicted percent of the grass burned. (c) The slope = —-0.05762. 
Interpretation: The predicted percent of grassy area burned decreases by about 
0.058 percent for each additional 1000 wildebeest. The y intercept does not have 
meaning in this context, as making a prediction for 0 wildebeest is a big 
extrapolation. It is impossible to know what would happen going from some 
wildebeest to no wildebeest. (d) The value of s = 15.988%. Interpretation: The 
actual percentage of burned area is typically about 15.988% away from the percent 
predicted by the least-squares regression line with x = number of wildebeest 
(1000s). The value of r? = 64.6%. Interpretation: About 64.6% of the variability in 
percentage of burned area is accounted for by the least-squares regression line with 
x = number of wildebeest (1000s). 


Chapter 4 

Section 4.1 

Answers to Check Your Understanding 

page 225: 1. Convenience sample; this could lead him to overestimate the quality 
if the farmer puts the best oranges on top or if the oranges at the bottom of the crate 
are damaged from the weight on top of them. 

2. Voluntary response sample; those who are happy that the United Nations has its 
headquarters in the U.S. already have what they want and so are less likely to 
worry about responding to the question. This means that the proportion who 
answered “No” in the sample is likely to be higher than the true proportion in the 
U.S. who would answer “No.” 

page 228: 1. Number the pieces of wood from 1 to 1000. Use the command 
randInt(1,1000) to select 5 different integers from 1 to 1000. Inspect the 
corresponding 5 pieces of wood. 

2. Number the pieces of wood from 000 to 999. Move along a line of random digits 
from left to right, reading three-digit numbers, until 5 different numbers between 
000 and 999 have been selected. Inspect the corresponding 5 pieces of wood. 

page 232: 1. Because the quality of the pencils might be the same within each 
shift, but differ across shifts, use shifts as strata. At the end of each 8-hour shift, 
label all the pencils produced during that shift from 1 to N, where N is the total 
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number of pencils produced on that shift. Generate 100 different random integers 
from 1 to N and select those pencils for inspection. 


2. The boxes of pencils could be used as a cluster because it would be relatively 
easy to select boxes. At the end of the day, label all the boxes of pencils 1 to N, 
where N is the number of boxes produced that day. Generate 30 different random 
integers from 1 to N and inspect all the pencils in the selected boxes. 

3. Stratified: We are guaranteed to inspect 100 pencils from each of the three 
shifts. This will lead to a more precise estimate of overall quality if quality is 
consistent within each shift but differs across the three shifts. Cluster: Simplifies 
the sampling process. Rather than having to label every pencil produced, only the 
boxes of 10 pencils would need to be labeled. 

page 235: 1. (a) Undercoverage (b) Nonresponse (c) Convenience 

2. The estimate of 84% is greater than the percent of all people in the population 
who would oppose banning disposable diapers. By making it sound as if they are 
not a problem in the landfill, this question will result in fewer people suggesting 
that we should ban disposable diapers. 


Answers to Odd-Numbered Section 4.1 Exercises 


4.1 Population: The 1000 envelopes stuffed during a given hour. Sample: The 40 
randomly selected envelopes. 


4.3 Population: All local businesses. Sample: The 73 businesses that return the 
questionnaire. 

4.5 (a) A convenience sample. (b) The estimate of 7.2 hours is probably less than 
the true average because students who arrive first to school had to wake up earlier 
and may have gotten less sleep than those students who are able to sleep in. 


4.7 Voluntary response sample; it is likely that those customers who volunteered to 
leave reviews feel strongly about the hotel, often due to a negative experience. As a 
result, the 26% from the sample is likely greater than the true percentage of all the 
hotel’s customers who would give the hotel 1 star. 

4.9 Voluntary response sample; it is likely that the true proportion of constituents 
who oppose the bill is less than 871/1128. 


4.11 (a) Number the 40 students from 01 to 40 alphabetically. Moving left to right 
along a line from the random digit table, record two-digit numbers, skipping any 
numbers that are not between 01 and 40 and any repeated numbers, until you have 
5 different numbers between 01 and 40. Select the corresponding 5 students. (b) 
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Johnson (20), Drasin (11), Washburn (38), Rider (31), and Calloway (07). 


4.13 (a) Number the plots from 1 to 1410. Use the command randInt(1,1410) to 
select 141 different integers from 1 to 1410. Select the corresponding 141 plots. (b) 
Answers will vary. 

4.15 (a) It is not practical to identify and number every tree in the park. (b) Trees 
along the main road are more likely to be damaged by cars and people and may be 
more susceptible to infestation, leading to an overestimate. (c) The percentage is 
unlikely to be exactly 35% because of sampling variability. 


4.17 Stratified random sampling might be preferred in this context because the 
employees’ opinions might be the same within each type of employee (servers, 
kitchen staff), but differ across employee types. Using employee type as the strata 
will help provide a more precise estimate of the overall proportion who approve of 
the no-tipping policy. Select an SRS of 15 employees who are servers and 15 
employees who work in the kitchen to form the overall sample. 

4.19 No; in an SRS, each possible sample of 250 engineers is equally likely to be 
selected. However, the method described restricts the sample to having exactly 200 
males and 50 females. 


4.21 (a) Because satisfaction with the property is likely to vary depending on the 
location of the room, we should stratify by floor and view. From each floor, 
randomly select 2 rooms with each view. Using a stratified random sample would 
assure that the manager got opinions from each type of room and would provide a 
more precise estimate of customer satisfaction. (b) Using floors as clusters, survey 
the registered guest in every room on each of 3 randomly selected floors. This 
would be a simpler option because the manager would need to survey guests on 
only three floors instead of all over the hotel. 


4.23 (a) Cluster sampling (b) In an SRS, the company would have to visit 
individual homes all over the rural subdivision. With the cluster sampling method, 
the company has to visit only 5 locations, saving time and money. 

4.25 Students who do not live in the dorms cannot be part of the sample. Some 
would live off campus and therefore be less likely to eat on campus than those 
students who live in the dorm, so the director’s estimate for the percent of students 
who eat regularly on campus will likely be too high. 


4.27 People who did not lose much weight (or who gained weight) after 
participating in the course may be less likely to respond to the survey. This will 
likely produce an estimated weight loss that is too large, as the people who 
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responded to the survey probably lost more weight than those who did not respond. 
4.29 We would not expect many people to claim they have run red lights when 
they have not, but some people will deny running red lights when they have. Thus, 
the proportion of drivers obtained in the sample who admitted to running a red 
light is likely to be less than the proportion who have actually run a red light. 

4.31 When asked in person, the boys may claim that they have never cried during a 
movie (when in reality, they have) because they are embarrassed or ashamed to 
admit it to the girls asking the survey question. Boys who were given an 
anonymous survey are more likely to be honest about their experiences. 

4.33 (a) The wording is clear, but the question is slanted in favor of warming labels 
because the first sentence states that some cell-phone users have developed brain 
cancer. (b) The question is clear, but it is slanted in favor of national health 
insurance by asserting it would reduce administrative costs and not providing any 
counter-arguments. The phrase “do you agree” also pushes respondents toward the 
desired response. (c) Not clear; for those who do understand the question, it is 
slanted because it suggests reasons why one should support recycling. It could be 
rewritten as: “Do you support economic incentives to promote recycling?” 

4.35 c 

4.37 d 

4.39 d 

4.41 (a) The predicted number of points scored decreases by 4.084 points for each 
additional turnover. (b) The actual number of points scored is typically about 57.3 
points away from the number of points predicted by the least-squares regression 
line with x = number of turnovers. (c) y“=460.2—4.084(17)=390.772, 

y = 460.2 — 4.084(17) = 390.772, <9 the residual is y-y=238-390.772= 
~152.772.Y — ¥ = 238 — 390.772 = —152.772. The number of points scored by 
the San Francisco 49ers was 152.772 points less than predicted based on their 
number of turnovers. (d) Because their point falls below the least-squares 
regression line and is to the left of the mean number of turnovers, their point 
decreases the y-intercept and increases the slope, making the slope closer to 0 (i.e., 
less negative). Because the 49ers’ point is farther from the line than the rest of the 
points in the data set, this point increases the standard deviation of the residuals. 


Section 4.2 


Answers to Check Your Understanding 
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page 245: 1. Experiment because a treatment (brightness of screen) was imposed 
on the laptops. 


2. Observational; students were not assigned to eat a particular number of meals 
with their family per week. 

3. Explanatory: The number of meals per week teens ate with their families. 
Response: GPA (or some other measure of their grades). 


4. Students who have part-time jobs may not be able to eat many meals with their 
families and may not have much time to study, leading to lower grades. So we 
can’t conclude that not eating with their family is the cause. 

page 250: 1. The patients and the researchers know who is receiving which 
treatment. This knowledge could motivate some people to take other measures 
(e.g., exercising more or eating better in general) that would also influence their 
heart health. 


2. Use a double-blind experiment; the patients would not know which treatment 
they received, nor would the researchers know what treatment each patient 
received. 


page 255: 1. A control group would show how much electricity customers tend to 
use naturally. This would serve as a baseline to determine how much less 
electricity is used in each of the treatment groups. 

2. Number the houses from 1 to 60. Write the numbers 1 to 60 on slips of paper. 
Shuffle well. Draw out 20 slips of paper (without replacement). Those households 
will receive a display. Draw out another 20 slips of paper (without replacement). 
Those households will receive a chart. The remaining 20 households will receive 
only information about energy consumption. 

3. To create groups of households that are roughly equivalent at the beginning of 
the experiment; this will ensure that the effects of other variables (e.g., the thrifty 
inclination of some households) are spread evenly among the three groups. 

page 261: 1. Number the volunteers 1 to 300. Use a random number generator to 
produce 100 different random integers from 1 to 300 and show the first 
advertisement to the volunteers with those numbers. Generate 100 additional 
different random integers from 1 to 300 and show the second advertisement to the 
volunteers with those numbers. The remaining 100 volunteers will view the third 
advertisement. Compare the effectiveness of the advertisements for the three 
groups. 


2. Because the effectiveness of the ads may depend on a volunteer’s familiarity 
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with Jane Austen, block by whether or not the individuals are familiar with the 
works of Jane Austen. The volunteers in each block would be numbered and 
randomly assigned to one of three treatment groups: One-third of the volunteers in 
each block would view the first advertisement, one-third would view the second 
advertisement, and one-third would view the third advertisement. After viewing the 
advertisements, the researchers would gauge the effectiveness of the 
advertisements. 


3. A randomized block design accounts for the variability in effectiveness that is 
due to subjects’ familiarity with the works of Jane Austen. This makes it easier to 
determine the effectiveness of the three different advertisements. 


Answers to Odd-Numbered Section 4.2 Exercises 

4.43 Although eating seafood may decrease the risk of colon cancer, it is possible 
that the physicians who ate seafood were also more likely to exercise. Because 
exercise might decrease colon cancer risk, perhaps the exercise caused the decrease 
in colon cancer risk, not eating seafood. 


4.45 Explanatory: Type of program the people watched. Response: Number of 
calories consumed. This was an experiment because the treatments (20 minutes of 
The Island, 20 minutes of The Island without sound, and 20 minutes of Charlie 
Rose) were deliberately imposed on the students. 

4.47 (a) Explanatory: Amount of time in child care from birth to age 4%. 
Response: Adult ratings of their behavior. (b) This was an observational study 
because children were not assigned to attend child care for a prescribed amount of 
time. (c) No, this is an observational study, so we cannot make a cause-and-effect 
conclusion. For example, children who spend more time in child care probably 
have less time with their parents and get less instruction about proper behavior. 
4.49 Experimental units: Pine seedlings; Treatments: Full light, 25% light, and 5% 
light. 

4.51 (a) The factors are (1) information provided by interviewer, which had 3 
levels, and (2) whether the caller offered survey results, which had 2 levels. (b) 6 
treatments (c) Answers may vary. Here are two treatments: (1) giving name/no 
survey results; (2) identifying university/no survey results. 

4.53 Explanatory: Type of snack that was eaten (berries or candy). Response: 
Amount of pasta consumed (measured by the number of calories consumed). 
Experimental units: Women; Treatments: 65 calories of berries or 65 calories of 
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candy. 

4.55 The control group can be used to show how changes in pain and joint stiffness 
progress over the three years naturally. If a control group was not used, the 
treatment could possibly be deemed ineffective at improving the symptoms, when 
in reality it may be effective at preventing the symptoms from worsening. 

4.57 There was no control group. We do not know if this was a placebo effect or if 
the flavonols actually affected the blood flow. To make a cause-and-effect 
conclusion possible, we need to randomly assign some subjects to get flavonols 
and others to get a placebo. 


4.59 Yes; if the treatment (ASU or placebo) assigned to a subject was unknown to 
both the subject and those responsible for assessing the effectiveness of that 
treatment. If subjects knew they were receiving the placebo, their expectations 
would differ from those who received the ASU. Then it would be impossible to 
know if a decrease in pain was due to the difference in expectations or the ASU. It 
is important for the experimenters to be blind so that they will be unbiased in the 
way that they interact and assess the subjects. 

4.61 Because the experimenter knew which subjects had learned the meditation 
techniques, he or she is not blind. If the experimenter believed that meditation was 
beneficial, he or she may subconsciously rate subjects in the meditation group as 
being less anxious. 

4.63 (a) Write each name on a slip of paper, put them in a container and mix 
thoroughly. Pull out 40 slips of paper and assign these subjects to Treatment 1. 
Then pull out 40 more slips of paper and assign these subject to Treatment 2. The 
remaining 40 subjects are assigned to Treatment 3. (b) Assign the students 
numbers from 1 to 120. Using the command RandInt(1,120), generate 40 unique 
integers from 1 to 120, and assign the corresponding students to Treatment 1. Then 
generate an additional 40 unique integers from 1 to 120 and assign the 
corresponding students to Treatment 2. The remaining 40 students are assigned to 
Treatment 3. (c) Assign the students numbers from 001 to 120. Pick a spot on 
Table D and read off the first 40 unique numbers between 001 and 120. The 
students corresponding to these numbers are assigned to Treatment 1. The students 


corresponding to the next 40 unique numbers between 001 and 120 are assigned to 
Treatment 2. The remaining 40 students are assigned to Treatment 3. 


4.65 The coach’s plan does not include random assignment. Perhaps the more 
motivated players will choose the new method. If they improve more by the end of 
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the study, the coach cannot be sure if it was the exercise program or player 
motivation that caused the improvement. 


4.67 (a) Researchers used a design that compared infants who were assigned to one 
of three treatments. (b) Random assignment helps to create three groups of infants 
who are roughly equivalent at the beginning of the study. This ensures that the 
effects of other variables (e.g., genetics) are spread evenly among the three groups 
of infants. (c) Birthweight and whether or not the baby was born prematurely; it is 
beneficial to control these variables, because otherwise they would provide 
additional sources of variability that would make it harder to determine the 
effectiveness of the treatments. (d) Having about 17 infants in each group makes it 
easier to rule out the chance variation in random assignment as a possible 
explanation for the differences observed in intelligence, language, and memory by 
age 4. 

4.69 (a) Expense and condition of the patient; if a patient is in very poor health, a 
doctor might choose not to recommend surgery because of the added 
complications. Then, if the non-surgery treatment has a higher death rate, we will 
not know if it is because of the treatment or because the initial health of the 
subjects was worse. (b) Write the names of all 300 patients on identical slips of 
paper, put them in a hat, and mix them well. Draw out 150 slips and assign the 
corresponding subjects to receive surgery. The remaining 150 subjects receive the 
new method. At the end of the study, count how many patients survived in each 
group. 

4.71 (a) Because smaller trees are likely to have fewer oranges than medium or 
large trees, form blocks based on the size of the trees (small, medium, large). Then 
randomly assign one-third of the small trees to fertilizer A, one-third to fertilizer B, 
and one-third to fertilizer C. Do the same for the other two blocks. In the end, 
measure each tree’s orange production. (b) In a completely randomized design, the 
differences in tree size will increase the amount of variability in number of oranges 
for each treatment. However, a randomized block design would help us account for 
the variability in number of oranges that is due to the differences in tree size. This 
will make it easier to determine if one fertilizer is better than the others. 

4.73 (a) The blocks are the different diagnoses (e.g., asthma) because the 
treatments (doctor or nurse-practitioner) were assigned to patients within each 
diagnosis. (b) Using a randomized block design allows us to account for the 
variability due to differences in diagnosis by comparing the results within each 
block. In a completely randomized design, the variability due to differences in 
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diagnoses will be unaccounted for and will make it harder to determine if there is a 
difference in health and satisfaction due to the difference between doctors and 
nurse-practitioners. (c) Advantage: There would be no variability in the response 
variables introduced by differences in diagnosis. Disadvantage: We would only be 
able to make conclusions about health and satisfaction with their medical care for 
diabetic patients like the ones in the experiment. 


4.75 (a) If all rats from litter 1 were fed Diet A and if these rats gained more 
weight, we would not know if this was because of the diet or because of genetics 
and initial health. (b) For each litter, randomly assign half of the rats to receive 
Diet A and the other half to receive Diet B. Number the rats in the first litter from 1 
to 10. Use the command randInt(1,10) to select 5 different integers from 1 to 10. 
Select the corresponding 5 rats and assign them to Diet A. The other 5 rats in this 
litter will be assigned to Diet B. Repeat the same process for the second litter of 
rats. (c) Answers will vary. 


4.77 (a) Write the names of all 30 students on identical slips of paper, put them in a 
hat, and mix them well. Draw out 15 slips and assign the corresponding students to 
take the online SAT preparation program. The remaining 15 students will take the 
in-person SAT preparation class. At the end of the study, compare the average 
improvement in SAT score for each group. (b) Create pairs of similar students 
based on their previous SAT score (the two students with the highest SAT scores 
are paired together, and so on). Within each pair, one student is randomly assigned 
to the online preparation class and the other is assigned to the in-person preparation 
class. For each student, record the SAT improvement after receiving the treatment. 
(c) The matched pairs design is preferred. By matching students by initial score, 
we can account for the variability in improvement that is introduced by the 
variability in student ability, making it easier to determine which program is more 
effective. 


4.79 (a) This is a matched pairs design because each subject was assigned both 
treatments. (b) Some students are more distractible than others. In a completely 
randomized design, the differences between the students will add variability to the 
response variable, making it harder to detect if there is a difference caused by the 
treatments. In a matched pairs design, each student is compared with himself (or 
herself), so the differences between students are accounted for. (c) If all the 
students used the hands-free phone during the first session and performed worse, 
we would not know if the better performance during the second session was due to 
the lack of phone or to learning from their mistakes. By randomizing the order, 
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some students will use the hands-free phone during the first session and others will 
use it during the second session. (d) The simulator, route, driving conditions, and 
traffic flow were all kept the same for both sessions. That way, the researchers are 
preventing these variables from adding variability to the response variable. 


4.81 (a) If the students find a difference between the two groups, they will not 
know if it is due to gender or due to the deodorant. (b) Use a matched pairs design. 
In this case, each student would have one armpit randomly assigned to receive 
deodorant A and the other to receive deodorant B. Because each person uses both 
deodorants, there is no longer any confounding between gender and deodorant. 
Also, the paired design accounts for the variability between individuals, making it 
easier to see any difference in the effectiveness of the two deodorants. 

4.83 c 

4.85 b 

4.87 c 

4.89 b 

4.91 (a) (i) z = —0.23; 1 — 0.4090 = 0.5910 (ii) normalcdf(lower: 500, upper: 
10000, mean: 525, SD: 110) = 0.5899 (b) (i) Solving -1.28=x-525110 


—1.23 = = 
110 gives x = 384.2. (ii) invNorm(area: 0.10, mean: 525, SD: 110) = 


384.0 


Section 4.3 

Answers to Check Your Understanding 

page 277: Because the individuals were not randomly selected and this is an 
observational study and not an experiment, we can only conclude that, for the 
athletes in the study, those who were removed from play immediately recovered 
more quickly, on average, than athletes who continued to play. 


Answers to Odd-Numbered Section 4.3 Exercises 

4.93 (a) Because different random samples will include different students and 
produce different estimates, it is unlikely that the sample result will be the same as 
the proportion of all students at the school who use Twitter. (b) An SRS of 100 
students; estimates tend to be closer to the truth when the sample size is larger. 
4.95 (a) Yes; it is plausible that the true proportion could be as small as 0.37 — 
0.031 = 0.339 or as large as 0.37 + 0.031 = 0.401 and 0.50 is not in this interval. 
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(b) Increase the number of adults in the sample. 


4.97 (a) If we repeatedly take random samples of size 124 from a population of 
couples that have no preference for which way they kiss, the number of couples 
who kiss the “right way” in a sample varies from about 45 to 76. (b) Yes; in the 
study, 83 couples kissed the “right way”—much higher than what we would expect 
to happen by chance alone. In the simulations, the largest number of couples 
kissing the “right way” was 76. 

4.99 (a) 4.68 — 4.21 = 0.47 days (b) Assuming the type of clipboard does not 
matter, there was one simulated random assignment where the difference (A — B) 
in the mean number of days for the two groups was 0.72. (c) Because a difference 
of means of 0.47 or higher occurred 16 out of 100 times in the simulation, the 
difference is not statistically significant. It is quite plausible to get a difference this 
big simply due to chance variation in the random assignment. 


4.101 (a) To make sure that the two groups were as similar as possible before the 
treatments were administered. (b) The difference in the percent of women who 
received acupuncture and became pregnant and those who lay still and became 
pregnant was large enough to conclude that the difference was not likely due to the 
chance variation created by the random assignment to treatments. (c) Because the 
women were aware of which treatment they received, we do not know if their 
expectations or the treatment was the cause of the increase in pregnancy rates. 


4.103 Because this study involved random assignment to the treatments (foster care 
or institutional care), we can infer that the difference in being in foster care or 
institutional care caused the difference in response. However, the results should 
only be applied to children like the ones in the study, because these 136 children 
were not randomly selected from a larger population. 


4.105 Because the subjects were not randomly assigned to attend religious services 
(or not), we cannot infer cause and effect. However, this study involved a random 
sample of adults so we can make an inference about the population of adults. It 
appears that adults who attend religious services regularly have a lower risk of 
dying, but we do not know that attending religious services is the cause of the 
lower risk. 

4.107 Because this study does not involve random assignment to the treatments 
(amount of anthocyanins consumed), we cannot infer that the difference in 
blueberry and strawberry intake caused the difference in heart attack risk. In 
addition, we should only apply the results to women like the ones in this study 
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because these 93,600 women were not randomly selected from a larger population. 
4.109 Answers will vary. 


4.111 Those Facebook users involved in the study did not know that they were 
going to be subjected to treatments, and they did not provide informed consent 
before the study was conducted. 


4.113 The responses to the GSS are confidential. The person giving the survey 
knows who is answering the questions because s/he is at that person’s home, but 
will not share the results with anyone else. 

4.115 In this case, the subjects were not able to give informed consent. They did 
not know what was happening to them and they were not old enough to understand 
the ramifications in any event. 


4.117 d 
4.119 (a) 
100 >= 
ea] Strongly 
80 + disagree 
: im Disagree 
3 60 fz] Neither 
7 “= 
2 40 - __| Agree 
|__} Strongly 
20 + agree 


Male Female 
Gender 


(b) Yes, there is an association between gender and opinion. Men are more likely 
to view animal testing as justified if it might save human lives: over two-thirds of 
men agree or strongly agree with this statement, compared to slightly less than half 
of the women. The percentages who disagree or strongly disagree tell a similar 
story: 16% of men versus 30% of women. 


Answers to Chapter 4 Review Exercises 


R4.1 (a) Population: All adult U.S. residents. Sample: The 805 adult U.S. residents 
interviewed. (b) Even though the sample size is very large, it is unlikely that the 
percentage in the entire population would be exactly the same as the percentage in 
the sample because of sampling variability. (c) A larger random sample is more 
likely to get a sample result close to the true population value. 

R4.2 (a) One possible answer: Announce in daily bulletin that there is a survey 
concerming student parking available in the main office for students who want to 
respond. Because generally only those who feel strongly about the issue respond to 
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a voluntary response survey, the opinions of respondents may differ from the 
population as a whole, resulting in an inaccurate estimate. (b) One possible answer: 
Personally interview a group of students as they come in from the parking lot. 
Because these students use the parking lot, their opinions may differ from the 
population as a whole, resulting in an inaccurate estimate. (c) Write the names of 
all 1800 students on identical pieces of paper. Place the slips of paper into a hat. 
Shuffle well. Draw out 50 names. Those students will form the SRS of 50 students 
from the school. (d) An SRS reduces bias by selecting the sample in such a way 
that every group of 50 individuals in the population has an equal chance to be 
selected as the sample. 


R4.3 (a) You would have to identify 10% of the seats, go to those seats in the 
arena, and find the people who are sitting there. (b) It would be better to use the 
lettered rows as the strata because each lettered row is the same distance from the 
court and so would contain only seats with the same (or nearly the same) ticket 
price. Within sections, ticket prices vary quite a bit. (c) Survey all fans in several 
randomly selected sections (clusters); people in a particular numbered section are 
in roughly the same location, making it easy to administer the survey. Furthermore, 
the people in each cluster reflect the variability found in the population, which is 
ideal. 

R4.4 (a) When the interviewer provides the additional information that “box-office 
revenues are at an all-time high,” the listeners may believe that they contributed to 
this fact and be more likely to overestimate the number of movies they have seen 
in the past 12 months. Eliminate this sentence. (b) A sample that uses only 
residential phone numbers is likely to underrepresent younger adults who use only 
cell phones. If younger adults go to movies more often than older adults, the 
estimated mean will be too small. (c) People who do not go to the movies very 
often might be more likely to respond to the poll because they are at home. 
Because the frequent moviegoers will not be at home to respond, the estimated 
mean will be too small. 

R4.5 (a) The data were collected after the anesthesia was administered. Hospital 
records were used to “observe” the death rates, rather than imposing different 
anesthetics on the subjects. (b) Explanatory: Type of anesthetic. Response: 
Whether or not a patient died. (c) One variable that might be confounded with 
choice of anesthetic is type of surgery. If anesthesia C is used more often with a 
type of surgery that has a higher death rate, we would not know if the death rate 
was higher because of the anesthesia or the type of surgery. 
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R4.6 (a) Experimental units: Potatoes. The factors are the storage method (3 
levels) and time from slicing until cooking (2 levels). There are six treatments: (1) 
freshly picked and cooked immediately, (2) freshly picked and cooked after an 
hour, (3) stored at room temperature and cooked immediately, (4) stored at room 
temperature and cooked after an hour, (5) stored in refrigerator and cooked 
immediately, (6) stored in refrigerator and cooked after an hour. (b) Using 300 
identical slips of paper, write “1” on 50 of them, write “2” on 50 of them, and so 
on. Put the papers in a hat and mix well. Then select a potato and randomly select a 
slip from the hat to determine which treatment that potato will receive. Repeat this 
process for the remaining 299 potatoes, making sure not to replace the slips of 
paper into the hat. (c) Benefit: The quality of the potatoes should be fairly 
consistent, reducing a source of variability. Drawback: The results of the 
experiment could then only be applied to potatoes that come from that one supplier 
rather than to potatoes in general. (d) Use a randomized block design with the 
suppliers as the blocks. For each supplier, randomly assign potatoes to the 6 
treatments. Doing so would allow the researchers to account for the variability in 
color and flavor due to differences in the initial quality of the potatoes from 
different suppliers, making it easier to estimate how the treatments affect color and 
flavor of the French fries. 


R4.7 (a) No; the 1000 students were not randomly selected from any larger 
population, so we should apply the results only to students like those in the study. 
(b) Yes; the students were randomly assigned to the three treatments, so we can 
conclude that the reduction in cold symptoms was caused by the masks. 

R4.8 (a) If all of the patients received the St. John’s wort, the researchers would 
not know if any improvement was due to the St. John’s wort or to the expectations 
of the subjects (the placebo effect). By giving some patients a treatment that should 
have no effect at all but that looks, tastes, and feels like the St. John’s wort, the 
researchers can account for the placebo effect by comparing the results for the two 
groups. (b) To create two groups of subjects that are roughly equivalent at the 
beginning of the experiment. (c) The subjects should not know which treatment 
they are getting so that the researchers can account for the placebo effect. Also, the 
researchers should be unaware of which subjects received which treatment so they 
cannot consciously (or subconsciously) influence how the results are measured. (d) 
In this context, “not statistically significant” means that the difference in 
improvement between the St. John’s wort and placebo groups was not large 
enough to rule out the variability caused by the random assignment as the 
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explanation. 

R4.9 (a) Use 30 identical slips of paper and write the name of each subject on a 
slip. Mix the slips in a hat and select 15 of them at random. These subjects will be 
assigned to Group 1. The remaining 15 will be assigned to Group 2. After the 
experiment, compare the time estimates of Group 1 with those of Group 2. (b) 
Each student does the activity twice, once with the easy mazes, and once with the 
hard mazes. For each student, randomly determine which set of mazes is used first. 
To do this, flip a coin for each subject. If it’s heads, the subject will do the easy 
mazes followed by the hard mazes. If it’s tails, the subject will do the hard mazes 
followed by the easy mazes. After the experiment, compare each student’s “easy” 
and “hard” time estimate. (c) The matched pairs design would be more likely to 
detect a difference because it accounts for the variability between subjects. 

R4.10 (a) No, because the subjects did not know the nature of the experiment 
before they agreed to participate. (b) All individual data should be kept 
confidential and the experiment should go before an institutional review board 
before being implemented. 


Answers to Chapter 4 AP® Practice Test 

T4.1¢ 

T4.2e 

T4.3 d 

T4.4c 

T4.5d 

T4.6e 

T4.7 a 

T4.8 d 

T4.9 d 

T4.10e 

T4.11 d 

T4.12 (a) Experimental units: Acacia trees. (b) This allows the researchers to 
measure the effect of active hives and empty hives on tree damage compared to no 
hives at all. (c) Assign the trees numbers from 01 to 72 and use a random number 
table to pick 24 different two-digit numbers between 01 and 72. Those trees will 
get the active beehives. The trees associated with the next 24 different two-digit 


1945 


numbers will get the empty beehives and the remaining 24 trees will remain empty. 
Compare the damage caused by elephants to the trees with active beehives, those 
with empty beehives, and those with no beehives. 

T4.13 (a) Population: All U.S. students in grades 7-12. Sample: The 1673 U.S. 
students in grades 7—12 surveyed. (b) Random selection reduces the effects of bias 
due to self-selection and also allows the results to be inferred to a larger 
population. In this case, students who respond to an online survey might be more 
interested in computer science, leading to an estimate that is too high. (c) No, 
because of sampling variability. (d) The estimate of 54% because it was based on a 
sample size of 1673, whereas the other two results were based on smaller sample 
sizes. Larger sample sizes should yield results that are closer to the truth about the 
population. 

T4.14 (a) Explanatory: The treatment given (caffeine capsule or placebo). 
Response: The time it takes to complete the tapping task. (b) Each of the 11 
individuals will be a block in a matched pairs design. Each participant will take the 
caffeine tablets on one of the two-day sessions and the placebo on the other. The 
blocking was done to account for individual differences in dexterity. (c) After the 
first trial, subjects might practice the tapping task and do better the second time. If 
all the subjects got caffeine the second time, the researchers would not know if the 
increase was due to the practice or the caffeine. (d) Yes, if neither the subjects nor 
the people who come in contact with them during the experiment (including those 
who record the number of taps) have knowledge of the order in which the caffeine 
or placebo was administered. 


Answers to Cumulative AP® Practice Test 1 
AP1l.1d 
AP1.2e 
AP1.3b 
AP1.4c¢ 
AP1.5e 
AP1.6c 
AP1.7 e 
AP1.8 e 
AP1.9d 
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AP1.10d 
AP1.11d 
AP1,12 b 
AP1.13a 
AP1.14a 


AP1.15 (a) The distribution of gains for subjects using Machine A is roughly 
symmetric, while that for subjects using Machine B is skewed to the left (toward 
the smaller values). Neither distribution appears to contain any outliers. The center 
of the distribution of gains for subjects using Machine B (median = 38) is greater 
than that for subjects using Machine A (median = 28). The distribution of gains for 
subjects using Machine B (range = 57, IQR = 22) is more variable than that for 
subjects using Machine A (range = 32, IQR = 15). (b) Machine B because its 
median gain (38) is greater than it is for Machine A (28), as is the mean 

(x B=35.4 versus x A=28.9%B = 39.4 versus % 4 = 28.9) (¢) Machine A 
because it exhibits less variation in gains than does Machine B. The IQR for 
Machine A (15) is less than the JQR for Machine B (22). Additionally, the standard 
deviation for Machine A (9.38) is less than the standard deviation for Machine B 
(16.19). (d) The experiment was conducted at only one fitness center. Results may 
vary at other fitness centers in this city and in other cities. If the company wants to 
broaden their scope of inference, they should randomly select people from the 
population they would like to draw an inference about. 


AP1.16 (a) Randomly assign 30 retail sales districts to the monetary incentives 
treatment and the remaining 30 retail sales districts to the tangible incentives 
treatment [see part (b) for method]. After a specified period of time, record the 
change in sales for each district and compare the mean change for each of the 
treatment groups. (b) Number the 60 retail sales districts with a two-digit number 
from 01 to 60. Using a table of random digits, read two-digit numbers until 30 
unique numbers from 01 to 60 have been selected. These 30 districts are assigned 
to the monetary incentives group and the remaining 30 to the tangibles incentives 
group. Using the digits provided, the districts labeled 07, 51, and 18 are the first 
three to be assigned to the monetary incentives group. (c) Matching the districts 
based on their size accounts for the variation among the experimental units due to 
their size on the response variable—sales volume. Pair the two largest districts in 
size, the next two largest, down to the two smallest districts. For each pair, pick 
one of the districts and flip a coin. If it’s heads, this district is assigned to the 
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monetary incentives group. If it’s tails, this district is assigned to the tangible 
incentives group. The other district in the pair is assigned to the other group. After 
a specified period of time, record the change in sales for each district and compare 
within each pair. 


AP1.17 (a) There is a very strong, positive linear association between sales and 
shelf length. (b) y\=317.94+152.68x,Y = 317.94 + 152.682, where yAY = 
predicted weekly sales (in dollars) and x = shelf length (in feet). (c) 
y\=317.94+152.68(5)=1081.349 = 317.94 + 152.68 (5) = 1081.34 (4) The 
actual weekly sales (in dollars) is typically about $22.9212 away from the weekly 
sales predicted by the least-squares regression line with x = shelf length (in feet). 
(e) About 98.2% of the variation in weekly sales revenue can be accounted for by 
the least-squares regression line with x = shelf length (in feet). 

AP1.18 (a) (i) z=-5-022.92=-0.22 and z=5-022.92=0.22; 

z= = = -0.22 and z= 2 = 0.22; 


22.92 22.92 


he hae hs ake 


0.5871 — 0.4219 = 0.1652 (ii) 
normalcdf(lower: —5, upper: 5, mean: 0, SD: 22.92) = 0.1727 (b) (i) Solving 


_ z-0 
~1,96=x-022.92 22.92 gives x = -$44.92. Solving 1.96=x—022.92 


Loe 
6 = 3799 gives x = $44.92. (ii) invNorm(area: 0.025, mean: 0, SD: 22.92) = — 


$44.92 and invNorm(area: 0.975, mean: 0, SD: 22.92) = $44.92 

The middle 95% of residuals should be between —$44.92 and $44.92. 

If 5 linear feet are allocated to the store’s brand of men’s grooming products, the 
weekly sales revenue can be expected to be between $1036.08 and $1125.92 (1081 
+ 177; 44.92). 


Chapter 5 

Section 5.1 

Answers to Check Your Understanding 

page 304: 1. Interpretation: If you take a very large random sample of Pedro’s 
commutes, about 55% of the time the light will be red when Pedro reaches the 
light. 

2. (a) This probability is 0. If an outcome can never occur, then it will occur in 0% 
of the trials. (b) This probability is 1. If an outcome will occur on every trial, then 
it will occur in 100% of the trials. (c) This probability is 0.001. An outcome that 
occurs in 0.1% of the trials is very unlikely, but will occur every once in a while. 


1948 


(d) This probability is 0.6. An outcome that occurs in 60% of the trials will happen 
more than half the time. Also, 0.6 is a better choice than 0.99 because the wording 
suggests that the event occurs often but not nearly every time. 


3. The doctor is wrong because the sex of the next baby born is a random 
phenomenon that is unpredictable in the short run, even though it has a regular 
predictable pattern in the long run. So while approximately 50% of all babies born 
will be male, even after a couple has seven girls in a row, the probability of the 
eighth child being a girl is still 50%. 

page 307: 1. To carry out a simulation to estimate the probability that a 50% 
shooter who takes 30 shots in a game would have a streak of 10 or more shots, 
begin by assigning digits to the outcomes. Let 1 = make a shot and let 2 = miss a 
shot. Generate 30 random integers from 1 to 2 to simulate taking 30 shots. Record 
whether or not there are a series of at least 10 “makes” in a row among the 30 
shots. Perform many trials of this simulation. See what percent of the time there is 
a streak of at least 10 “makes” among the 30 shots. 

2. There were two trials that resulted in 9 consecutive “makes” among the 30 shots. 
3. Based on the dotplot, there was only 1 trial out of 50 simulated games in which 
the player had at least 10 “makes” in a row among the 30 shots. If the player’s shot 
percentage truly is 50% and if the result of a shot truly does not depend on 
previous shots, then in 50 trials we would expect this player to have a streak of 10 
or more makes only about 2% of the time (1 out of 50). Because this phenomenon 
is unlikely to happen strictly due to chance alone, and the announcer observed the 
player making 10 shots in a row in a recent game, the announcer is justified in 
claiming that the player is streaky. 


Answers to Odd-Numbered Section 5.1 Exercises 

5.1 (a) If you take a very large random sample of times that Aaron tunes into his 
favorite radio station, about 20% of the time a commercial will be playing. (b) No, 
chance behavior is unpredictable in the short run, but has a regular and predictable 
pattern in the long run. The value 0.20 describes the proportion of times that a 
commercial will be playing when Aaron tunes in to the station in a very long series 
of trials. 


5.3 (a) If you take a very large random sample of women with breast cancer, about 
10% of the time the mammogram will indicate that the woman does not have 
breast cancer when, in fact, she does have breast cancer. (b) A false negative is a 
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more serious error because a woman with breast cancer will not get potentially life- 
saving treatment. A false positive would result in temporary stress until a more 
thorough examination is performed. 


5.5 In the short run, there was quite a bit of variability in the percentage of made 3- 
point shots. However, the percentage of made 3-point shots became less variable 
and approached 0.30 as the number of shots increased. 

5.7 (a) The wheel is not affected by its past outcomes—it has no memory. So on 
any one spin, black and red remain equally likely. (b) The gambler is wrong again. 
Removing a card changes the composition of the remaining deck. If you hold 5 red 
cards, the deck now contains 5 fewer red cards, so the probability of being dealt 
another red card decreases. 


5.9 (a) To carry out a simulation to estimate the score that Luke will earn on the 
quiz, begin by assigning digits to the outcomes. For each question, let 1 = guessed 
correctly and let 2, 3, and 4 = guessed incorrectly. Generate 10 random integers 
from 1 to 4 to simulate the result of Luke’s guess for each of the 10 questions. 
Record the number of correct guesses (1’s). Repeat this three times and record the 
highest of the three scores. (b) There was one trial that resulted in a highest quiz 
score of 1 correct response out of 10. (c) Based on the simulation, the probability 
that Luke passes the quiz (scores at least a 6 out of 10) is 5/50 = 10%. Five of the 
50 trials show a score of at least 6. (d) Yes; it is unlikely that Doug would score 
higher than 6 out of 10 based on randomly guessing the answers. Since Doug 
scored 8 points out of 10, there is reason to believe that Doug does understand 
some of the material. 

5.11 (a) To use a table of random digits to carry out the simulation, number the 
first class passengers 01-12 and the other passengers 13—76. Ignore all other 
numbers. Moving left to right across a row, look at pairs of digits until you have 10 
unique numbers (no repetitions, because you do not want to select the same person 
twice). Count the number of two-digit numbers between 01 and 12. Perform many 
trials of this simulation. Determine what percent of the trials showed no first class 
passengers selected for screening. (b) The numbers, read in pairs, are: 71 48 70 99 
84 29 07 71 48 63 61 68 34 70 52. The bold numbers indicate people who have 
been selected. The other numbers are either too large (over 76) or have already 
been selected. There is one person among the 10 selected who is in first class in 
this sample (underlined). (c) No, there is not convincing evidence that the TSA 
officers did not carry out a truly random selection. If the selection is truly random, 
there is about a 15% chance that no one in first class will be selected. So it is not 
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surprising that a single random selection would contain no first class passengers. 


5.13 (a) To carry out this simulation using slips of paper, label 10 identically sized 
pieces of paper 0—9. Let 0-5 represent hitting the center of the target and let 6—9 
represent not hitting the center of the target. Place the slips of paper into a hat and 
mix well. Draw out one slip of paper at a time and record the digit. Replace the 
paper and mix well. Repeat this process until Quinn misses the center of the target. 
Determine whether she remained in the competition for at least 10 shots. (b) To 
carry out this simulation using a random digits table, let 0-5 represent hitting the 
center of the target and let 6—9 represent not hitting the center of the target. Moving 
left to right across a row, read single digits and record whether she hit the center of 
the target or not. Repeat this process until Quinn misses the center of the target. 
Determine whether she remained in the competition for at least 10 shots. (c) To 
carry out this simulation using a random number generator, let 0-5 represent 
hitting the center of the target and let 6—9 represent not hitting the center of the 
target. Generate a random integer from 0 to 9 and record whether she hit the center 
of the target or not. Repeat this process until Quinn misses the center of the target. 
Determine whether she remained in the competition for at least 10 shots. 

5.15 This is a legitimate simulation. By letting 1 = feel addicted and 2 = doesn’t 
feel addicted, the simulation would model the claim that 50% of U.S. teens with 


smartphones feel addicted to their devices. 


5.17 This is not a legitimate simulation. When simulating the selection of 
homework assignments, the selection process would occur without replacement. 
The slips of paper should not be placed back in the hat. 

5.19 To conduct a simulation to estimate the probability that we would have to 
randomly select 20 or more U.S. adult males to find one who is red-green color- 
blind, let O—6 = color-blind and let 7—99 = not color-blind. Use technology to pick 
an integer from 0 to 99. Continue picking integers until we get a number between 0 
and 6. Count how many numbers there are in the sample. Repeat this process many 
times. Determine the percent of the time we would have to randomly select 20 or 
more U.S. adult males to find one who is red—green color-blind. 

5.21 (a) In the simulation, 43 of the 200 samples yielded a sample proportion of at 
least 0.55. Obtaining a sample proportion of 0.55 or higher is not particularly 
unusual when 50% of all students recycle. (b) Only 1 of the 200 samples yielded a 
sample proportion of at least 0.63. This means that if 50% of all students recycle, 
we would see a sample proportion of at least 0.63 in only about 0.5% of samples. 
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Because getting a sample proportion of at least 0.63 is very unlikely, we have 
convincing evidence that the percentage of all students who recycle is larger than 
50%. 

5.20 °C 

5.25 b 

5.27 C 

5.29 (a) This survey displays undercoverage because people who are younger than 
50 years old did not have a chance to participate in the survey. Younger people 
generally don’t have as many prescription drugs and might be less willing than 
older people to support a program like this. The 75% who answered “Yes” is likely 
an overestimate of the true proportion of Americans who support the program. (b) 
Including the additional information of “can be helped” might have encouraged 
respondents to say “Yes” to the program because they liked the idea of helping 
people. This would cause the 75% estimate to be higher than if the question were 
worded more neutrally. 


Section 5.2 

Answers to Check Your Understanding 

page 318: 1. Events A and B are mutually exclusive because a person cannot have 
a cholesterol level of both 240 or above and between 200 and less than 240 at the 
same time. 

2. The event “A or B” is a person who has either a cholesterol level of 240 or 
above or they have a cholesterol level between 200 and less than 240. P(A or B) = 
P(A) + P(B) = 0.16 + 0.29 = 0.45 

3. P(C) = 1 —- P(A or B) = 1 - 0.45 = 0.55 

page 321: 1. The probability that the person is an environmental club member is 
(212 + 77 + 16)/1526 = 305/1526 = 0.1999 = 19.99%. 

2. P(not a snowmobile renter) = (445 + 212 + 297 + 16)/1526 = 970/1526 = 0.6356 
= 63.56%. 

3. P(environmental club member and not a snowmobile renter) = (212 + 16)/1526 
= 228/1526 = 0.1494 = 14.94%. 

4. The probability that the person is not an environmental club member or is a 
snowmobile renter is 85.06%, which is (445 + 497 + 279 + 77)/1526 = 1298/1526 
= 0.8506 = 85.06%. 
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Answers to Odd-Numbered Section 5.2 Exercises 


5.31 (a) The following table shows the possible outcomes in the sample space. 
1 


Each of the 16 outcomes will be equally likely and have probability 116. 16° 


First roll 


1 (1,1) (2,1) (3,1) (4,1) 

Second 2 (1,2) (2,2) (3,2) (4,2) 
roll 3 (1,3) (2,3) (3,3) (4,3) 

4 (1,4) (2,4) (3,4) (4,4) 


(b) P(A) = 0.25. There are four ways to get a sum of 5 from these two dice: (1, 4), 
(2, 3), (3, 2), (4, 1). The probability of getting a sum of 5 is P(A)=416=0.25. 
P(A) = 4 = 0.25. 


5.33 (a) The sample space is: Connor/Declan, Connor/Lucas, Connor/Piper, 
Connor/Sedona, Connor/Zayne, Declan/Lucas, Declan/Piper, Declan/Sedona, 
Declan/Zayne, Lucas/Piper, Lucas/Sedona, Lucas/Zayne, Piper/Sedona, 
Piper/Zayne, and Sedona/Zayne. Each of these 15 outcomes will be equally likely 
and will have probability 1/15. (b) There are 9 outcomes in which Piper or Sedona 
(or both) get to go to the show: Connor/Piper, Connor/Sedona, Declan/Piper, 
Declan/Sedona, Lucas/Piper, Lucas/Sedona, Piper/Sedona, Piper/Zayne, 
Sedona/Zayne. Define Event A as Piper or Sedona (or both) get to go to the show. 
Then P(A) = 9/15 = 0.60. 

5.35 (a) This is a valid probability model because each probability is between 0 
and 1 and the probabilities sum to 1. (b) P(student won’t win extra homework) = 1 
— 0.05 = 0.95. There is a 0.95 probability that a student will not win extra 
homework. (c) P(candy or homework pass) = 0.25 + 0.15 = 0.40. There is a 0.40 
probability that a student wins candy or a homework pass. 


5.37 (a) The probabilities of all the possible outcomes must add to 1. The given 
probabilities add up to 0.25 + 0.32 + 0.07 + 0.03 + 0.01 = 0.68. This leaves a 
probability of 1 — 0.68 = 0.32 for P(3 or 4). Because the probability of finding 3 
people in a household is the same as the probability of finding 4 people (and they 
are mutually exclusive), each probability must be 0.32/2 = 0.16. (b) P(more than 2 
people) = 1 — P(1 or 2 people) = 1 — (0.25 + 0.32) = 0.43. This could also be found 
using the addition rule for mutually exclusive events. P(more than 2 people) = P(3 
or 4 or 5 or 6 or 7+) = 0.16 + 0.16 + 0.07 + 0.03 +0.01 = 0.43. 
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5.39 (a) The given probabilities have a sum of 0.72 and the sum of all probabilities 
should be 1. Thus, the probability that a randomly chosen young adult has some 
education beyond high school but does not have a bachelor’s degree is 1 — 0.72 = 
0.28. There is a 0.28 probability that a young adult has some education beyond 
high school but does not have a bachelor’s degree. (b) Using the complement rule, 
P(at least a high school education) = 1 — P(has not finished high school) = 1 — 0.13 
= 0.87. There is a 0.87 probability that a young adult has completed high school. 
(c) P(young adult has further education beyond high school) = P(young adult has 
some education beyond high school but does not have a bachelor’s degree) + 
P(young adult has at least a bachelor’s degree) = 0.28 + 0.30 = 0.58. The 
probability rule used is the addition rule for mutually exclusive events. 


5.41 (a) P(B°) = P(student does not eat breakfast regularly) = 295595=0.496 


295 __ : 

a (b) P(F and B®) = P(student is a female and does not eat breakfast 
165 

regularly) = 165595=0.277.595 tat Interpretation: If we select a student 


from the school at random, the probability that the student will be female and does 


not eat breakfast regularly is 0.277. (c) P(F and B®) = P(student is a female or does 
110+165+130 


= 0.681 
not eat breakfast regularly) = 110+165+130595=0.681 595 
5.43 (a) P(not age 18 to 34 and not own an iPhone) = 
189+100+277+643 _ 1209 _ gq row 
189+100+277+6432024=12092024=0.597 2024 2024 ; 
(b) P(age 18 to 34 or owns iPhone) = 
169+214+134+1714+1272024=8152024=0.403 
1694+214413441714+127 815 
— -2024C*~=“‘“C;*‘«éC 0.403 
5.45 (a) 
Black Not black Total 
Even 10 10 20 
Not even 8 10 18 
Total 18 20 38 


(b) P(B)=1838=0.474;P(E)=2038=0.526 


= 4: mer gee 
P(B) = go ee PLE) ane (c) The event “B and E” would be that 


the ball lands in a spot that is black and even. P(B and E)=1038=0.263 


_ 10 _ 9 9¢: 
P(BandE) = 55 = 0.263 (d) The probability of the event “B or E” means the 
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probability of landing in a spot that is either black, even, or both. If we add the 
probabilities of landing in a black spot and landing in an even spot, the spots that 
are black and even will be double counted because events B and E are not mutually 
exclusive. P(B and E)=1838+2038-1038=2838=0.737. 


P(BandE) = = + 2 - 2 = = = 0.737. 


5.47 P(own a dog or a cat) = P(own a dog) + P(own a cat) — P(own a dog anda 
cat) = 0.40 + 0.32 — 0.18 = 0.54 

5.49 (a) We are given the following information: P(M) = 0.6, P(G©) = 0.67, P(G 
and M) = 0.23. Therefore, P(G) = 1 — 0.67 = 0.33. P(G U M) = P(G) + P(M) - 
P(G n M) = 0.33 + 0.6 — 0.23 = 0.70. Interpretation: The probability of randomly 
selecting a student from among those who were part of the census that is a graduate 
student or uses a Mac is 0.70. (b) PGS n M°) =1- P(G U M)=1-0.7=0.3 


5.51 (a) A Venn diagram is shown here. 


= = 
yy re % 
| 022 0.18 oa | 
a ae 


(b) P(D n C°) = 0.22 

5.53 The general addition rule states that P(A U B) = P(A) + P(B) - P(A 1 B). 
We know that P(A) = 0.3, P(B) = 0.4, and P(A U B) = 0.58. Therefore, 0.58 = 0.3 
+ 0.4 - P(A n B). Solving for P(A n B) gives P(A n B) = 0.12. 


Bae 
aver 


5.59 (a) The scatterplot for the average crawling age and average temperature is 
given here. 


Average crawling age 
(weeks) 


30 40 50 60 70 

Average temperature (°F) 
In this scatterplot, there appears to be a moderately strong, negative linear 
relationship between average temperature and average crawling age. (b) The 
equation for the least-squares regression line is y\=35.68-0.0777x, 
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y = 35.68 — 0.07772, where yy is the predicted average crawling age and x is 
the average temperature in degrees Fahrenheit. (c) The slope = —0.0777. 
Interpretation: The predicted average crawling age decreases by 0.0777 weeks for 
each additional 1-degree increase in the average temperature in degrees Fahrenheit. 
(d) We cannot conclude that warmer temperatures six months after babies are born 
causes them to crawl sooner because this was an observational study and not an 
experiment. We cannot draw conclusions of cause and effect from observational 
studies. 


Section 5.3 


Answers to Check Your Understanding 


P(NIE) = 395 = 0.695. Con that a survey 


page 334: 1. P(N|E)=212305=0.695. 
respondent belongs to an environmental organization, there is about a 69.5% 
chance that he or she never used a snowmobile. 


2. P(E|SC)=212+77657+574=2891231=0.235 


657+574 1231 


3. P(EC|N)=445657=0.677. P(EC|renter)=497574=0.866. 
P(ECljowner)=279295=0.946. 
P(E°|N) = = = 0.677. P(E® |renter) = “ = 0.866. P(E° |owner) = 


574 


279 


295 


= 0.946. 


The chosen person is more likely to not be an environmental club member if he or 
she owns a snowmobile. 

page 337: 1. Events A and B are independent. Because we are putting the first card 
back and shuffling the cards before drawing the second card, knowing what the 
first card was will not help us predict what the second card will be. 

2. Events A and B are not independent. Once we know the suit of the first card, 
then the probability of getting a heart on the second card will change depending on 
what the first card was. 

3. The two events, “Female” and “Got an A,” are independent. Once we know that 
the chosen person is female, this does not help us predict if she got an A or not. 
Overall, 4/28 or 1/7 of the students got an A on the quiz. And, among the females, 
3/21 or 1/7 got an A. So P(got an A) = P(got an A | female). 

page 343: 1. 
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oe Desktop 


a 
California <—:30 Laptop 


Qg5~ Tablet 


Computers eo 
gi Desktop 


sf 


4s 
A 


3 
rs 

Texas < = Laptop 
4 


™~ 
2>~ Tablet 


2. P(tablet) = P(California n tablet) + P(Texas n tablet) = (0.40)(0.45) + (0.60) 
(0.70) = 0.60. The probability of selecting a tablet is 0.60. 


3. P(California|tablet)=P(Californiantablet)P(tablet)=(0.4) 


(0.45)0.60=0.180.60=0.30. 


: wer P(CaliforniaMtablet) (0.4)(0.45) 18 
P(California|tablet) = ————_——- = ——_— = 28 = 0.30. 
P(tablet) 0.60 0.60 Given 


that a tablet was selected, there is a 0.30 probability that it was made in California. 
page 347: 1. P(train arrives late on Monday and on time on Tuesday) = P(Arrives 
late on Monday) - P(Arrives on time on Tuesday) = (0.10)(0.90) = 0.09 


2. P(train arrives late at least once in a 5-day week) = 1 — P(train never arrives late 
in a 5-day week) = 1 — (0.90)? = 0.41 

3. P(full-time college student and age 65 or older) 4 (0.08)(0.15) because “Full- 
time college student” and “Adults age 65 or older” are not independent events. 
Knowing that a person is a full-time college student decreases the likelihood that 
he or she is an adult age 65 or older. 


Answers to Odd-Numbered Section 5.3 Exercises 


_ 44 
5.61 (a) P(TIE)=44200=0.22,P (TIE) = 300 = 9-22. en that the child is from 


England, there is a 0.22 probability that the student selected the superpower of 
telepathy. 
(b) P(E|SC)=54+52+30+4499+96+67+110=180372=0.484 


Cy\ _ 54452+30+44 — 180 _ 
P(E|S l= 99+96+67+110 372 — 0.484 


Given that the child did not choose superstrength, there is a 0.484 probability that 
the child is from England. 


5.63 (a) P(female|about right)=560560+295=0.655. 


. _ 560 _ noe 
P(female|about right) = 604205 — 9-655. Given that the person perceived his 


or her body image as about right, there is a 0.655 probability that the person is a 
female. (b) P(not overweight] F)=560+37560+163+37=597760=0.786. 
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= 560+37__ 597 __ 9 796 
560+163+37 760 "~~" Given that the person 


selected is female, there is a 0.786 probability that she did not perceive her body 


P(not overweight| F 


image as overweight. 

5.65 (a) P(is studying other than English) = 1 - P(mone) = 1 — 0.59 = 0.41. There is 
a 0.41 probability that the student is studying a language other than English. 

(b) P(Spanishother than English)=0.260.41=0.6341. 


P(Spanishother than English) = a = = 0.6341. @venaedie sndenes 


studying some language other than English, there is a 0.6341 probability that he or 
she is studying Spanish. 
5.67 P(B) < P(B | T) < P(T) < P(T | B). There are very few professional basketball 
players, so P(B) should be the smallest probability. If you are a professional 
basketball player, it is quite likely that you are tall, so P(T | B) should be the largest 
probability. Finally, it’s much more likely to be over 6 feet tall (B) than it is to bea 
professional basketball player if you’re over 6 feet tall (B | T). 
5.69 Method 1: P(D|C)=P(DnC)P(C)=0.180.32=0.563. 

P(DNC) 9, 18 


P(D|C) = = = $8 = 0.563 


" Given that a household owns a cat, there is a 
0.563 probability that the household owns a dog. Method 2: Use a Venn diagram. 


_ 0.18 0.18 
P(DIC)=0.180.18+0.14=0.180.32=0.563/ (©) = castora = 032 = 9-563 


yo 


0.22 0.18 0.14 


wa ae 


0. 


5.71 P(homeowner) = 340/500 = 0.68; P(homeowner | high school graduate) = 
221/310 = 0.713. Because these probabilities are not equal, the events 
“Homeowner” and “High school graduate” are not independent. Knowing that the 
person is a high school graduate increases the probability that the person is a 
homeowner. 

5.73 (a) P(iPhone | 18-34) = 169/517 = 0.327. Given that the adult is aged 18-34, 
there is a 0.327 probability that the person owns an iPhone. (b) First we determine 
P(iPhone) = 467/2024 = 0.231. Because these probabilities are not equal, the 
events “Own an iPhone” and “Aged 18-34” are not independent. Knowing that the 
person is aged 18—34 increases the probability that the person owns an iPhone. 
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5.75 There are 36 different possible outcomes of the two dice: (1, 1), (1, 2), ... , (6, 
6). Let’s assume that the second die is the green die. There are then 6 ways for the 
green die to show a 4: (1, 4), (2, 4), (3, 4), (4, 4), (5, 4), (6, 4). Of those, there is 
only one way to get a sum of 7, so P(sum of 7 | green is 4) = 1/6 = 0.1667 Overall, 
there are 6 ways to get a 7: (1, 6), (2, 5), (3, 4), (4, 3), (5, 2), (6, 1). So P(sum of 7) 
= 6/36 = 0.1667 Because these two probabilities are the same, the events “Sum of 
7” and “Green die shows a 4” are independent. Knowing that the green die shows a 
4 does not change the probability that the sum is 7. 

5.77 P(download music) = 0.29, P(don’t care | download music) = 0.67, 
P(download music n don’t care) = P(download music) « P(don’t care | download 
music) = (0.29)(0.67) = 0.1943 = 19.43%. About 19.43% of Internet users 
download music and don’t care if it is copyrighted. 

5.79 P(all three candies have soft centers) = 


P(1st soft) - P(2nd soft | 1st soft) - P(3rd soft | 1st and 2nd soft) = 


144 613 = «(12 = 2184 — 0.319 


1420-1319 -1218=21846840=0.319.20 19 18 6840 


0.319 probability that 3 randomly selected candies from a Gump box will all have 


’ There is a 


soft centers. 


5.81 (a) A tree diagram is shown here. 


gasoline 


sa 0.02 
Customer 


gasoline 


Z 


Mid-grade <—_ 0.6 


pe Credit card 
Regular <0, 


Y.72 
“—~ No credit card 


O- Credit card 


“06 A 
— No credit card 


¥) 
oe Credit card 


Premium <—_@. 


gasoline 


<8 No credit card 


(b) P(credit card) = (0.88)(0.28) + (0.02)(0.34) + (0.10)(0.42) = 0.295. There is a 
0.295 probability that the customer paid with a credit card. (c) P(premium gasoline 


| credit card) = P(premium gasolinen credit card)P(credit card)=(0.10) 


(0.42)0.295=0.0420.295=0.142. 
__ (0.10)(0.42) 
0.295 


P(premium gasolineMcredit card) 


P(credit card) 


_ 0.042 
~~ 0.295 


= 0.142. 


Given that the 


customer paid with a credit card, there is a 0.142 probability that the customer 


bought premium gas. 


5.83 We need to determine P(missed his first serve | won the point). Here is a tree 


diagram. 


1959 


go Wins point 


a { : 
gp Makes ~224 Doesn't 
we ees ~ 
Andy Murray =~ first-serve win point 
oy 
rVes 0 ss 
Betvee = y54_ Wins point 
Misses are 
first-serve 46 Doesn't 
win point 


P(missed his first serve | won the point) = 
P(missed his first serven won the point)P(won the point)=(0.40)(0.54)(0.60)(0.76)+ 
(0.40)(0.54)=0.2160.672=0.32. 


P(missed his first serveMwon the point) _ (0.40) (0.54) — 0.216 __ 0.32 
P(won the point) ~— (0.60)(0.76)+(0.40)(0.54) 0.672," Given 


that he won the point, there is a 0.32 probability that he missed his first serve. 


5.85 We need to determine P(antibody | positive). Here is a tree diagram. 


25 
of * 
Antibodies <0, 
\ ~00 
ot present SS 
} os = 
Person <_ 1.99 oe + 
Pie, — 
Antibodies <0 
present aad 


P(antibody|positive)=P(antibody n positive)P(positive)=(0.01)(0.9985)(0.01) 
(0.9985) 1 (0.99)(0.006)=0.6270. 


P as -_ P(antibodyM positive) _ (0.01)(0.9985) i 
P(antibody|positive) = —Sipositivey = Tooiio.o085) 1 0.90)(0006) = 0.6270. 


Given that the EIA test is positive, there is a 0.627 probability that the person has 
the antibody. 

5.87 (a) P(contributed) = (0.5)(0.4)(0.8) + (0.3)(0.3)(0.6) + (0.2)(0.1)(0.5) = 0.16 + 
0.054 + 0.01 = 0.224. There is a 0.224 probability that a potential donor 
contributed to the charity. (b) 

P(recent donor|contribute)=P(recent donorncontribute)P(contribute)=0.160.224=0.' 


‘ P(recent. donor“ contribute) 0.16 
P(recent donor|contribute) = ————___——_- = —— = 0.7143. 
( ) P(contribute) 0.224 7 


Given that the person contributes to the charity, there is a 0.7143 probability that 
the person was a recent donor. 


5.89 The probability that the string of lights will remain bright for 3 years is 
(0.98)? = 0.6676. 

5.91 P(none are late) 

= P(1st is not late and 2nd is not late and ... and 20th is not late) 

= P(1st is not late) - P(2nd is not late) - ... - P(20th is not late) 

= (0.90) (0.90) ... (0.90) 
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= (0.90)? 

= 0.1216 

P(at least 1 late) = 1 — P(mone are late) = 1 — 0.1216 = 0.8784 

5.93 We cannot simply multiply the probabilities together. If a woman is over 55 
years old, it is unlikely that she is pregnant. These events are not independent. 
5.95 (a) P(all 4 calls are for medical help) 

= P(1st is medical and 2nd is medical and 3rd is medical and 4th is medical) 

= P(1st is medical) - P(2nd is medical) - P(3rd is medical) - P(4th is medical) 
= (0.81)(0.81)(0.81)(0.81) 

= (0.81)4 

= 0.430 

There is a 0.430 probability that all 4 calls are for medical help. 

(b) P(at least 1 not for medical help) 

= 1 - P(all 4 calls are for medical help) 

= 1 - 0.430 

= 0.570 


There is a 0.570 probability that at least one of the 4 calls is not for medical help. 
(c) The calculation in part (a) might not be valid because the 4 consecutive calls 
being medical are not independent events. Knowing that the first call is medical 
might make it more likely that the next call is medical (e.g., several people might 
call for the same medical emergency). 


5.97 (a) 
Gender 
Male Female Total 
Blue eyes 10 
Eye color Brown eyes 40 
Total 50 


The events “Student is male” and “Student has blue eyes” are mutually exclusive 
because they don’t occur at the same time. 


(b) 


Gender 
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Blue eyes 
Eye color Brown eyes 
Total 


If the event “Student is male” and the event “Student has blue eyes” are 
independent, then: P(male)=P(male|blue) — 2050=x10 — x=4 


P(male) = P(male|blue) > 2 = + +a2=4 
(c) Answers may vary. 
Gender 
Male Female Total 
Eye color Blue eyes 10 
Brown eyes 40 
Total 50 


The events “Student is male” and “Student has blue eyes” are not mutually 
exclusive because they can occur at the same time (for 5 students). The events 
“Student is male” and “Student has blue eyes” are not independent because 


P(male)#* P(male | blue). P(male)=20/50=0.4/ (male) = 20/50 = 0.4. 
P(male|blue)=5/10=0.5/ (male|blue) = 5/10 = 0.5 

5.99 Two events are independent if P(ANB)=P(A) - P(B). 

P(AMB)= P(A): P(B). tris given that P(A) = 0.3, P(B) = 0.4, and 
P(AnB)=0.12. (AB) = 0.12. Because 0.12 = (0.3)(0.4), events A and B are 
independent. 

5.101 (a) There are 6 ways to get doubles out of 36 possibilities so 
P(doubles)=6/36=0.167,P (doubles) = 6/36 = 0.167. There is a 0.167 
probability of getting doubles on a single toss of the dice. (b) Because the rolls are 
independent, we can use the multiplication rule for independent events. 

P(no doubles firstn doubles second)=" (no doubles first ) doubles second) = 
P(no doubles first) -P(doubles second)=(30/36)(6/36)=0.139. 

P (no doubles first) - P (doubles second) = (30/36) (6/36) = 0.139. 

There is a 0.139 probability of not getting doubles on the first toss and getting 
doubles on the second toss. 
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(c) P(first doubles on third roll)=" (first doubles on third roll) 
P(no doubles): P(no doubles) : P(doubles)=56(56)(16)=25216 
P (no doubles) - P (no doubles) - P (doubles) = 2 () (4) = 


6\ 6 6 216 


= 0.116. There is a 0.116 probability that the first doubles occurs on the third roll. 
(d) For the first doubles on the fourth roll, the probability is 

(5) (3) 
(56)3(16) \® ®/ For the first doubles on the fifth roll, the probability is 


5 
eaiien ( . . The probability that the first doubles are rolled on the 


k-1 
5 t) 
kth roll is ee) (; , 
5.103 ¢ 


S405 6 


za 185-268 _ 449 
5.107 (a) 2=18.5-26.87.4=-112 74 © 


(i) The proportion of z-scores below —1.12 is 0.1314. 

(ii) normalcdf(lower: —1000, upper: 18.5, mean: 26.8, SD: 7.4) = 0.1314. About 
13.14% of young women are underweight by this cri- terion. (b) Note that 

P(not underweight)=1-—P(underweight)= 

P (not underweight) = 1 — P (underweight) = ;_9 131=0.869 

1 — 0.131 = 0.869, P(at least one is underweight) = 1 — P(mone are underweight) 
=1-0.8692=0.24481 — 0.869° = 0.2448 There is a 0.2448 probability that at 
least one of the two women will be classified as underweight. 


Answers to Chapter 5 Review Exercises 

R5.1 (a) If you take a very large random sample of pieces of buttered toast and 
dropped them from a 2.5-foot-high table, about 81% of them will land butter side 
down. (b) No; if four dropped pieces of toast all landed butter side down, it does 
not make it more likely that the next piece of toast will land with the butter side up. 
Chance behavior is unpredictable in the short run, but it has a regular and 
predictable pattern in the long run. The value 0.81 describes the proportion of 
times that toast will land butter side down in a very long series of trials. 

R5.2 (a) To use a table of random digits to perform the simulation, begin by letting 
00-80 = butter side down and 81-99 = butter side up. Moving left to right across a 
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row, look at pairs of digits to simulate dropping one piece of toast. Read 10 such 
pairs of two-digit numbers to simulate dropping 10 pieces of toast. Record the 
number of pieces of toast out of 10 that land butter side down. Determine if 4 or 
fewer pieces of toast out of 10 landed butter side down. Repeat this process many 
times. (b) Bold numbers indicate that the toast landed butter side down. 


Trial 1: 29 07 71 48 63 61 68 34 70 52. Did 4 or fewer pieces of toast land butter 
side down? No. 
Trial 2: 62 22 45 10 25 95 05 29 09 08. Did 4 or fewer pieces of toast land butter 
side down? No. 


Trial 3: 73 59 27 51 86 87 13 69 57 61. Did 4 or fewer pieces of toast land butter 
side down? No. 

(c) Assuming the probability that each piece of toast lands butter side down is 0.81, 
the estimate of the probability that dropping 10 pieces of toast yields 4 or less 
butter side down is 1/50 = 0.02. There is convincing evidence that the 0.81 claim is 
false. Maria does have reason to be surprised, as these results are unlikely to have 
occurred purely by chance. 


R5.3 (a) The sample space is: rock/rock, rock/paper, rock/scissors, paper/rock, 
paper/paper, paper/scissors, scissors/rock, scissors/paper, and scissors/scissors. 
Because each player is equally likely to choose any of the three, each of these 9 
outcomes will be equally likely and have probability 1/9. (b) There are three 
outcomes—rock/scissors, paper/rock, and scissors/paper—where player 1 wins on 
the first play. Define event A as player 1 wins on the first play. P(A) =3/9=0.33 
3/9 = 0.33. 

R5.4 (a) The probability that it is a crossover is 1 — 0.46 — 0.15 — 0.10 — 0.05 = 
0.24. The sum of the probabilities must add to 1. (b) P(vehicle is not an SUV ora 
minivan) = 0.46 + 0.15 + 0.24 = 0.85. The probability that the vehicle is not an 
SUV or a minivan is 0.85 


(c) P(pickup truck | not a passenger car) = 


0.150.15+0.10+0.24+0.05=0.150.54=0.278 0.15+0.10+0.24+0.05 0.54 


Given that the vehicle is not a passenger car, there is a 0.278 probability that the 
vehicle is a pickup truck. 
R5.5 (a) P(drives an SUV) = 39/120 = 0.325. The probability that the person drives 


an SUV is 0.325. (b) P(drives a sedan or exercises) = 25+20+15+12120=0.60 


25+20+15+12 _ 9 eg 
120 ‘~~. The probability that the person drives a sedan or exercises for 
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at least 30 minutes four or more times per week is 0.60. 
(c) P(does not drive a truck | exercises) = 25+1525+15+12=4052=0.769 
ai, = 2 =0.769 


25+15+12 Given that the person exercises for at least 30 minutes 
four or more times per week, there is a 0.769 probability that the person does not 
drive a truck. 

R5.6 (a) The events “Thick-crust pizza” and “Pizza with mushrooms” are not 
mutually exclusive. It is possible for a pizza to be thick-crust and have mushrooms. 


(b) Let’s organize the given information in a table. 


Thick crust Thin crust Total 
Mushrooms 2 4 6 
No mushrooms 1 2 3 


Total 3 6 9 


P(mushrooms) = 6/9 = 0.667 and P(mushrooms | thick crust) = 2/3 = 0.667. 
Because P(mushrooms) = P(mushrooms | thick crust), the events “Mushrooms” 
and “Thick crust” are independent. (c) Use the general multiplication rule. P(both 
randomly selected pizzas have mushrooms) = P(first has mushrooms) :* P(second 
has mushrooms | first has mushrooms) =(6/9)(5/8)=0.417(6/9) (5/8) = 0.417 ff 
you randomly select 2 of the pizzas in the oven, the probability that both have 
mushrooms is 0.417. 

R5.7 (a) A tree diagram is shown here. 


Test result 


oe + 
ere 
i 0 2 
Se Qh + 
No <2, 95 


drugs 


(b) P(drug test result is positive) = (0.04)(0.90) + (0.96)(0.05) = 0.084. The 
probability that the drug test result is positive is 0.084. 
(c) P(uses illegal drugs | positive result) = 
P(uses illegal drugsn positive result)P(positive result)=(0.04)(0.90)0.084=0.429 
P(uses egal drugs positive result) __(0.04)(0.90) _ 0.429 

P{positive result) ee Given that the drug test result 


is positive, there is a 0.429 probability that person uses illegal drugs. 
R5.8 P(at least 1 believes that finding and picking up a penny is good luck) = 1 - 
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P(mone of the 10 people believe this) 

= 1- (0.67)!° = 0.98177 

The probability of selecting 10 U.S. adults and finding at least 1 who believes that 
finding and picking up a penny is good luck is 0.98177. 


Answers to Chapter 5 AP® Practice Test 
T5130 
T5.2 b 
Thc 
15.4 b 
T3.2¢ 
T5.6a 
T5.7 b 
T5.8 a 
(3.2 
T5.10 e 


T5.11 (a) P(student eats regularly in the cafeteria and is not a 10th grader) 
130+122+68 _ 320 _ @ a9 
=130+122+68805=320805=0.398 = 805 805 “"~. There is a 


probability of 0.398 that a randomly selected student eats regularly in the cafeteria 
and is not a 10th grader. (b) P(10th grader | eats regularly in the cafeteria) = 
175/495 = 0.354. Given that the student eats regularly in the cafeteria, there is a 
0.354 probability that he or she is a 10th grader. (c) P(10th grader) = 209/805 = 
0.260. The events “10th grader” and “Eats regularly in the cafeteria” are not 


independent because P(10th grader | eats regularly in the cafeteria)#7 P(10th 
grader). 


T5.12 Let’s organize this information using a tree diagram. 


gio Defective 


A =O 
70 
& = OOK 
~ 
030 oe Defective 
Part <—— B <_07% 
" SHOOK 
Y 
ln 
4 oad Defective 
C = _O65 
~2 OK 


(a) To get the probability that a part randomly chosen from all parts produced in 
this factory is defective, add the probabilities from all branches in the tree that end 
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in a defective part. P(defective) = (0.60)(0.10) + (0.30)(0.30) + (0.10)(0.40) = 0.06 


+ 0.09 + 0.04 = 0.19. (b) P(Machine B | defective) =(0.30) 
(0.30)(0.30) _ 0.09 _ 474 
(0.30)0.19=0.090.19=0.474 0.19 = 0.19 ~~ “*"" “, Given that a part is 


inspected and found to be defective, there is a 0.474 probability that it was 


produced by Machine B. 


T5.13 (a) P(customer will pay less than the usual cost of the buffet) = 23/36 = 
0.639. There is a 0.639 probability that the customer will pay less than the usual 
cost of the buffet. (b) P(all 4 friends end up paying less than the usual cost of the 
buffet) = (23/36)* = 0.167. There is a 0.167 probability that all 4 of these friends 
end up paying less than the usual cost of the buffet. (c) P(at least 1 of the 4 friends 
ends up paying more than the usual cost of the buffet) = 1 — 0.167 = 0.833. There 
is a 0.833 probability that at least 1 of the 4 friends ends up paying more than the 
usual cost of the buffet. 

T5.14 (a) To carry out the simulation using a table of random digits, let 00-16 = 
cars with out-of-state plates and let 17-99 = other cars. Moving left to right across 
a row, look at pairs of digits from a random number table until you have found two 
numbers between 00 and 16 (repeats are allowed). Record how many two-digit 
numbers you had to read in order to get 2 numbers between 00 and 16. Repeat this 
process many times. (b) The first sample is 41 05 09. The bold numbers represent 
cars with out-of-state plates. In this sample, it took three cars to find two with out- 
of-state plates. The second sample is 20 31 06 44 90 50 59 59 88 43 18 80 53 11. 
In this sample, it took 14 cars to find two with out-of-state plates. The third sample 
is 58 44 69 94 86 85 79 67 05 81 18 45 14. In this sample, it took 13 cars to find 
two with out-of-state plates. 


Chapter 6 

Section 6.1 

Answers to Check Your Understanding 

page 365: 1. Using probability notation, the event “Student got a C” is written as 
p(x=2)P (X = 2), pcx=2)=1-0.011-0.032-0.362— 

P(X = 2) =1—0.011 — 0.032 — 0.362—_ 9 457=9 1390.457 = 0.138 

2 P(x>3)P (X 2 3) is the probability that the student got either an A or a B. This 
probability is P(X>3)=0.362+0.457=0.819" (X = 3) = 0.362 + 0.457 = 0.819. 


3. The histogram is left skewed. This means that higher grades are more likely, and 
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there are a few lower grades. 


page 371: 1. pX=(0)(0.011)+(1)(0.032)+(2)(0.138)+ 


(3) (0.362) + (4) (0.457) = 3.222 J many, many students are selected, the 


average grade of the students selected will be about 3.222. 
2. 0X2=(0-3.222)2(0.011)+(1-3.222)2(0.032)+ 
o2 = (0 — 3.222)” (0.011) + (1 — 3.222)” (0.032) + (2-3,292)2(0.138)+ 


(2 — 3.222)" (0.138) + (3-3.992)2(0.362)+(4-3.222)2(0.457) 
(3 — 3.222)° (0.362) + (4 — 3.222)” (0.457). gx9=0,77277% x = 9-7727 so 


oX=0.7727=0.8797X = V0.7727 = 0.879 The grade received by a randomly 
selected student will typically vary from the mean (3.222) by about 0.879 points. 


Answers to Odd-Numbered Section 6.1 Exercises 
6.1 (a) X=5; P(X=5)=0.11% = 93 P(X = 5) = 0.11 (py pex<3)=0.72 
P(X < 3) =0.72 


6.3 (a) P(Y<20)=0.31P (Y < 20) = 0.31. there is a 0.31 probability that the 
amount of money collected on a randomly selected ferry trip is less than $20. (b) 
y>20Y > 20; pry>20)=0.69 (Y 2 20) = 0.69 

6.5 


$ 4 
mm O14 — | 
OO —— : 


0 5 10 15 20 25 
Y = Money collected ($) 


Skewed to the left with a single peak at $25 collected. 

6.7 pY=19.35HY = 19.35. if many, many ferry trips are randomly selected, the 
average amount of money collected will be about $19.35. 

6.9 (a) Right-skewed distribution; the most likely first digit is 1, and each 


subsequent digit is less likely than the previous digit. (b) px=3.4414« ~ 3.441. in 
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many, many randomly chosen legitimate records, the average of the first digit 
would be about 3.441. 


6.11 (a) 

Money collected 0 5 10 15 20 Ps 
Probability 0.02 0.05 0.08 0.16 0.27 0.42 
Cumulative Probability 0.02 0.07 0.15 0.31 0.58 1 


The median of Y is $20. (b) The mean of Y is less than the median of Y because the 
probability distribution is skewed left. 

6.13 oY=6.4297Y = 6.429. the money collected on a randomly selected ferry trip 
will typically vary from the mean ($19.35) by about $6.43. 


6.15 ox=2.462% = 2.462. the first digit of a randomly chosen record will 
typically vary from the mean (3.441) by about 2.462. 

6.17 (a) The company has collected $1250 and must pay out $100,000. The 
company ears $1250-$100,000=-$98,75051250 — $100,000 = —$98,750 (py 
HY=$303.354Y — $303.35 ; if many, many males are insured by this company, the 
average amount the company would make, per person, will be about $303.35. (c) 
oY=$9,707.577Y = $9,707.57 ; the amount that the company earns on a randomly 
selected policy will typically vary from the mean ($303.35) by about $9,707.57. 
6.19 (a) Both distributions are skewed to the right. The center for the “household” 
distribution is less than the center for the “family” distribution, but the variability 
of the household distribution is greater than the variability of the family 
distribution. Also, the event X=1X = 1 has a much higher probability in the 
household distribution. (b) pH=2.64# = 2-6 and pF=3.14HF = 3-14, the 
household distribution has a slightly smaller mean than the family distribution. (c) 
OF=1.2499F = 1.249. the standard deviation for the household distribution is 
slightly larger than for the family distribution. 


6.21 (a) P(Y>6)=0.333; P(X>6)=0.155/ (Y > 6) = 0.333; P(X > 6) = 0.155. 
if an expense report contains more than a proportion of 0.155 of first digits that are 
greater than 6, it may be a fake expense report. (b) The mean is 5 because this 
distribution is symmetric. (c) To detect a fake expense report, compute the sample 
mean of the first digits and see if it is closer to 5 (Suggesting a fake report) or near 
3.441 (consistent with a truthful report). 


6.23 Length=3600; height=1/3600; probability=300(1/3600)=0.083; 
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Length = 3600; height = 1/3600; probability = 300 (1/3600) = 0.083; 


there is about an 8.3% chance that the request is received within the first 300 
seconds (5 minutes) after the hour. 

6.25 (a) P(-1<v<1)=0.4P (—1 < Y < 1) = 0-4. the probability that Mr. Shrager 
dismisses class within a minute of the end of class is 0.4. (b) pY=1.5;#Y = 1.95 
the mean is 1.5 because this distribution is symmetric. (c) k=2.75;* = 2.79} to find 


this value, we solve the equation (4-k)(1/5)=0.25(4 — k) (1/5) = 0.25 fork. 


6.27 (i)z=-1.502 = 1.50; p(z<-1.50)=0.0668" (4 < —1.50) = 0.0668. 

(ii) p(y<6)= (Y <6) = normalcdf(lower: —1000 1000 | upper: 6, mean: 7.11, 
SD: 0.74). There is about a 6.68% chance that this student will run the mile in 
under 6 minutes. 

6.29 (a) (i) z=-1.832 = —1.83 and z=1.52 = 1.5; P(-1.83<Z<1.5)= 

P(—1.83 < Z < 1.5) = 9 9339-:.90336=0.89960-9332 — 0.0336 = 0.8996 


(ii) P(325<X<345)=P (325 < X < 345) = normalcdf(lower: 325, upper: 345, 
mean: 336, SD: 6) = 0.8998. There is about an 89.98% chance that a randomly 
selected horse pregnancy will last between 325 and 345 days. 

(b) (i) 0.20 area to the right of c -© ~z=0.842 = 0.84, 0.84=c-3366 


0.84 — c—336 
ou 6 4 > 5.04=c-3369-04 = ¢ — 336 _,—> (=341.04c = 341.04 


(ii) invNorm(area: 0.80, mean: 336, SD: 6)=341.05= 341.05, About 20% of 
horses will have pregnancies that last more than 341.04 days. 

6.31b 

6.33.¢ 


6.35 (a) Yes; if we look at the differences (Post — Pre) in the scores, the mean 


difference was 5.38 and the median difference was 3. This means that at least half 
of the students (though less than three-quarters because q19i was negative) 
improved their reading scores. (b) No, we do not have a control group that did not 
participate in the chess program for comparison. It may be that children of this age 
improve their reading scores for other reasons (e.g., regular school) and that the 
chess program had nothing to do with their improvement. 


Section 6.2 
Answers to Check Your Understanding 
page 386: 1. The probability distribution of X and the probability distribution of Y 
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are shown here. 


Probability 
8 


0 1 2 3 
X = Number of cars sold 


Probability 
8 


0 500 1000 1500 
Y = Bonus received ($) 


Both distributions are slightly skewed to the right with a single peak. Their shapes 
are identical. 
2. pY¥=500(1X)=500(1.1)=$550HY = 500 (1x) = 500 (1.1) = $550 


3. cY=500(0.943)=$471.507Y = 900 (0.943) = $471.50 The bonus earned in 
the first hour of business on a randomly selected Friday will typically vary from 
the mean ($550) by about $471.50. 


4, Note that T=Y-751 = Y — 75, Shape: The shape of the probability distribution 
of T will be the same as the shape of the probability distribution of Y: skewed right 
with a single peak. 

Center: 'T='Y—-75=550-75=$475HT = Hy — 75 = 550 — 75 = $475, 
Variability:oT=0Y=$471.509T = 7y = $471.50. 

page 394: 1. pT=pX+pY=1.1+0.7=1.gHT = Hx + Hy = 1.1+0.7= 1.8. this 
dealership expects to sell or lease about 1.8 cars in the first hour of business, on 
average, over many randomly selected Fridays. 


2. Because X and Y are independent, oT2=o0X2+o0Y2=(0.943)2+ 
9 9 9 2 2 
oT=1.2988=1.149T = V 1.2988 = 1.14 The total number of cars sold or leased 


in the first hour of a randomly selected Friday will typically vary from the mean 
(1.8) by about 1.14. 


3. The total bonus is B=500X+300YB = 500X + 300Y. This means that pB= 
HB =500uX+300pLY=500(1.1)+300(0.7)=$760 
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500.x + 300py = 500 (1-1) + 300 (0.7) = $760 pocanse X and Yare 


5) ee /* 
independent, oB2=(5000X)2+(3000¥)2=7B = (5000x)" + (3000y)" = 


2 2 

(500)2(0.943)2+(500)° (0.943) "+ (399)2(0.64)2=259,176.25 
2 2 

(300)"(0.64)" = 259,176.25 Therefore, cB=259,176.25= 

oB = 259, 176.25 = geo 99$509.09, 


Answers to Odd-Numbered Section 6.2 Exercises 
6.37 Approximately Normal; pT=204T — 20 minutes; oT=6.57T = 6-5 minutes 
6.39 (a) 


2 | 
& 0.10 1 
0.05 | | r 
LE NEE ERAGE 
38 8639 40 41 42 43 44 45 
X = Total number of people 
on the flight 


The distribution of total number of people on the flight is roughly symmetric with a 
single peak at 42 people. (b) pX=41.44X = 41.4. if many, many flights are 
randomly selected, the average of the total number of people on the flight will be 
about 41.4 people. (c) oX=1.2479X 1.24 ; the total number of people on a flight 
will typically vary from the mean (41.4) by about 1.24 people. 

6.41 (a) The distribution of M has the same shape as the distribution of X: skewed 
to the left. (b) uM=$19.354HM = $19.35 (cy gM=$6.457™ = $6.45 

6.43 (a) W represents the amount of time (in minutes) after the start of an hour at 
which a randomly selected request is received by a particular web server. (b) W has 
a uniform distribution on the interval from 0 to 60 minutes. 

6.45 (a) G=5X+50G = 5X + 50; medianG=5(8.5)+50=92.5 

mediang = 5 (8.5) + 50 = 92.5 (b) IQRG=5(9-8)=5! QR =5(9-—8)=5 


_ 9 nach 
6.47 (a) pY=95(8.5)+32=47.3"" ~ = (8.5) + 32 = 47.3 


oy = 3(2.25) = 4.05, re 


(b) oY=95(2.25)=4.05 


temperature in the cabin at midnight on a randomly selected night will typically 
vary from the mean (47.3 °F)(47-3 F) by about 4.05 °F4.05 °F, 

(c) (i)z=-1.802 = —1.80; p(z<-1.80)=0.0359" (4 < —1-80) = 0.0359 

(ii) pcy<40)=" (Y < 40) = normalcdf(lower: —1000 1000 | upper: 40, mean: 
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47.3, SD: 4.05)=0.0357= 0.0357. There is a 0.0357 probability that the midnight 
temperature in the cabin is below 40°F40 F. 
6.49 pS=pX+pY=732.50+825=$1557.50 

Ms = Bx + By = 732.50 + 825 = $1557.50. the average of the sum of the 
tuitions would be about $1557.50 for many, many randomly selected pairs of 


students from each of the campuses. 


6.51 pF-M=120-105=15H#F-M = 120 — 105 = 15, the average of the difference 
would be about 15 points, if you were to repeat the process of selecting a single 
male student, selecting a single female student, and finding the difference (Female 
— Male) in their scores many times. 


6.53 LX/50-Y/55=pX/50-pY/55=14.65-15=—0.35 
Lx/50-Y/55 = Ux/s0 — Hy/s5 = 14.65 — 15 = —0.35 


6.55 (a) Yes; the mean of a sum is always equal to the sum of the means. (b) No; 
the variance of the sum is not equal to the sum of the variances because it is not 
reasonable to assume that X and Y are independent. 


6.57 Because X and Y are independent, oS=1032+126.502= 


22 Fe 
gs = V 103" + 126.50" = $163 13$163.13. For a student randomly selected 
from each of the campuses, the sum of their tuitions typically varies from the mean 
($1557.50) by about $163.13. 


6.59 (a) Independence of F and M means that knowing the value of one student’s 
score does not help us predict the value of the other 


student’s score. (b) OF-M=282+352=44,8229F-M = V 28" + 35° = 44.822. fo, 
a randomly selected male and female college student, the difference of SSHA 
scores typically varies from the mean (15) by about 44.822 points. (c) No, we do 
not know the shapes of the distributions. We cannot assume that the distributions 
are Normal without additional information. 


6.61 oD=0X/50- Y/55=(10350)2+(126.5055)2=3.088 


2 9 
if 103 \~ 126.50 \~ 
oD = 9X/50-Y/55 = \ (43) a ( 55 ) = 3.083 


the 
standard deviation of the difference D (Main — Downtown) in the number of units 
that two randomly selected students take is 3.088. 


6.63 1X 1+X2=$606.70"%: +X2 = $606.70 and because the variables are 
independent, oX1+X2=$13,728.58. W=12(X1+X2) 
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- waco = 
0x,+X, = $13,728.58. W = (Xi 4 X2) so wW=$303.35/ =$303.35 and 


oW=$6864,299W = $6864.29 

6.65 (a) Normal distribution with mean = 11 + 20 = 31 seconds, and because the 
variables are independent, standard deviation=22+42=4,4721 

standard deviation = \/ 2? + 4? = 4.4721 (b) (i)z=-0.222 = —0.22. prz< 
(ii) P7X+Y<30)=normalcdf(lower: —1000, 

P(X + i <= 30) = normalcdf(lower: = 1000, upper: 30. mean: 31. SD: 4.4721) 
= 0.4115. There is a 0.4115 probability of completing the process in less than 30 
seconds for a randomly selected part. 


6.67 Let D = L — H; pD=105-98=7-D = 105 — 98 = 7. Because the random 
variables are independent, oD=(10)2+(15)2=18.03 


= ,/(10)? + (15)? = 18.03 
am, (10)" + (15) .D follows a Normal distribution with a mean of 


7 and a standard deviation of 18.03. We want to find P(-5<L-—H<5) 

Pl bi hie d eG) p(-s<p<5yP (—5 <D<5) 

(i)z=-0.11 and z=-0.672 = —0.11 and z = —0.67, p(—0.67<Z<-0.11)=0.4562- 
P (0.67 < Z < —0.11) = 0.4562— 9 9544=0.20480.2514 = 0.2048 (ii) 
P(-5<D<5)P (-5 < D<5)- normalcdf(lower: —5, upper: 5, mean: 7, SD: 
18.03)=0.2030= 0.2030. There is a 0.203 probability that Lamar and Hareesh 
finish their jobs within 5 minutes of each other on a randomly selected day. 
6.69 Let T=X1+X2+X3+X4l = X1 + Xo + X3 + Xa, T=55.2+58.0+56.3+ 
br = 59.2 + 58.0 + 56.3+54 7-994.954.7 = 224.2. Because the random 
variables are independent, 

OT=(2.8)2+(3.0)2+(2.6)2+(2.7)2=5.56 


a, wi a. eS 2 _ 
ee y (2.8) + (3.0)" + (2.6)" + (2.7) Dae rag the Normal distribution 


with a mean of 224.2 and a standard deviation of 5.56. We want to find P(T<220) 
P(T < 220) (jy 2=-0.762 = —0.76; p(z.<-0.76)=0.2236 

P(Z < —0.76) = 0.2236 

(ii) P(T<220)=normalcdf(lower: —1000, upper: 220, mean: 224.2, SD: 


5.96)=0.2250. 
P(T < 220) = normalcdf(lower: — 1000, upper : 220, mean : 224.2, SD:5.56) = 0.2250. 


There is a 0.2250 probability that the total team time is less than 220 seconds in a 
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randomly selected race. 


6.71 Let D=X1-X2; pD=1.4-1.4=027 = X1 — X23 Hp = 14—1.4=0 Because 
the random variables are independent, oX1—X2=0.32+0.32=0.4243 


_ fra . = 
xX, = V0.3" + 0.3" = 0.4243 | D has the Normal distribution with a mean of 


0 and a standard deviation of 0.4243. We want to find P(D<-—0.8 or D>0.8) 
P(D < —0.80rD > 0.8) 


(i) z=1.892 = 1.89 and z=-1.892 = —1-89; P(Z<-1.89 or Z>1.89) 
P(Z < —1.890r Z > 1.89) <9 gsgg— 0.0588 


ox, 


(ii) P(D<—0.8 or D>0.8)=1-normalcdf(lower: —0.8, 

P(D < —0.80r D > 0.8) = 1 — normalcdf(lower: — 0.8, upper: 0.8, mean: 0, 
SD: 0.4243) = 1 -0.9406 = 0.0594. There is a 0.0594 probability that difference is 
at least as large as the attendant observed. 
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6.75 (a) Yes; the girls who were assigned to receive the fluoride varnish may 
expect to get fewer cavities and, as a result, might take better care of their teeth. (b) 
This experiment could be double-blind if the girls who are not selected to receive 
the fluoride varnish received a placebo varnish that would neither help nor hurt 
their teeth. Also, the dental hygienist who checks the girls’ teeth for cavities at the 
end of the 4 years should not know which girls received which treatment. (c) The 
random assignment should help to make the two groups as similar as possible 
before treatments are administered. 


Section 6.3 

Answers to Check Your Understanding 

page 406: 1. Binary? “Success” = get an ace; “Failure” = don’t get an ace. 
Independent? Because you are replacing the card in the deck and shuffling each 
time, the result of one trial does not tell you anything about the outcome of any 
other trial. Number? n=10n = 10, Same probability? p=4/52P? — 4/52 Thisisa 
binomial setting, and X has a binomial distribution with n=10n = 10 and p=4/52 
p = 4/52. 

2. Binary? “Success” = over 6 feet; “Failure” = not over 6 feet. Independent? 
Because we are selecting without replacement from a small number of students, the 
observations are not independent. Number? n=5"% = 5.Same probability? The 
(unconditional) probability of success will not change from trial to trial. Because 
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the trials are not independent, this is not a binomial setting. 


3. Binary? “Success” = roll a5; “Failure” = don’t roll a 5. Independent? Because 
you are rolling a die, the outcome of any one trial does not tell you anything about 
the outcome of any other trial. Number? n=100” = 100. Same probability? No; 
the probability of success changes when the corner of the die is chipped off. 
Because the (unconditional) probability of success changes from trial to trial, this 
is not a binomial setting. 

page 412: 1. Binary? “Success” = question answered correctly; “Failure” = 
question not answered correctly. Independent? Mr. Miller randomly determined 
correct answers to the questions, so knowing the result of one trial (question) 
should not tell you anything about the result on any other trial. Number? n=10 

n = 10. Same probability? p=0.20P = 0.20 This is a binomial setting and X has a 
binomial distribution with n=10n = 10 and p=0.20P = 0.20, 

2. P(X=3)=(103)(0.2)3(0.8)7=0.2013. 

P(X = 3) = & (0.2)°(0.8)' = 0.2013. 
3 There is about a 20% 


chance that Hannah will answer exactly 3 questions correctly. 

3. P(X>6)=1-P(X<6)=1-P(X<5)=1-binomcdf 

P(X > 6) =1-— P(X < 6) =1-— P(X < 5) =1- binomedf (ia1s:10, p: 
0.2, x value:5)=1-0.9936=0.0064. 

(trials:10,p:0.2,avalue:5) = 1 — 0.9936 = 0.0064. There is only a 0.0064 
probability that a student would get 6 or more correct, so we would be quite 
surprised if Hannah was able to pass. 

page 418: 1. 


Probability 
o 
ind 
i=) 


rr a a | 


Vi23 45 6 7 8 19 10 


Y = Number of questions 
answered incorrectly 


The probability distribution is skewed to the left with a single peak at Y = 8. 

2. pY=np=10(0.80)=3HY = ™P = 10 (0.80) = 8 if many students took the quiz, 
the average number of questions students would answer incorrectly is about 8 
questions. 
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3. oY=np(1—p)=10(0.80)(0.20)=1.265. 


Coy = \/ np (1 Pp) = V 10 (0.80) (0.20) = 1 20. If many students took the 


quiz, we would expect individual students’ scores to typically vary from the mean 
of 8 incorrect answers by about 1.265 incorrect answers. 
4. The shape of X is skewed right and the shape of Y is skewed left, both with a 
single peak. The center (mean) of the distribution of X is 2, which is less than the 
center (mean) of the distribution of Y, which is 8. (Note that the sum of the means 
is 10 because X + Y = 10.) The variability of both probability distributions is the 
same (standard deviation = 1.265). 
page 426: 1. Die rolls are independent, the probability of getting doubles is the 
same on each roll (1/6), and we are repeating the chance process until we get a 
success (doubles). This is a geometric setting and T is a geometric random variable 
with p=16" ~ $. 

P(T =3)= ( 
2. P(T=3)=(56)2(16)=0.1157 
is 11.57% that 
you will get the first set of doubles on the third roll of the dice. 
3. P(T<3)=16+(56)(16)+(56)2(16)=0.4213 


P(T <3)= : . (5) (5) " (3) (3) A PEE rhe pibUas 


bility is 42.13% of getting doubles in 3 or fewer rolls. 


(+) =0.1157 
? . The probability 


Answers to Odd-Numbered Section 6.3 Exercises 

6.77 Binary? “Success” = survive; “Failure” = does not survive. Independent? Yes; 
knowing the outcome of one elk shouldn’t tell us anything about the outcomes of 
other elk. Number? n=7n = 7. Same probability? p=0.44P = 9-44, x has a 
binomial distribution with n=7" = 7 and p=0.44P = 0.44 

6.79 Binary? “Success” = hits; “Failure” = does not hit. Independent? Yes, the 
outcome of one shot does not tell us anything about the outcome of other shots. 
Number? No, there is not a fixed number of trials. Y does not have a binomial 
distribution. 

6.81 X has a binomial distribution with n=7" = 7 and p=0.44P = 0.44, 


P(X=4)=(74)(0.44)4(0.56)3=0.2304. 
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7 4 3 
PA == ( ) (0.44)°(0.56)° = 0.2304. 
4 Tech: p(y=4)=P (Y = 4) 
binompdf(trials: 7, p: 0.44, x value: 4)=0.2304. 
binompdf (trials:7,p:0.44,xvalue:4) = 0.2304. There is a 23.04% probability 
that exactly 4 of the 7 elk survive to adulthood. 


0 


6.83 (a) Binary? “Success” = spinner lands in the blue region; “Failure” = spinner 
does not land in the blue region. Independent? Knowing whether or not one spin 
lands in the blue region tells you nothing about whether or not another spin lands 
in the blue region. Number? n=12n = 12, Same probability? p=0.80P = 0.80 x 
has a binomial distribution with n=12n = 12 and p=0.80P = 9-80. (ph) P(x=8)= 
P(X =8)= 

12 


nest 8 
P(Y =8) 


) (0.80)°(0.20)* = 0.1329 
; Tech: P(Y=8)= 


~ binompdf(trials: 

12, p: 0.80, x value: 8) =0.1329= 0.1329. There is about a 13.29% probability that 
the spinner will land in the blue region exactly 8 of the 12 times. 

6.85 X has a binomial distribution with n=7n = 7 and p=0.44P = 0.44, 
P(X>4)=0.1402P (X > 4) = 0.1402 Tech: pex>ay=P (X > 4) = 1 - 
binomcd{(trials: 7, p: 0.44, x value: 4) =1-0.8598=0.1402= 1 — 0.8598 = 0.1402 
. This probability isn’t very small, so it is not surprising for more than 4 elk to 
survive to adulthood. 

6.87 X has a binomial distribution with n=12n = 12 and p=0.80P = 9-80. Tech; 
P(x<7)=P (X<7)= binomcdf(trials: 12, p: 0.80, x value: 7)=0.0726= 0.0726, 
Assuming the website’s claim is true, there is a probability of about 7.26% that 
there would be 7 or fewer spins landing in the blue region. 

6.89 (a) X has a binomial distribution with n=22n = 22 and p=0.20P = 9.20, 
Tech: p(x>14)=P (X > 14) = 1 — binomcdf(trials: 22, p: 0.20, x value: 13) = 
0.00001. Assuming participants don’t have a special preference for the last thing 
they taste, there is an almost 0% probability that there would be at least 14 people 
who choose the last kiss. (b) Because this outcome is very unlikely, we have 
convincing evidence that participants have a preference for the last thing they taste. 
6.91 (a) Binary? “Success” = red; “Failure” = not red. Independent? Knowing 
whether or not the light is red for one randomly selected passenger tells you 
nothing about whether or not the light is red on another randomly selected 
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passenger. Number? n=20n = 20. Same probability?p=0.30P = 9-30. R has a 
binomial distribution with n=20% = 20 and p=0.30P = 9-30. (b) The shape is 
fairly symmetric with a single peak at R=6R = 6 passengers. 

6.93 


Probability 
o 


O D-2:3' 45 627 8 9 021 


Number of times the spinner 
lands in the blue region 


The shape is left-skewed with a single peak at X=10X = 10, 


6.95 (a) uR=20(0.3)=642 = 20 (0.3) = 6. if many groups of 20 passengers were 
selected, the average number of passengers who would get a red light would be 


about 6 passengers. (b) oR=20(0.3)(0.7)=2.0497R = Vv 20 (0.3) (0.7) = 2.049. i- 
many groups of 20 passengers were selected, the number of passengers who would 
get a red light would typically vary from the mean (6) by about 2.049. 

6.97 (a) Let Y=numberY = number of calls not completed. Y has a binomial 
distribution with n=15n = 15 and p=0.91P = 9-91. Tech: P(Y>12)=1- 

P(Y > 12) = 1~ pinomedf(15,0.91,12)=0.8531 

binomedf (15, 0.91, 12) = 0.8531 There is an 85.31% probability that more than 


12 calls are not completed. (b) pX=15(0.09)=1.354-x = 15 (0.09) = 1.35. it we 
watched the machine make many sets of 15 calls, the average number of calls that 
would reach a live person would be about 1.35 calls. (c) oX=15(0.09)(0.91)=1.11 


ox = ¥/15 (0.09) (0.91) = 1.11. if we watched the machine make many sets of 


15 calls, we would expect the number of calls that reach a live person to typically 
vary from the mean (1.35) by about 1.11 calls. 

6.99 (a) As long as n=1007 = 100 js less than 10% of the size of the population 
(the entire high school), L can be modeled by a binomial distribution even though 
the sample was selected without replacement. (b) Tech: 
P(L>15)*1-binomcdf(100,0.11,14)=0.1330 

P(L > 15) = 1 — binomedf (100, 0.11, 14) = 0.1330 There is a 13.3% 


probability that 15 or more students in the sample are left-handed. 


6.101 No; we are sampling without replacement and the sample size (10) is more 
than 10% of the population size (76). We should not treat the observations as 
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independent. 

6.103 (a) n=100n = 100 and p=0.11,P = 9-11; so np=11210" = 11 = 10 ana 
n(1-p)=89>10” (1— p) = 89> 10 Since the expected number of successes and 
the expected number of failures are both 10 or more, L can be approximated by a 
Normal distribution. (b) We want to find P(L>15)P (4 = 15). wt =(100)(0.11)=11 


He = (100) (0.11) = 11 ang o£. =(100)(0.11)(0.89)=3.129 
o1 = y/(100) (0.11) (0.89) = 3.129 


(i) z=1.282 = 1.28; P(Z>1.28)=1-0.8997=0.1003 
P(Z > 1.28) = 1 — 0.8997 = 0.1003 


(ii) P(L=15)=normalcdf(lower: 15, upper: 1000, mean: 11, SD:3.129)=0.1006. 
P(LZ > 15) = normalcdf(lower:15,upper:1000,mean:11, SD : 3.129) = 0.1006. 
There is a 0.1006 probability that 15 or more students in the sample are left- 
handed. 

6.105 When sampling without replacement, the trials are not independent because 
knowing the outcomes of previous trials makes it easier to predict what will 
happen in future trials. However, if the sample is a small fraction of the population 
(less than 10%), the makeup of the population doesn’t change enough to make the 


lack of independence an issue. 

6.107 (a) X is a geometric random variable with p=0.20P = 9-20. prx=3)= 
(0.8)2(0.2)=0.128P (X = 3) = (0.8)° (0.2) = 0.128. Tech: geometpdf(p:0.20, x 
value:3)=0.1288e0metpdf(p : 0.20, x value : 3) = 0.128 qhore is a 0.128 
probability that it takes Rita exactly 3 pulls to start the mower. (b) P(X>6) 

P(X > 6) 

=1-[0.20+(0.8)(0.2)+(0.8)2(0.2)+- - -+(0.8)5(0.2)]=0.2621 

= 1 — [0.20 + (0.8) (0.2) + (0.8)? (0.2) +--+ - + (0.8)° (0.2)] 

= 0.2621 

Tech: 1 — geometcdf(p: 0.20, x value: 6)=0.2621= 0.2621, There is a 0.2621 
probability that it takes Rita more than 6 pulls to start the mower. 

6.109 (a) This is not a geometric setting because we can’t classify the possible 
outcomes on each trial (card) as “success” or “failure” and we are not selecting 
cards until we get 1 success. (b) Games of 4-Spot Keno are independent, the 
probability of winning is the same in each game (p=0.259)(P ~ 0.259) and Lola 
is repeating a chance process until she gets a success. X is a geometric random 
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variable with p=0.259P = 9.259. 


6.111 (a) X is a geometric random variable with p=0.097P ~ 0.097, 


eo ae 
uX=1p=10.097=10.31"* ~ Pp 0.097 BEE wi would expect to examine 


about 10.31 
invoices to find the first 8 or 9. (b) Tech: P(X>40)=1-geometcdf(p: 0.097, x value: 


39)=1-0.9813=0.0187 
P(X > 40) = 1 — geometcdf (p:0.097,xvalue:39) = 1 — 0.9813 = 0.0187 


Because the probability of not getting an 8 or 9 before the 40th invoice is small, we 
may begin to worry that the invoice amounts are fraudulent. 

6.113 b 

6.115 d 

6.117 c 


6.119 The standard deviations from smallest to largest are: B, C, A. 


Answers to Chapter 6 Review Exercises 

R6.1 (a) P(Y=5)=0.1P (Y = 5) = 9-1, there is a 0.1 probability that a randomly 
selected patient would rate his or her pain as a5 on a scale of 1 to 5. (b) 
pcv<2)=0.3P (Y < 2) = 0-3 (6 wx=3.1; 6X=1.136HX = 3-liox = 1.136 

R6.2 (a) Temperature is a continuous random variable because it takes all values in 
an interval of numbers. (b) pD=550-550=0°C,HD = 950 — 550 = 0°C, and the 
standard deviation stays the same, oD=5.7°C7D ~ 5.7" Cc because subtracting a 
constant does not change the variability. 


_3 ane 
(c) W¥=95(550)+32=1022°rHY = gS) + 92 = 1022" F 


= (2) (5.7) = 10.26°F. 


and oY=(95) 


a 
(5.7)=10.26°F. 
R6.3 (a) 


Probability 
g 


0.0 eee 
0 1 3 120 
X = Payout for a single $1 bet 


The distribution is skewed right with a single peak at the $0 payout. The mean of 
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the distribution is $0.70 and the standard deviation is $6.58. (b) Payout averages 
$0.70 per game. If you were to play many games of 4-Spot Keno, the payout 
amounts would typically vary by about $6.58 from the mean ($0.70). (c) Y=5X; 
uw Y=5(0.70)=$3.50; oY =5(6.58)=$32.90 

Y = 5X;py = 5 (0.70) = $3.50;0y = 5 (6.58) = $32.90 (d) Let Wbe the 
amount of Marla’s payout, W=X1+X2+X3+X4+X5 

W = X; + Xo + X3 + Xy 4 Xs. uW=5(0.70)=$3.50/HW = 5 (0.70) = $3.50 


and oW=$14.719w = $14.71, 


R6.4 (a) (i)z=0.83;2 = 9-83; p(z>0.83)=1-P(Z<0.83)=0.2033 
P(Z > 0.83) = 1— P(Z < 0.83) = 0.2033 


(ii) P(C>11)=normalcdf(lower: 11, upper: 1000, mean: 10, SD:1.2)=0.2023. 

P(C > 11) = normalcdf(lower:11,upper:1000,mean:10, SD : 1.2) = 0.2023. 
There is a 0.2023 probability that a randomly selected cap has strength greater than 
11 inch-pounds. (b) It is reasonable to assume the cap strength and torque are 
independent because the machine that makes the caps and the machine that applies 
the torque are not the same. (c) C-TC ~ T is Normal with mean 10-7=3 
10-7=3 inch-pounds and standard deviation 


2 = 

0.92+1,.22=1,5V 9.9" + 1.2" = 1.5 inch-pounds. 

(a) ()z=-22 = —2; PZ<-2)=0.0228P (2 < —2) = 0.0228 

(ii) P(C-T<0)=normalcdf(lower: —1000, upper: 0, mean: 3, SD: 1.5)=0.0228 
P(C—T < 0) = normalcdf(lower: — 1000, upper : 0, mean : 3, SD: 1.5) = 0.0228 
. There is a 0.0228 probability that a randomly selected cap will break when being 
fastened by the machine. 
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R6.5 (a) Binary? “Success” = orange; “Failure” = not orange. Independent? The 
sample of size n=8n = 8 is less than 10% of the large bag, so we can assume the 
outcomes of trials are independent. Number? We are choosing a fixed sample of 
n=8n = 8 candies. Success? The probability of success remains constant at 
p=0.205P = 0.205. This is a binomial setting, so X has a binomial distribution 
with n=8n = 8 and p=0.205P = 9-205, () p(x=3)=0.1532P (X = 3) = 0.1532. 
Tech: P(X=3)=P (X = 3) = pinompaf (trials: 8, p: 0.205, x value: 3)=0.1532 

= 0.1532. There is a 15.32% probability that exactly 3 of the 8 M&M’S® will be 
orange. (c) Tech: P(X>4)=1-binomcdf(trials: 8, p: 0.205, x value: 3)=0.0610 

P(X > 4) = 1— binomedf(trials:8,p:0.205,2value:3) = 0.0610 there is a 
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6.1% probability that at least 4 of the 8 M&M’S will be orange. (d) No; it is not 
very unusual to receive 4 or more orange M&M’S in a sample of 8 M&M’S when 
20.5% of M&M’S produced are orange. This happens 6.1% of the time due to 
chance alone, so this result does not provide convincing evidence that Mars, Inc.’s 
claim is false. 


R6.6 (a) wX=8(0.205)=1.644x = 8 (0.205) = 1.64. i¢ we were to select many 


random samples of size 8, we would expect to get about 1.64 orange M&M’S, on 


-—  /Rff 9N&) (fh VOR) — 
average. (b) oX=8(0.205)(0.795)=1.147X = V8 (0.205) (0.795) = 1.14. is v0 


were to select many random samples of size 8, the number of orange M&M’S 
would typically vary from the mean (1.64) by about 1.14. 


eon ee 
R6.7 Y is a geometric random variable with p=312=0.25? a” aie 0.25. 


P(Y<3)=0.5781P (Y < 3) = 0.5781 Tech: geometcdf(p: 0.25, x value: 3)=0.5781 
geometcdf(p:0.25,xvalue:3) = 0.5781 There is a 0.5781 probability that it takes 
3 or fewer spins to get a wasabi bomb. 

R6.8 (a) Although the trials are binary (“success” = use and “failure” = does not 


(n = 200) 


use) and there are a fixed number of trials (n=200) , the residents were 


selected without replacement, violating the condition of independence. However, 


n = 200) is less than 10% of the population size 


since the sample size (n=200)| 
(all residents in a large city), T has approximately a binomial distribution with 
n=200n = 200 and p=0.34P = 9.34, (b) n=200n = 200 and p=0.34P = 9-34 so 
np=68>10"P = 68 > 10 and n(1-p)=132>10" (1 — Pp) = 182 2 10. since the 
expected number of successes and the expected number of failures are both 10 or 
more, T can be approximated by a Normal distribution. (c) We want to find 


pcr<6o)? (F< 69) 7=(200)(0.34)=68H7 = (200) (0.34) = 68 

and oT=np(1-p)=?? = VP (1 — P) = (200)(0.34)(0.66)=6.699 
(200) (0.34) (0.66) = 6.699 

(i)z=-1.19 2 = —1.19; p(z.<-1.19)=0.1170P (4 S —1.19) = 0.1170 


(ii) P(T<60)=normalcdf(lower: —1000, upper: 60, mean: 68, SD:6.699)=0.1162. 
P(T < 60) = normalcdf(lower: — 1000, upper : 60, mean : 68, SD:6.699) = 0.1162. 


There is a 0.1162 probability that at most 60 residents in the sample use public 
transportation at least once per week. 
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Answers to Chapter 6 AP® Practice Test 

T6.1 b 

T6.2 d 

T6.3 d 

T6.4e 

T6.5 d 

T6.6 b 

T6.7 Cc 

T6.8 b 

T6.9 b 

T6.10 c 

T6.11 (a) P(Y<2)=0.96P (Y < 2) = 0.96 ; there is a 96% chance that at least 10 
eggs are unbroken in a randomly selected carton of “store brand” eggs. (b) 
wY=0.38HY = 0.38 ; if we were to randomly select many cartons of eggs, we 
would expect an average of about 0.38 eggs to be broken. 

(c)oY=0.821979Y = 0.8219. if we were to randomly select many cartons of eggs, 
the number of broken eggs would typically vary from the mean (0.38) by about 
0.8219. (d) X is a geometric random variable with p=0.11? = 0.11 We are 
looking for P(x<3)P (X <3) Tech: geometcdf1p:2 ech : geometcdflp: 9 71, 
0.11, x value: pbc=0.2950*value:pbe = 0.2950. 

T6.12 (a) Binary? “Success” = greets dog first; “Failure” = does not greet dog first. 
Independent? We are sampling without replacement, but 12 is less than 10% of all 


dog owners. Number? n=12n = 12. Same probability? The probability of success 


= 0.66) . X is a binomial random variable with 


is constant for all trials (p=0.66)\P 
n=12n = 12 and p=0.66P = 9-66. (hy p(x=6)=0.1180P (X = 6) = 0.1180 (¢) 
P(xX<4)=0.0213P (X < 4) = 0.0213 Tech: binomcdf(trials: 12, p: 0.66, x value: 
4)=0.0213binomed (trials:12,p:0.66,xvalue:4) = 0.0213 pocause this 
probability is very small, it is unlikely that only 4 or fewer owners will greet their 
dogs first by chance alone. This gives convincing evidence that the claim by 
Ladies’ Home Journal is incorrect. 

T6.13 (a) Letting D=A-E, pD=50-25=25) = A — E,up = 50 — 25 = 25, 
Because the amount of time they spend on homework is independent of each other, 
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oD=11.187D = 11.18. (p) (i)z=-2.242 = —2.24, p(z<-2.24)=0.0125 
P(Z < —2.24) = 0.0125 


(ii) P(D<0)=normalcdf1lower: -1000,P (D < 0) = normaledflower: — 1000, 
upper: 0, mean: 25, SD: 11.182 = 0.0127. There is a 0.0127 probability that Ed 
spent longer on his assignment than Adelaide did on hers. 

16.14 (a) X has a binomial distribution with n=1200n = 1200 and p=0.13 

p= 0.13, 1x=1200(0.13)=1564x = 1200 (0.13) = 156 ang 6x=1200(0.13) 


(0.87)=11.64997* ~ 1200 (0.13) (0.87) = 11.6499 


10% Hispanics, there were 1200(0.10)=120!200 (0.10) = 120 Hispanics in the 
sample. We want to find P(X<120)=0.00083/ (X < 120) = 0.00083 | tech. 
binomcdf1trials: 1200, p=0.13, binomedfltrials:1200,p = 0.13, x value: 
1202=0.00083xvalue:1202 = 0.00083. If we use the Normal approximation to the 


binomial distribution, z=-3.092 = —3-09 and P(Z<-3.09)=0.001 
P(Z < —3.09) 


. (b) If the sample contains 


= 0.001 Because this probability is small, it is unlikely to select 
120 or fewer Hispanics in the sample just by chance. This gives us reason to be 
suspicious of the sampling process. 


Chapter 7 

Section 7.1 

Answers to Check Your Understanding 

page 446: 1. The population is all M&M’S® Milk Chocolate Candies produced by 
the factory in Hackettstown, NJ. The parameter is p=? ~ the proportion of all 
M&M’S Milk Chocolate Candies produced by the factory in Hackettstown, NJ, 
that are orange. The parameter is claimed to be p=0.25? — 0.25 The sample is the 
50 M&M’S Milk Chocolate Candies selected. The statistic is the proportion of the 
sample of 50 M&M’S that are orange, p\P ; 

2. 


Proportion 
=] 


0.10 
0.05 
0.00 T T T T T T 


Blue Orange Green Yellow Red Brown 


Color 


3. The graph below shows a possible distribution of sample data. For this sample, 
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there are 11 orange M&M’S, so p\=1150=0.22? => = 0.22 


‘Allnas 


Blue Orange hie Yellow Red ices 
Color 


Frequency 
nan 


4. The middle graph is the approximate sampling distribution of pp. The statistic 
measures the proportion of orange candies in samples of 50 M&M’S. Assuming 
that the company is correct, 25% of the M&M’S are orange, so the center of the 
distribution of p\P should be at approximately 0.25. The first graph shows the 
distribution of the colors for one sample, rather than the distribution of p\P from 
many samples, and the third graph is centered at 0.125, rather than 0.25. 


page 452: 1. The median does not appear to be an unbiased estimator of the 
population median. The mean of the simulated sampling distribution of the sample 
median (73.5) is not equal to the median of the population (75). 

2. Increasing the sample size from 10 to 20 will decrease the variability of the 
sampling distribution. Larger samples provide more precise estimates, because 
larger samples include more information about the population distribution. 

3. The sampling distribution of the sample median is skewed to the left and single 
peaked. 


Answers to Odd-Numbered Section 7.1 Exercises 

7.1 Population: All people who signed a card saying they intend to quit smoking. 
Parameter: p=? ~ the proportion of the population who had not smoked over the 
past 6 months. Sample: The 1000 people selected at random. Statistic: p\=P ~ the 
proportion in the sample who had not smoked over the past 6 months=0.21 

6 months = 0.21, 

7.3 Population: All dental practices in California. Parameter: The interquartile 
range of the price to fill a cavity for all dental practices in California. Sample: The 
10 randomly selected dental practices that provided the price they charge to fill a 
cavity. Statistic: 1QR=! QR = the interquartile range of the price to fill a cavity 
for the 10 selected dental practices =$74= $74, 

7.5 Population: All bottles of Arizona Iced Tea produced that day. Parameter: p= 
H = the average number of ounces per bottle in all bottles of Arizona Iced Tea 
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produced that day. Sample: The 50 bottles of tea selected at random from the day’s 
production. Statistic: x = = the average number of ounces of tea contained in 
the 50 bottles=19.690 bottles = 19.6 ounces. 


7.7 The 10 possible SRSs of size n=2% = 2 and their sample means are: 
1: Abigail(10), Bobby(5) x =7.5@ = 7.5 

: Abigail(10), Carlos(10) x =10% = 10 

: Abigail(10), DeAnna(7) x =8.5% = 8.5 
: Abigail(10), Emily(9) x =9.5% = 9.5 

: Bobby(5), Carlos(10) x =7.5% = 7.5 

: Bobby(5), DeAnna(7) x =62 = 6 

: Bobby(5), Emily(9) x =7% = 7 

: Carlos(10), DeAnna(7) x =8.5= = 8.5 
: Carlos(10), Emily(9) x =9.5% = 9.5 
10: DeAnna(7), Emily(9) x =8% = 8 
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7.9 The 10 possible SRSs of size n=2N = 2 and the proportion of females in each 
sample are: 


: Abigail(F), Bobby(M) p4=0.50P = 9.50 
: Abigail(F), Carlos(M) p=0.50? = 9.50 
: Abigail(F), DeAnna(F) p§=1P = 1 

: Abigail(F), Emily(F) p\=1? = 1 

: Bobby(M), Carlos(M) pA=0P = 9 

: Bobby(M), DeAnna(F) p\=0.50P = 0.50 
: Bobby(M), Emily(F) pA=0.50P = 9-50 

: Carlos(M), DeAnna(F) p=0.50? = 9.50 
: Carlos(M), Emily(F) p=0.50? = 9-50 
10: DeAnna(F), Emily(F) p\=1? = 1 
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p = Sample proportion of females 


7.11 (a) 
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Did homework 


(b) An example bar graph is given. 
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7.13 (a) p’=45100=0.45" ~ i00 ~ Deer «sith Ge lave dihian 060: (b) It is possible 


that 60% of the students did their homework, and the students got a p\P less than 
60% because of sampling variability. It is also possible that the sample proportion 
is less than 60% because less than 60% of all the students did their homework. (c) 
In one simulated SRS of 100 students, 73 students did their assigned homework 
last week. (d) Because there were no values of p\p less than or equal to 0.45 in the 
simulation, it would be very surprising to get a sample proportion of 0.45 or less in 
an SRS of size 100 from a population in which p=0.60? ~ 0.60. (e) Because it 
would be very surprising to get a sample proportion of p\=0.45P = 0.495 or less in 
an SRS of size 100 when p=0.60? > 0.60 we should be skeptical of the 
newspaper’s claim. 

7.15 Because 78/250 or 31.2% of the values of p\P are less than or equal to 0.57 
in the simulation, it would not be surprising to get a sample proportion of p\=0.57 
P = 0.57 or Jess in an SRS of size 100 froma population in which p=0.60 

p = 0.60 a sample proportion of p\=0.57P = 0.57 does not provide convincing 
evidence that the population proportion of students who did all their assigned 
homework is less than p\=60,P = 60, 

7.17 A sample standard deviation of 5°F is quite large compared with what we 
would expect by chance alone, because none of the 300 simulated SRSs had a 
standard deviation that large. A sample standard deviation of 5°F provides 
convincing evidence that the manufacturer’s claim is false and that the thermostat 
actually has more variability than claimed. 
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7.19 The 6 possible SRSs of size n=2N = 2 and the sample proportion for each 
sample are: 


1: Red, White p=0.5P = 9.5 
2: Red, Silver p\=0.5P = 9-5 
3: Red, Red p\=1P = 1 

4: White, Silver p\=0P = 9 
5: White, Red pA=0.5? = 9-5 
6: Silver, Red p\=0.5P = 9-5 


0 0.5 1 
p = Sample proportion of red cars 


The population proportion of red cars is p=2/4=0.5? = 2/4 =0.5 Because the 
mean of the sampling distribution is also 0.5, the sample proportion is an unbiased 
estimator of the population proportion. 

7.21 The 4 possible SRSs of size n=3” = 3 and the sample proportion for each 
sample are: 

1: Red, White, Silver p\=1/3? = 1/3 

2: Red, White, Red p\=2/3? = 2/3 

3: Red, Silver, Red p\=2/3? = 2/3 

4: White, Silver, Red p\=1/3? = 1/3 


0 0 1 

p = Sample proportion of red cars 
The variability in this sampling distribution is less than the variability in the 
sampling distribution from Exercise 19. Increasing the sample size decreases the 
variability in the sampling distribution. 
7.23 If we chose many SRSs and calculated the sample mean x © for each sample, 
the distribution of x © would be centered at the value of pp. 
7.25 (a) Statistics ii and iii are unbiased estimators because the means of their 
sampling distributions appear to be equal to the corresponding population 
parameters. (b) Statistic ii does the best job at estimating the parameter. It is 
unbiased and has very little variability. 
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7.29b 

7.31 (a) We are looking for P(z < -2.5). (i) P(z < -2.5) = 0.0062 (ii) P(z < -2.5) = 
normalcdf(lower: —1000, upper: —2.50, mean: 0, SD: 1) = 0.0062. Less than 1% of 
healthy young adults have osteoporosis. (b) X follows a Normal distribution with a 
mean of —2 and a standard deviation of 1. We want to find P(X < —2.5). (i) z= 
-0.5; P(X < -2.5) = 0.3085 (ii) P(X < -2.5) = normalcdf(lower: —1000, upper: 
—2.5, mean: —2, SD: 1) = 0.3085. About 30.85% of women aged 70-79 have 
osteoporosis. 


Section 7.2 


Answers to Check Your Understanding 


papetGee up =p=-0.7see 


2. Because 1000 < 10% of all young adult Internet users, the 10% condition has 


been met. The standard deviation of the sampling distribution of p\P is 
i /pQ-p) _ _ [0.75(0.25) 
op=p(1-p)n=0.75(0.25)1000=0.0137-” VV ™ ~V~ 1000 ~ 


SRSs of size 1000, the sample proportion of young adult Internet users that watch 


AMIS Fg 
. In 


online videos will typically vary by about 0.0137 from the true proportion of 
p=0.75P = 0.75. 

3. Because np=1000(0.75)=750210"P = 1000 (0.75) = 750 > 10 ang 
n(1-p)=1000(0.25)=250>10" (1 — p) = 1000 (0.25) = 250 > 10 the sampling 
distribution of pp is approximately Normal. 

4. If the sample size were 9000 instead of 1000, the sampling distribution would 
still be approximately Normal with mean 0.75. However, the standard deviation of 


the sampling distribution would be smaller by a factor of 3. In this case, 


/ 0.75(0.25) 
,=4/ - = 0.0046 
op=0.75(0.25)9000=0.0046”  V 9000 


Answers to Odd-Numbered Section 7.2 Exercises 


7.33 (a) up=p=0.554” = P = 9-55 @) n=500n = 500 is less than 10% of the 
population of all registered voters, so op=0.55(1—0.55)500=0.022 


[O551—0.55) 
“> 500 7 0.022 In SRSs 


of size n=5002 = 500, the sample proportion of registered voters that are 
Democrats will typically vary by about 0.022 from the true proportion of p=0.55 
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p = 0.55. (¢) Because np=500(0.55)=275>10"P = 900 (0.55) = 275 > 10 ang 
n(1-p)=500(1-0.55)=225>10"(1 — p) = 500(1 — 0.55) = 225 > 10 the 
sampling distribution of p? is approximately Normal. 


7.35 (a) pp=p=0.204» ~ ? ~ — (b) n=307 = 30 is less than 10% of the 


population of all Skittles®, so op=0.20(1-0.20)30=0.0730 


/0.20(1—0.20) 
/—__——. = ().0730 
= . In SRSs of 


ex p— V 

size n=307 = 30, the sample proportion of Skittles that are orange will typically 
vary by about 0.073 from the true proportion of p=0.20P ~ 0.20. 

(c) Because np=30(0.2)=6<10"P = 30 (0.2) =6 < 10 the sampling distribution 
of p/P is not approximately Normal. Because p=0.20P ~ 0.20 is closer to 0 than 
to 1, the sampling distribution of p\? is skewed to the right. 


7.37 A sample size of 30(4)=120°0 (4) = 120 Would reduce the standard 
deviation of the sampling distribution to half the value found in Exercise 35(b): 


op’=0.20(1-0.20)120=0.0365°7 V0 a cal 

7.39 (a) No; the 10% condition is not met here because more than 10% of the 
population (10/76=13%) (10/76 = 13%) was selected. (b) No; the Large Counts 
condition is also not met because the sample size was only n=10n = 10. Neither 
np”? nor n(1-p)” (1 — P) will be at least 10. 

7.41 (a) upA=p="» ~  ~ 0.70 (b) n=1012n = 1012 is less than 10% of the 
population of all U.S. adults, so opA=0.7(1—0.7)1012=0.0144 


[0.7(1—-0.7) 


= 0.0144 


p~ VY 1012 
(c) np=1012(0.70)=708.4210"P = 1012 (0.70) = 708.4 > 10 ang 
n(1-p)=1012(0.30)=" (1 — P) = 1012 (0.30) = 393 6>19303.6 > 10, so the 
sampling distribution of p/? is approximately Normal. (d) We want to find 
P(p\<0.67)P (P < 9-67) (j)7=-2.082 = —2.08, p(z<-2.08)=0.0188 
P(z < —2.08) = 0.0188 
(ii) P(p4<0.67)=normalcdf(lower: —1000, upper: 0.67, mean: 0.70, SD: 


0.0144)=0.0186 
P(p < 0.67) = normalcdf(lower: — 1000,upper:0.67, mean: 0.70,SD:0.0144) = 0.0186 


. There is a 0.0186 probability of obtaining a sample of size 1012 in which 67% or 
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fewer say they drink the milk. (e) Because 0.0186 is a small probability, there is 
convincing evidence against the claim that p=0.70P = 0.70__it isn’t plausible to 


get a sample proportion this small by chance alone. 


1A up\=0.70' : i ; because 267 is less than 10% of the population of 


[0.7(0.3) 


all college women, op'=0.7(0.3)267=0.0280°” = 
np="P = 


/ — = 0.0280 
V ~ 267 . Because 


267(0.7)=186.9>10267 (0-7) = 186-9 > 10 ang n(1—p)=267(0.3)=80.1210 


n(1—p) = 267 (0.3) = 80.1> 10 ine sampling distribution of p/? can be 
approximated by a Normal distribution. We want to find P(pA>0.75)/ (p > 0.75), 
(i)z=1.792 = 1.79: P(z>1.79)=0.0367P (2 2 1.79) = 0.0367 


(ii) P(p4=0.75)=normalcdf(lower: 0.75, upper: 1000, mean: 0.7, SD: 


0.02802=0.0371. 
P (p > 0.75) = normalcdf(lower: 0.75,upper: 1000, mean: 0.7, SD:0.02802 = 0.0371. 


There is a 0.0371 probability that 75% or more of the women in the sample have 
been on a diet within the last 12 months. 


7.45 (a) upA=0.90' a oan). because 100 is less than 10% of the population 


of orders (100/5000=2%)(100/5000 = 27%) | G4n=0,90(0.10)100=0.03 
__ [0.90(0.10) 
=e . Because 


np=100(0.90)=90>10"P = 100 (0.90) = 90 > 10 ang n(1—p)=100(0.10)=10>10 
n(1—p) = 100(0.10) = 10 > 10 the sampling distribution of p? can be 


approximated by a Normal distribution. We want to find P(pA<0.86)/ (p< 0.86). 
(i)z=-1.332 = —1.33, pz<-1,33)=0.0918? (2 S$ —1-33) = 0.0918 Gi 
P(p4<0.86)=normalcdf(lower: —1000, 

P(p < 0.86) = normalcdf(lower: — 1000, upper: 0.86, mean: 0.90, SD: 


0.03)=0.0912.9-99,5D:0.03) = 0.0912. There is a 0.0912 probability that 86% or 
fewer of orders in an SRS of 100 were shipped within 3 working days. (b) No; it 
isn’t unusual to get a sample proportion of 0.86 or smaller when selecting an SRS 
of 100 from a population in which p=0.90P = 0.90. Thus, it is plausible that the 
90% claim is correct and that the lower than expected percentage is due to chance 
alone. 

7.47 a 
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7.49 b 
7.51 (a) 


Share music from computer? 


(b) 62/100=0,62=62%52/100 = 0.62 = 62% oF Internet users neither download 


nor share music files. (c) 12/29=0.414=41.4%12/29 = 0.414 = 41.4% oF Internet 
users who download music files online also share music files. 


Section 7.3 
Answers to Check Your Understanding 
page 474: 1. We want to find P(X>270)P (X > 270) (jy 7=0.252 = 0.25: 
P(z>0.25)=P (2 > 9.25) = 1-9.5987=0.40131 — 0.5987 = 0.4013 (iy 
P(X>270)=normalcdf(lower: 270P(X > 270) = normalcdf(lower: 270) upper: 
1000, mean: 266UPPer: 1000, mean: 266, sp; 16)=0.4013.9):16) = 0.4013. 
There is a 0.4013 probability of selecting a woman whose pregnancy lasts for more 
than 270 days. 
2. days px =p=266Hz = H = 266 
3. The sample of size 6 is less than 10% of all pregnant women. Therefore, the 
standard deviation of the sampling distribution of x © is 

o 16 


i 


- o — 532 
ox =on=166=6.532 V v6 days. In SRSs of size 6, the sample 
mean 


length of human pregnancies from conception to birth will typically vary by about 
6.532 days from the true mean of 266 days. 

4. The mean length of pregnancy follows a Normal distribution with a mean of 266 
and a standard deviation of 6.532 and we want to find P(x >270)P (z > 270) (i) 
2=0.612 = 0.61; p(z>0.61)=P (2 > 9-61) = 1~-9.7291=0.2709 

1 — 0.7291 = 0.2709 (ii) P(x >270)=normalcdf(lower: 270, upper: 1000, mean: 
266, SD: 6.532)=0.2701 
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P (x > 270) = normalcdf(lower: 270, upper: 1000, mean: 266, SD: 6.532) = 0.2701 


. There is a 0.2701 probability of selecting a sample of 6 women whose mean 
pregnancy length exceeds 270 days. 


Answers to Odd-Numbered Section 7.3 Exercises 
7.53 (a) px =p=225Hx = H = 225 seconds (b) Because the sample size (10) is 
less than 10% of the population of songs on David’s iPod, 

c — © _ 18.974 


= i . 
ox =on=6010=18.974 vn v10 seconds. In SRSs of size 10, the 
sample 


mean play time will typically vary by about 18.974 seconds from the true mean of 
225 seconds. 
7.55 10=60n | 10n=60 — n=6 > n=36 


10 = 2 > 10/n = 60 > n= 6 > n = 36 
vn . A sample size 


of 36 songs will produce a sampling distribution with a standard deviation of 10 
seconds. 
7.57 (a) We want to find P(X<295)P (X < 295) (jy z=-12 = —1, pz< 


(ii) P1X<295)=normalcdf(lower: -1000, upper: 295, mean: 
P 1X < 295) = normalcdf(lower: — 1000,upper:295,mean: 


298, SD: 3)=0.1587.298,5D: 3) = 0.1587. There is a 0.1587 probability that a 
randomly selected bottle contains less than 295 ml. (b) Because X has a Normal 
distribution, the sampling distribution of x © has a Normal distribution. 

ux =p=298 mlz = # = 298ml. because 6 is less than 10% 


_ op = 4 = % = 1.2247 
of all bottles produced, ox =on=36=1.2247 vn v6 m 


want to 


1. We 


find P(x <295)P (© < 295) (j) z=-2.452 = —2.45; p(z<-2.45)=0.0071 
P(Z < —2.45) = 0.0071 


(ii) P(x <295)=normalcdf(lower: —1000, upper: 295, mean: 298 
P(& < 295) = normalcdf(lower: — 1000,upper:295, mean:298 cp. 


1.2247)=0.0072,9D:1.2247) = 0.0072. There is a 0.0072 probability that the 
mean contents of 6 randomly selected bottles is less than 295 ml. 
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7.59 (a) The sampling distribution of x @ is Normal with px =y=188 mg/dl 


Hz = # = 188mg/dl Because the sample size (100) is less than 10% of all men 


_ 34,05 = % = -L =4.1meg/dl 
aged 20 to 34, ox =on=41100=4.1 mg/dl ue v 100 : 


(b) We want to find P(185<x~<191)P (185 < % < 191) (jy 7=-9.732 = —0.73 
and z=0.732 = 0.73; P(-0.73<z<0.73)=0.7673- 
P(—0.73 < z < 0.73) = 0.7673— 9 9397=9 53460.2327 = 0.5346 
(ii) P(485<x < 191)=normalcdf(lower: 185, upper: 191, mean: 
P(185 < © < 191) = normalcdf(lower:185,upper: 191, mean: 188, SD: 
4.1)=0.5357,188,SD:4.1) = 0.5357. There is a 0.5357 probability that x= 
estimates p/ within +3 mg/dl+3™8/4!. (¢) ox~=on=411000= 

a 41 


CO, = = | a 
as Jn 1000 


We want to find P(185<x~<191)P (185 < % < 191) (jy) 7=-9 312 = —2.31 and 
7=2.312 = 2.31; P(-2.31<2<2.31)=0.9896-P (—2.31 < z < 2.31) = 0.9896— 
0.0104=0.97920.0104 = 0.9792 

(ii) P(185<x <191)=normalcdf(lower: 185, upper: 191, mean: 

P(185 < Z < 191) = normalcdf(lower:185,upper:191, mean: 

188, SD: 1.30)=0.9790.188,5D:1.30) = 0.9790. There is a 0.9790 probability that 


x= estimates p/4 within +3 mg/dl~3 ™8/4l. The larger sample is better because 
it is more likely to produce a sample mean within 3 mg/dl of the population mean. 


7.61 (a) The sampling distribution of x £ is Normal with ux =p=484z2 ~ h = 48 
months. Because the sample size (8) is less than 10% of all batteries, 

7 o, =“ = © = 2.899 _ 
ox =on=8.28=2.899 vn vs months. We want to find P(x < 


42.2)P (# < 42.2), 


(i) z=-2.002 = —2.00: piz<-2,00)=0.0228P (2 S —2.00) = 0.0228 (i) 

P(x <42.2)=normalcdf! (Z < 42.2) = normalcdf 

(lower: —1000, upper: 42.2, mean: 48, SD: 2.899)=0.0227. 

(lower: — 1000,upper:42.2, mean: 48,SD:2.899) = 0.0227. There is a 0.0227 
probability that the sample mean lifetime is 42.2 months or less if the lifetime 
distribution is unchanged. (b) Because this probability is very small, there is 
convincing evidence that the company is overstating the average lifetime of its 
batteries. It is not plausible to get a sample mean this small by chance alone. 
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7.63 (a) Because n=5<30n = 5 < 30, the sampling distribution of x % will also 
be skewed to the right, but not quite as strongly as the population. (b) Because 
n=100>30" = 100 > 30) the sampling distribution of x © is approximately 
Normal by the central limit theorem. 

7.65 (a) Because the sample size is large (n=1000$30)(” = 1000830) , the 
sampling distribution of x © is approximately Normal. px =y1=$250 

Hz = H = $250. Assuming 1000 is less than 10% of all homeowners with fire 
insurance, 


_ op = = = 158.114 
ox =on=50001000=158.114 vn 1000 . We want to find 


(i) z=0.322 = 0.32; P(z>0.32)=1-0.6255=0.3745 
normalcdf(lower: 300, upper: 1000, mean: 250 
normalcdf(lower:300,upper:1000, mean : 250 SD: 158.114) = 0.3759. There 


is a 0.3759 probability that the mean annual loss from a sample of 1000 policies is 
greater than $300. (b) (i) Look in the body of Table A for the value closest to 0.90, 


__ z—250 
which is z=1.282 = 1.28. Solving 1.28=x-250158.114''7® ~ 158.114 gives 


x=$452,.39@ = $452.39, 


(ii) invNorm(area: 0.90, mean: 250, SD: 158.114)=$452.63. 
invNorm (area:0.90,mean:250,SD:158.114) = $452.63. r¢ the 


company wants to be 90% certain that the mean loss from fire in an SRS of 1000 
homeowners is less than the amount they charge for the policy, they should charge 
$452.63. 


7.67 (a) It is more likely to randomly select 1 square yard of material and find 2 or 
more flaws than to randomly select 50 square yards of material and find an average 
of 2 or more flaws. There is more variability from the mean of 1.6 in the number of 
flaws found in individual square yards of material than in the average number of 
flaws found in a sample of 50 square yards of material. (b) You cannot use a 
Normal distribution to calculate the probability of the first event in part (a) because 
the population distribution is not Normal and the sample size is not at least 30. (c) 
Because the sample size is large (n=50>30)(" = pus 30) the sampling 
distribution of x 2 is approximately Normal with px =p=1.642 = # = 1.6, 
Because 50 is less than 10% of all square yards of this carpet, 
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= 2 = +2 — 0.1697 


n 


= Te . 
ox =on=1.250=0.1697 ~  v 50 _ We want to 


find P(x >2)P (® 2 2). (i) 2=2.362 = 2.36; P(z>2.36)=1-0.9909=0.0091 
P(z > 2.36) = 1 — 0.9909 = 0.0091 


(ii) P(x >2)=normalcdf(lower: 2, upper: 1000, mean: 1.6, SD: 
P17$22 = normalcdf(lower:2,upper:1000,mean:1.6, SD: 


0.16972=0.0092.0.16972 = 0.0092. There is a 0.0092 probability of finding that 
the mean number of flaws is 2 or more in a random sample of 50 square yards of 
carpet. 

7.69 No; the histogram of the sample values will look like the population 
distribution, whatever it happens to be. The central limit theorem says that the 
histogram of the sampling distribution of the sample mean will look more and 
more Normal as the sample size increases. 

(n = 30 > 30) 
= p = 190 


7.71 Because the sample size is large (n=30230) , the distribution of 


x & is approximately Normal with px =y=190H pounds. Because 
n=30n = 30 is less than 10% of all possible passengers, 
a ee 


_ ~- .3901 
ox =on=3530=6.3901 el pounds. We want to find 
(i) 2=1.562 = 1.56; P(z>1.56)=1-0.9406=0.0594 
P(z > 1.56) = 1 — 0.9406 = 0.0594 
(ii) P(x >200)=normalcdf(lower: 200, upper: 1000, mean: 190, SD: 
6.3901)=0.0588. 
P(z > 200) = normalcdf(lower:200,upper:1000,mean : 190, SD:6.3901) = 0.0588. 
There is a 0.0588 probability that the mean weight exceeds 200 pounds (and the 
total weight exceeds 6000 pounds). 


7.73 b 

7:/9C 

7.77 The unemployment rates for each level of education are: 
P(unemployed|didn’'t finish HS)=106212,470=0.0852 


P (unemployed|didn tt finish HS) = a = 0.0852 


P(unemployed|HS but no college)=197737,834=0.0523 


P (unemployed|HS but no college) = ~ 7 = 0.0523 
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P(unemployed|less than bachelor’s degree)=146234,439=0.0425 


P (unemployed|less than bachelor's degree) = aS = 0.0425 


P(unemployed|college graduate)=109740,390=0.0272 


P (unemployed|college graduate) = aT = 0.0272 


There is an association between unemployment rate and education. The 
unemployment rate decreases with additional education. 


Answers to Chapter 7 Review Exercises 

R7.1 The population is the set of all eggs shipped in one day. The parameter is p= 
P ~ the proportion of eggs shipped that day that had salmonella. The sample 
consists of the 200 eggs examined. The statistic is pA=P = the proportion of eggs 
in the sample that had 

salmonella =9/200=0.045~ 9/200 = 0.045 


R7.2 (a) The 10 possible SRSs of size n=3" = 3 and the median score of each 
sample are: 


: 64, 66, 71 median=66median = 66 
: 64, 66, 73 median=66median = 66 
: 64, 66, 76 median=66median = 66 
64, 71, 73 median=71median = 71 
: 64, 71, 76 median=71median = 71 
64, 73, 76 median=73median = 73 
: 66, 71, 73 median=71median = 71 
: 66, 71, 76 median=71median = 71 
: 66, 73, 76 median=73median = 73 
10: 71, 73, 76 median=73median = 73 


OMAN DU KR WN 


oe Se ae OE a) Se Pe fi 
6 67 6& 69 7 71 2 73 
Sample median number of pages 


(b) If the sample size was increased to n=4” = 4, the variability of the sampling 
distribution of the sample median would decrease. (c) The 5 possible SRSs of size 
n=4n = 4 and the median number of pages of each sample are: 


1: 64, 66, 71, 73 median=68.5median = 68.5 
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2: 64, 66, 71, 76 median=68.5median = 68.5 
3: 64, 66, 73, 76 median=69.5median = 69.5 
4: 64, 71, 73, 76 median=72median = 72 
5: 66, 71, 73, 76 median=72median = 72 


. . 
7 Es a LO 
6 67 #6 6 7 71 72 73 
Sample median number of pages 


This sampling distribution does support the answer to part (b). The variability of 
the sampling distribution with n=4n = 4 is less than the variability of the sampling 
distribution with n=3n = 3, 

R7.3 (a) 


T T T T T 
2135 2646 «43157 3668 «4179 4690 5201 
Birth weight (g) 


(b) Answers will vary. An example dotplot is given. The range for this sample is 
4375-2790=15854375 — 2790 = 1585, 


ho hy ie a 
2600 3000 3400 3800 4200 4600 


Birth weight (g) 
(c) The dot at 2800 represents one SRS of size n=5" = 5 from this population 
where the sample range was 2800 grams. (d) The sample range is not an unbiased 
estimator of the population range. If it were unbiased, then the sampling 
distribution of the sample range would have 3417 (the population range) as its 
mean. 


R7.4 (a) upA=p=0.15' a aaa (b) Because the sample size of n=1540 
nm = 1540 is less than 10% of the population of all adults, the standard deviation 


of the sampling distribution of p\P is opA=0.15(0.85)1540=0.0091 
/0.15(0.85) 


= + = 0.0091 


Pp 1540 


(c) Because np=1540(0.15)=231>10"P = 1540 (0.15) = 231>10 and nq - py = 
1540(0.85) = 13092101309> 10 the sampling distribution of p/P is 
approximately Normal. (d) We want to find P(0.13 <p<0.17) 

P(0.13 <p £0.17) jy z=-2,202 = —2.20 and 2=2.202 = 2.20; 
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P(-2.20<z<2.20)=0.9861-P (—2.20 < z < 2.20) = 0.9861— 9 9139=0.9722 
0.0139 = 0.9722 (ii) P(O.13<p/<0.17)=normalcdf(lower: 0.13, upper: 0.17, 
P(0.13 < p < 0.17) = normalcdf(lower:0.13, upper:0.17, jean: 0.15, SD: 
0.0091) = 0.9720. There is a 0.9720 probability of obtaining a sample in which 
between 13% and 17% are joggers. 

R7.5 (a) We have an SRS of size 100 drawn from a population in which the 
proportion who get a red light is p=0.30? ~ 0.30 assuming the agents’ claim is 


true. Thus, upA=p=0.30' ‘p ~ P= aaa Because 100 is less than 
10% of the population of travelers, op\=0.30(0.70)100=0.0458 


/ 0.30(0.70) 
= \/ 100 = 0.0458 


Because np=100(0.30)=30>10"P = 100 (0.30) = 30 > 10 ang 
n(1-p)=100(0.70)=70>10" (1 — p) = 100 (0.70) = 70 > 10 the sampling 
distribution of p\P can be approximated by a Normal distribution. We want to find 
P(p<0.20)P (P < 9.20) (jy 7=-2.192 = —2.18; p(z<-2.18)=0.0146 

P(z < —2.18) = 0.0146 (ii) P(p4<0.20)=normalcdf (lower: 

P(p < 0.20) = normalcdf(lower: 1000, upper: 0.20, mean: 
~ 1000, upper:0.20, mean: 9.39, SD: 0.0458)=0.0145. 

0.30,SD:0.0458) = 0.0145. There is a 0.0145 probability that 20% or fewer of 
the travelers get a red light. 

(b) Because this is a small probability, there is convincing evidence against the 
agents’ claim—it isn’t plausible to get a sample proportion of travelers with a red 
light this small by chance alone. 

R7.6 (a) We want to find P(Xx>105)P (X = 105) | (jy 7=0,332 = 0.33; 


> 


(ii) P(X=105)=normalcdf(lower: 105, upper:1000, mean: 100, SD: 15)=0.3694. 
P(X > 105) = normalcdf(lower:105, upper:1000, mean : 100, SD:15) = 0.3694. 
There is a 0.3694 probability of selecting an individual with a WAIS score of at 
least 105. 

(b) The mean of the sampling distribution of x @ is px =p=100Hz = # = 100. 
Because the sample of size 60 is less than 10% of all adults, the standard deviation 
of the sampling distribution of x % is ox =?z ~ 


2 = 4% = 1,9365 
on=1560=1.9365 v” v 60 . The mean WAIS score for a random 
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sample 

of 60 adults will typically vary from the mean (100) by about 1.9365 points. (c) 
We want to find P(x 2105)” (© 2 105). (jy 7-2. 582 = 2.58; 
P(z>2.58)=1-0.9951=0.0049P (2 2 2.58) = 1 — 0.9951 = 0.0049 


(ii) P(x >105)=normalcdf(lower: 105, upper: 1000, mean: 100, SD: 
1.9365)=0.0049. 

P(z > 105) = normalcdf(lower:105,upper:1000,mean: 100, $D:1.9365) = 0.0049. 
There is a 0.0049 probability of selecting a sample of 60 adults whose mean WAIS 
score is at least 105. (d) The answer to part (a) could be quite different depending 
on the shape of the population distribution. The answer to part (b) would be the 
same because the mean and standard deviation do not depend on the shape of the 
population distribution. Because of the large sample size (60>30)(60 = 30) the 
answer we gave for part (c) would still be fairly reliable due to the central limit 
theorem. 


R7.7 (a) Because the sample size is large (n=50>30)(" = 50 > 30) 


, the central 
limit theorem says that the distribution of x % will be approximately Normal. (b) 
ux =p=0.5Hz = # = 9-5; because 50 is less than 10% of all 


--— 2 — 27 _ 99990 
‘50 


£ in / 
Vv n yo 


traps, ox =on=0.750=0.0990- . We want to find 

p(x >0.6)P (% 2 0.6). 

(i) z=1.012 = 1.01; P(z>1.01)=1-0.8438=0.1562 

P (z > 1.01) = 1 — 0.8438 = 0.1562 (ii) P(x >0.6)=normalcdf(lower: 0.6, 
upper: 1000, mean: 0.5, SD: 0.0990)=0.1562. 

P(z > 0.6) = normalcdf(lower:0.6,upper:1000,mean: 0.5, SD:0.0990) = 0.1562. 


There is a 0.1562 probability that the mean number of moths is greater than or 
equal to 0.6. (c) No; because this probability is not small, it is plausible that the 
sample mean number of moths is this high by chance alone. There is not 
convincing evidence that the moth population is getting larger in their state. 


Answers to Chapter 7 AP® Practice Test 
7 
17,2. 
T73€ 
T7.4a 
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T7.5d 

T7.6d 

T7.7b 

T7.8e 

T7.9 b 

T7.10e 

T7.11 Sample statistic A provides the best estimate of the parameter. Both statistics 
A and B appear to be unbiased, while statistic C appears to be biased because the 
center of its sampling distribution is smaller than the value of the parameter. In 
addition, statistic A has less variability than statistic B. In this situation, we want 
low bias and small variability, so statistic A is the best choice. 

T7.12 (a) The probability that a single household pays more than $55 cannot be 
calculated, because we do not know the shape of the population distribution of 


monthly fees. (b) px =p=$50Hz2 ~ # = $50. because the sample of size 50 is less 
than 10% of all households with 


=e ee AE $9 598 


x 


2 o ~ 
Internet access, ox =on=2050=$2.828 vn vo0 . (c) Because the 
sample 


n= 50230) the distribution of x will be 


size is large (n=50>30)! 
approximately Normal. (d) We want to find P(x >55)P (Z > 55), (i) z=1.77 

z = 1.77: p@>1.77)=1-0.9616=0,0384P (z > 1.77) = 1 — 0.9616 = 0.0384 |i) 
P(x >55)=normalcdf!” (% > 55) = normalcdf (lower: 55, upper: 1000, mean: 50 
(lower:55,upper:1000,mean: 50 SD: 2.828)=0.0385.2-828) = 0.0385. There is 
a 0.0385 probability that the mean monthly fee paid by the sample of 50 
households exceeds $55. 


T7.13 (a) upA=p=0.22" > a a saaral because 300 is less than 10% of children 


/0.22(0-78) 


under the age of 6, op'=0.22(0.78)300=0.0239°” 
Because np = 

np=300(0.22)=66"P = 300 (0.22) = 66 ang n(1-p)=300(0.78)=234 
n(1— p) = 300 (0.78) = 234 o16 both at least 10, the sampling distribution of p/ 
P can be approximated by a Normal distribution. We want to find P(p4>0.29) 


P(p > 0.29) (j) 7=9.932 = 2.93; P(z>2.93)=1-0.9983=0.0017 
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P(z > 2.93) = 1 — 0.9983 = 0.0017 (ii) P(p4>0.29)=normalcdf(lower: 0.29, 
upper: 1000, mean: 0.22, SD: 0.0239)=0.0017. 
P(p > 0.29) = normalcdf(lower:0.29,upper:1000,mean: 0.22, SD:0.0239) = 0.0017. 


There is a 0.0017 probability that more than 29% of the sample are from poverty- 
level households. (b) Because it is unlikely to get a sample proportion of 29% or 
greater by chance alone, there is convincing evidence that the percentage of 
children under the age of 6 living in households with incomes less than the official 
poverty level in this state is greater than the national value of 22%. 


Answers to Cumulative AP® Practice Test 2 
AP2.1 a 
AP2.2d 
AP2.3 e 
AP2.4b 
AP2.5c¢ 
AP2.6d 
AP2.7 c 
AP2.8 a 
AP2.9 d 
AP2.10c¢ 
AP2.11b 
AP2.12 c 
AP2.13 a 
AP2.14 ¢ 
AP2.15d 
AP2.16 ¢ 
AP2.17 e 
AP2.18 a 
AP2.19 c 
AP2.20b 
AP2.21a 


AP2.22 (a) This is an observational study. Subjects were not assigned to take (or 
not take) fish oil. (b) Two variables are confounded when their effects on the 
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cholesterol level cannot be distinguished from one another. For example, people 
who take omega-3 fish oil might also be more health conscious in general and do 
other things such as eat more healthfully or exercise more. If eating more 
healthfully or exercising more lowers cholesterol, researchers would not know 
whether it was the omega-3 fish oil or the more healthy food consumption or 
exercise that lowered cholesterol. (c) No; this wasn’t an experiment and taking fish 
oil is possibly confounded with other good habits, such as healthful eating and 
exercise. 


AP2.23 (a) y\=25.66-0.003131x¥ = 25.66 — 0.0031312 Where yAY is the 
predicted number of wins and x is the number of yards allowed. (b) yA=4¥ > 


25.66-0.003131(4668)=11.0429-66 — 0.003131 (4668) = 11.04 Ving The 
residual=10-residual = 10— 41 94=-1,0411.04 = —1.04 wins. The actual 
number of Seattle Seahawk wins was 1.04 less than the number of wins predicted 
by the regression line with x=46682 = 4668 yards allowed. (c) Because the 
Carolina Panthers allowed fewer yards than average and also had more wins than 
average, this point will increase the steepness of the negative slope of the least- 
squares regression line (make it more negative) and increase the y intercept of the 
least-squares regression line. 

AP2.24 (a) The distribution of seed mass for the cicada plants is roughly 
symmetric, whereas that for the control plants is skewed to the left. Neither group 
had any outliers. The median seed mass is the same for both groups (median=0.25) 
(median = 0.25) The cicada plants had a larger range in seed mass, but the 
control plants had a larger IQR. (b) The distribution of seed mass for the cicada 
plants is roughly symmetric, which suggests that the mean should be about the 
same as the median. However, the distribution of seed mass for the control plants is 
skewed to the left, which will pull the mean of this distribution below its median 
toward the smaller values. Because the medians of both distributions are equal, the 
mean for the cicada plants is likely greater than the mean for the control plants. (c) 
The purpose of the random assignment is to create two groups of plants that are 
roughly equivalent at the beginning of the experiment. (d) A benefit of using only 
American bellflowers is that the researchers may then control a source of 
variability. Different types of flowers will have different seed masses, making the 
response more variable if other types of plants were used. A drawback to only 
using American bellflowers is that we can’t make inferences about the effect of 
cicadas on other types of plants, because other plants may respond differently to 
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cicadas. 


AP2.25 (a) Because the sample size is large (n=50>30)(" = US 30) the 
distribution of x 2 is approximately Normal with px =p=5254z = # = 925 
pages. Because n=50" = 50 is less than 10% of all novels in the library, 


a 200 
_ Os = — = — = 28.28 . 
ox =on=20050=28.28 vn vo0 pages. We want to find 


P(x-<500)P (# < 500), 

(i) z=20.882 = 20.88; p(z<-0.88)=0.1894P (4 < —0.88) = 0.1894 () 

P(x <500)=P (z < 500) = normalcdf(lower: —1000, upper: 500, mean: 
normalcdf(lower: — 1000,upper:500, mean: 525, SD: 28,28)=5D:28.28) = 
0.18830.1883,. There is a 0.1883 probability that the average number of pages in 
the sample is less than 500. (b) X is a binomial random variable with n=50n = 50 
and p=0.30P = 9-30, We want to find P(X>20)P (X = 20), 
P(X=20)=1-binomcdf(trials: 50, p : 0.30, x value: 19)=1-0.9152=0.0848 

P(X > 20) = 1 — binomcdf(trials:50, p : 0.30, rvalue: 19) = 1 — 0.9152 = 0.0848 
. There is a 0.0848 probability of selecting at least 20 novels that have fewer than 
400 pages. 


Chapter 8 

Section 8.1 

Answers to Check Your Understanding 

page 502: 1. We are 95% confident that the interval from 0.175 to 0.225 captures 
the true proportion of all U.S. adults who would answer the question correctly. 

2. If we were to select many random samples of U.S. adults and construct a 95% 
confidence interval using each sample, about 95% of the intervals would capture 


the true proportion of all U.S. adults who would answer the question correctly. 


0.175+0.225 _ 9 99 
3. The point estimate is 0.175+0.2252=0.20 2 ~~", The margin of 


error 
is 0.225 — 0.20 = 0.025. 

4. All of the plausible values in the 95% confidence interval are less than the 
proportion expected if people were to simply guess from the four choices at 
random. Therefore, this interval does provide convincing evidence that less than 
25% of all U.S. adults would answer this question correctly. 
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Answers to Odd-Numbered Section 8.1 Exercises 
8.1 Sample mean, x =30.35X = 30.35 


p= = =0.72 
8.3 Sample proportion, p\=3650=0.72 50 


8.5 (a) We are 95% confident that the interval from 0.63 to 0.69 captures the true 
proportion of all U.S. adults who favor an amendment to the Constitution that 


would permit organized prayer in public schools. (b) The point estimate is 


~ _ 0.6340.69 _ 
pA=0.63+0.692=0.66.? = 5 = 0.66. The 


margin of error = 0.69 — 0.66 = 0.03. (c) Because the value 2/3 = 0.667 (and values 
less than 2/3) are in the interval, it is plausible that two-thirds or less of the 


population favor such an amendment. Thus, there is not convincing evidence that 
more than two-thirds of U.S. adults favor such an amendment. 


8.7 (a) Because 12 is one of the plausible values in the 95% confidence interval, 
there is not convincing evidence that the true mean volume is different from 12 
ounces. (b) No; although 12 is a plausible value for the true mean volume of all 
cans of diet cola, there are many other plausible values in the confidence interval. 
Because any of these values could be the true mean, there is not convincing 
evidence that the true mean volume is 12 ounces. 

8.9 (a) We are 95% confident that the interval from 10.9 to 26.5 captures the true 
difference (Girls - Boys) in the mean number of pairs of shoes owned by all girls 
and boys at this school. (b) Yes; because the 95% confidence interval does not 
include 0 as a plausible value for the difference in means, there is convincing 
evidence of a difference in the mean number of shoes for boys and girls. 


8.11 If we were to select many random samples of U.S. adults and construct a 95% 
confidence interval using each sample, about 95% of the intervals would capture 
the true proportion of all U.S. adults who would favor an amendment to the 
Constitution that would permit organized prayer in public schools. 

8.13 (a) The confidence interval is $51,492 — $431 = $51,061 to $51,492 + $431 = 
$51,923. We are 90% confident that the interval from $51,061 to $51,923 captures 
the true median household income for all households in Arizona in 2015. (b) If we 
were to select many random samples of Arizona households and construct a 90% 
confidence interval using each sample, about 90% of the intervals would capture 
the true median household income for all households in Arizona in 2015. 

8.15 84% of the intervals did contain the true parameter, which suggests that these 
were 80% or 90% confidence intervals. 
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8.17 (a) Incorrect; the interval provides plausible values for the mean BMI of all 
women, not plausible values for individual BMI measurements, which will be 
much more variable. (b) Incorrect; we shouldn’t use the results of one sample to 
predict the results for future samples. (c) Correct; a confidence interval provides an 
interval of plausible values for a parameter. (d) Incorrect; the population mean 
always stays the same, regardless of the number of samples taken. (e) Incorrect; we 
are 95% confident that the population mean is between 26.2 and 27.4, but that does 
not absolutely rule out any other possibility. 


8.19 (a) The length of the interval would increase. (b) The length of the interval 
would decrease. (c) One of the practical difficulties would include non-response 
(those who are not available to respond or those who refuse to answer). For 
example, if people who were selected but do not respond have different opinions 
than the people who did respond, the estimated proportion may be off by much 
more than 3 percentage points. 

8.21 (a) If we were to select many random samples of California adults and 
construct a 90% confidence interval using each sample, about 90% of the intervals 
would capture the true average travel time to work for all employed California 
adults. (b) Decrease the confidence level; drawback: we can’t be as confident that 
our interval will capture the true proportion. Increase the sample size; drawback: 
larger samples cost more time and money to obtain. (c) People who have longer 
travel times to work might have less time to respond to a survey. This would cause 
our estimate from the sample to be less than the true mean travel time to work. The 
bias due to nonresponse is not accounted for by the margin of error, because the 
margin of error accounts for only variability we expect from random sampling. 


8.23 b 
S.29€e 


8.27 (a) Observational study; pregnant women and children were not assigned to 
different amounts of exposure to magnetic fields. (b) No; we cannot make any 
conclusions about cause and effect because this was not an experiment. We can 
only conclude that there isn’t convincing evidence that living near power lines is 
related to whether children develop cancer. 


Section 8.2 
Answers to Check Your Understanding 


page 516: 1. p = true proportion of all U.S. adults who “often or always” got 
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enough sleep during the last 7 nights. 
2. The Random condition is met because the statement says that the adults were 
chosen randomly. The 10% condition is met because the sample size n = 1029 is 
less than 10% of all U.S. adults. The Large Counts condition is met because npAnp 
= (1029)(0.48) = 493.92 > 10 and n(1-p’)” (1—p)- (1029)(1 — 0.48) = 535.08 = 
10. 

1—0.99 


3. 1-0.992=0.005 2 ~ 9-999. using Table A, the closest area is 0.0051 (or 


0.0049), corresponding to a critical value of z* = 2.57 (or 2.58). Tech: 
invNorm(area: 0.005, mean: 0, SD: 1) = -2.576, so z* = 2.576, and 
0.48+2.5760.48(1—-0.48)1029=0.48+0.04=(0.44,0.52) 


0.48 + 2.5764/-——" = 0.48 + 0.04 = (0.44, 0.52) 


4. We are 99% confident that the interval from 0.44 to 0.52 captures p = the true 
proportion of all U.S. adults who would report that they “often or always” got 
enough sleep during the last 7 nights. 


/ 0.80(0.20) 


page 520: 1. Solving 1.960.80(0.20)n<0.03""9°V < 0.03 


n > (0.80) (0.20) (428): = 682.95. 


0.03 


for n gives 
n>(0.80)(0.20)(1.960.03)2 =682.95. hic means 
that we should select a sample of at least 683 customers. This sample size is less 


-95 p = 0.5. 
than that determined in the example using P=0-5-” 


2. If the company president demands 99% confidence instead, the required sample 
size will be larger because the critical value is larger for 99% confidence (2.576) 
versus 95% confidence (1.96). The company would need to select at least 1180 
customers to have 99% confidence. 


Answers to Odd-Numbered Section 8.2 Exercises 

8.29 Random: Met because Latoya selected an SRS of students. 10%: Not met 
because the sample size (50) is more than 10% of the population of seniors in the 
dormitory (175). Large Counts: Met because npA=14>10"™ = 14> 10 and 
n(1-p’) =362107 (1 —P) = 36 2 10, 

8.31 Random: Met because the inspector chose an SRS of bags. 10%: Met because 
the sample of 25 is less than 10% of the thousands of bags filled in an hour. Large 
Counts: Not met because there were only 3 successes (bags with too much salt), 
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which is less than 10. 

8.33 (a) It is necessary to ensure that the observations are close to independent. If 
they aren’t, the formula for the standard error of p\P won’t accurately estimate the 
standard deviation of the sampling distribution of prp. (b) The confidence interval 
will capture the population parameter more often than the specified confidence 


level. 
1—0.98 


8.35 (a) 1-0.982=0.01 2 = 0.01. using Table A, the closest area is 0.0099, 


corresponding to a critical value of z*=2.332* = 2.33, Tech: invNorm(area: 0.01, 


ee! 
p= 24 = 0.1996 


mean: 0, SD: 1) = -2.326, so z* = 2.326. (b) P*=9144979=0.1996 
. 0.1996+2.3260.1996(1—-0.1996)4579=0.1996+0.0137=(0.1859, 0.2133) 


/0.1996(1—0.1996) 


0.1996 + 2.3264/ an7e = 0.1996 + 0.0137 = (0.1859, 0.2133) 


(c) We are 95% confident that the interval from 0.1859 to 0.2133 captures p = the 
true proportion of American adults who have earned money by selling something 
online in the previous year. 

8.37 The standard error of p\=0.1996(1—0.1996)4579=0.0059 


m / 0.1996(1—0.1996) 

= 4/ ——-.—- = 0.0059 
rare 4579 . In repeated SRSs of size 4579, the sample 
proportion of American adults who have earned money by selling something 
online in the previous year typically varies from the population proportion by about 


0.0059. 

8.39 (a) Population: All seniors at Tonya’s high school. Parameter: The true 
proportion of all seniors who plan to attend the prom. (b) Random: The sample is a 
simple random sample. 10%: n = 50 is less than 10% of the population size. Large 
Counts: npA=36210"P = 36 > 10 and n(1-p/) 14210” (1—p) 14> 10. (c) 
7#=1,6452* = 1.645; pA=3650=0.72? = 30 = nee 

0.72 + 1.6450.72(0.28)50=0.72+0.104=(0.616, 0.824) 


0.72 + 1.645,/——"= = 0.72 + 0.104 = (0.616, 0.824) 


(d) We 
are 90% confident that the interval from 0.616 to 0.824 captures p = the true 
proportion of all seniors at Tonya’s high school who plan to attend the prom. 


8.41 STATE: p = the true proportion of all U.S. adults who play video games. 
PLAN: One-sample z interval for p. Random: The adults were selected randomly. 
10%: n=2001" = 2001 is less than 10% of the population of all U.S. adults. Large 
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Counts: np\=980.49>10"%P = 980.49 > 10 and n(1-p)=1020.51>10 
n(1—p) = 1020.51 > 10 no; (0.468, 0.512). CONCLUDE: We are 95% 
confident that the interval from 0.468 to 0.512 captures p = the true proportion of 
U.S. adults who play video games. 


8.43 (a) We do not have enough information because we do not know the size of 
the sample that was taken from each age group. (b) Larger margin of error because 
this is a smaller sample size than the original group. 

8.45 (a) STATE: p = the true proportion of all U.S. adults who think that organic 
produce is better for health than conventionally grown produce. PLAN: One- 
sample z interval for p. Random: The adults were selected randomly. 10%: n=1480 
n = 1480 js less than 10% of the population of all U.S. adults. Large Counts: 
np\=814210"P = 814 > 10 and n(i-ps)=666210" (1 — P) = 666 = 10 po, 
(0.517, 0.583). CONCLUDE: We are 99% confident that the interval from 0.517 to 
0.583 captures p = the true proportion of U.S. adults who would agree with the 
statement “Organic produce is better for health than conventionally grown 
produce.” (b) Yes; all of the plausible values in the interval are greater than 0.5, 
which provides convincing evidence that a majority of all U.S. adults think that 
organic produce is better for health than conventionally grown produce. 


8.47 The confidence interval is (0.616, 0.824). We expect between 61.6% and 
82.4% of the 750 seniors to attend the prom. We can be 90% confident that the 
interval from 462 to 618 captures the total number of seniors planning to go to the 
prom. 


2.576,/-——" < 0.03 


8.49 (a) Solving 2.5760.44(0.56)n<0.03 gives n=1817 


2576 / 0.5(0.5) 


a 


n = 1817. (b) Solving 2.5760.5(0.5)n<0.03 a, OS gives n=1844 
n = 1844, The conservative approach requires 27 more adults. 


0.5(0.5) 


- 5 cence <a . e 
8.51 Solving 1.6450.5(0.5)n<0.041°64° ya SS gives n=423n = 423, 
/ 0.64(0.36) 
: 0.03 = z* \/ = . 
8.53 (a) Solving 0.03=z*0.64(0.36)1028 1028 gives z*=2.00 


z* = 2.00. The 


confidence level is likely 95%. (b) Teens are hard to reach and often unwilling to 
participate in surveys, so nonresponse bias is a major “practical difficulty” for this 
type of poll. If teens with TVs in their rooms are less likely to answer the poll 
(because they are watching TV in their rooms!), the estimate from the poll would 
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likely be too small. 

8.55 a 

8.57 d 

8.59 (a) X follows the Normal distribution with a mean of 21.1 and a standard 
deviation of 1.8. We want to find P(X > 22). (i) z = 0.5; P(z = 0.5) = 0.3085 (ii) 
P(X > 22) = normalcdf(lower: 22, upper: 1000, mean: 21.1, SD: 1.8) = 0.3085. 
There is a 0.3085 probability that a randomly selected orange from this tree has a 


circumference greater than 22 cm. (b) Normal distribution; px =m=21.1 cm 
He =m=21.1cm. 


_ === 
Because 20 is less than 10% of all oranges on the tree, ox =on= “— 
== = 0.4025 __ 
1.820=0.4025 v20 cm. We want to find P(x X> 22). (i) z=2.242 = 2.24 


P(z > 2.24) = 0.0125 (ii) 

P(x >.22)=normalcdf(lower: 22, upper: 1000, mean: 21.1, SD: 0.4025)=0.0127 
P(z>.22)=normalcdf (lower: 22, upper: 1000, mean: 21.1, SD: 0.4025) =0.0127 
. There is a 0.0127 probability that the mean circumference of 20 randomly 
selected oranges from this tree is greater than 22 cm. 


Section 8.3 

Answers to Check Your Understanding 

page 531: 1. df = 22 — 1 = 21, t* = 2.189. Tech: invT(area: 0.02, df: 21) = —2.189, 
so t®* = 2.189. 

2. df = 71 — 1 = 70, t* = 2.660 (using df = 60). Tech: invT(area: 0.005, df: 70) = 
—2.648, so t* = 2.648. 

page 538: STATE: p = the true mean healing rate. PLAN: One-sample t interval 
for p. Random: The newts were randomly chosen. 10%: n = 18 is less than 10% of 
the population of all newts. Normal/Large Sample: The histogram below does not 
show strong skewness or outliers, so this condition is met. 


w > 


Frequency 
= 


T “T 7 T 
10 15 20 25 30 35 40 
Healing rate (j:m/h) 
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DO: x =25.67X = 25.67 and s, = 8.32; df = 17 and t* = 2.110; 25.67 ++ 
2.110(8.3218)=25.67+4.14=(21.53,29.81). 


2.110 (22) = 25.67 + 4.14 = (21.53,29.81). 
Vid 


CONCLUDE: We are 95% 
confident that the interval from 21.53 to 29.81 micrometers per hour captures j= 
# = the true mean healing rate for all newts. 


o 


Z* —. 
page 540: The margin of error is defined to be z*on. vv" Using o® = 154 and 
30 > 1.645 4%. 


Vv n 


z* = 1.645 for 90% confidence, 3921-645 154n. 


1.645(154) \ 2 : 
n>() =13 
n2(1.645(154)30)2=71.3 , so take a sample of 72 students. 


Thus, 


Answers to Odd-Numbered Section 8.3 Exercises 

8.61 (a) df = 9, t* = 2.262. Tech: invT(area: 0.025, df: 9) = —2.262, so t* = 2.262. 
(b) df = 19, t* = 2.861. Tech: invT(area: 0.005, df: 19) = —2.861, so t* = 2.861. (c) 
Use df = 60, t* = 1.671. Tech: invT(area: 0.05, df: 76) = —1.665, so t* = 1.665. 
8.63 Not met; the sample size is small (n = 28 < 30) and there are outliers in the 
data. We cannot assume that the population is approximately Normal. 


8.65 (a) Random: No; members of the AP® Statistics class are not a random 
sample of all students at the school. 10%: Yes, assuming 32 is less than 10% of all 
2 = 32 > 30 
n=322>30!% _(b) 
Random: Yes; random sample of 100 home sales from the previous 6 months in 


her city. 10%: Yes; assume that 100 is less than 10% of all homes that were sold 
n=100230 


students at the school. Normal/Large Sample: Yes; 


during the previous 6 months in her city. Normal/Large Sample: Yes; 


eee sid Despite the fact that the boxplot showing the distribution of home 


sales is strongly right skewed with outliers, this condition is still met because the 
sample size is large. 


SE; = -£ = 2% = 1.7898. 
vn V 27 If we take many 


8.67 SEx =sxn=9.327=1.7898. 
samples of size 27, the sample mean blood pressure will typically vary by about 
1.7898 from the population mean blood pressure. 

8.69 (a) STATE: p = the true mean percent change in BMC for breast-feeding 
mothers. PLAN: One-sample t interval. Random: The mothers were randomly 


selected. 10%: 47 is less than 10% of all breast-feeding mothers. Normal/Large 
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_ = 
Sample:n=47230" = 47 = 30. Do: df = 40 (-4.575,-2.590) (-*-575, — 2.500). 


Tech: (—4.569,-2.605) with df = 46. CONCLUDE: We are 99% confident that the 
interval from —4.569 to —2.605 captures p = the true mean percent change in BMC 
for breast-feeding mothers. (b) Because all of the plausible values in the interval 
are negative (indicating bone loss), the data give convincing evidence that breast- 
feeding mothers lose bone mineral, on average. 


8.71 STATE: p = the true mean weight of an Oreo cookie. PLAN: One-sample t 
interval. Random: The cookies were randomly selected. 10%: 36 is less than 10% 


of all Oreo cookies. Normal/Large Sample:n=36230" = 36 = 30, Do: df = 30 


11.369, 11.4152). 
(11.369, 11.4152) | ) Tech: (11.369, 11.415) with df = 35. 


CONCLUDE: We are 90% confident that the interval from 11.369 g to 11.4152 g 
captures p = the true mean weight for all Oreo cookies. 

8.73 STATE: p = the true mean number of pepperonis on a large pizza at this 
restaurant. PLAN: One-sample t interval. Random: Pizzas were randomly selected. 
10%: 10 is less than 10% of all pepperoni pizzas made at this restaurant. 
Normal/Large Sample: The dotplot doesn’t show any outliers or strong skewness. 


G oO 

e e @ @¢@ ee ¢@ 

as T T T T hee 
25 30 35 40 45 50 
Number of pepperonis 


po: * =374¢ = 37.4 sx=7.66252 = satis and n= 10; df = 9; (31-919, 42.881). 


(31.919, 42.881). . 
Tech: (31.919, 42.881) with df = 9. CONCLUDE: We are 95% 


confident that the interval from 31.919 to 42.881 captures p = the true mean 
number of pepperonis on a large pizza at this restaurant. 


8.75 (a) It was necessary because the sample size was not large (less than 30). 
When the sample size is less than 30, we must assume the population is Normally 
distributed. (b) The value 40 is a plausible value found within the confidence 
interval, so we do not have convincing evidence that the average number of 
pepperonis is less than 40. 

7.5 


ay eae 
8.77 Solving 2.576 wok mr ) gives n = 373.26, take an SRS of 374 
women. 


SE- = 19.03 = “2 = — 


8.79 (a) SEX =19.03=sxn=sx23 vi V3 s0 sx=19,0323= 
S, = 19.03/23 =91 2691.26 cm. 


(b) The researchers are using a critical value of t* = 1; with df = 23 — 1 = 22, the 
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area between t = —1 and t = 1 is approximately tcdf (lower: —1, upper: 1, df: 22) = 
0.67. So the confidence level is 67%. 


8.81 b 
8.83 b 


8.85 (a) P(X=7)=0.57P (X = 7) = 0.57 (b) Mean: If we were to randomly select 
many young people, the average number of days they watched television in the 
past 7 days would be about 5.44. Standard deviation: The number of days of 
television watched typically varies from the mean (5.44 days) by about 2.14 days. 
(c) Because n=100>30" = 100 > 30) we expect the mean number of days x = for 
100 randomly selected young people to be approximately Normally distributed 
with mean ux =p=5.44Hz ~ H = 5.44 Because the sample size (100) is less than 
7 op = et = 44 = 0.214 
10% of all young people, ox =on=2.14100=0.214 vn v 100 . We 
want to find P(x #< 4.96). (i) z = -2.24; P(z < -2.24) = 0.0125 (ii) P(x ¥< 4.96) 
= normalcdf(lower: —1000, upper: 4.96, mean: 5.44, SD: 0.214) = 0.0124. There is 
a 0.0124 probability of getting a sample mean of 4.96 or smaller. Because this 
probability is small, a sample mean of 4.96 or smaller would be surprising. 


Answers to Chapter 8 Review Exercises 


1-0.94 _ 
R8.1 (a) 1-0.942=0.03 2 — 0.08 ; the closest area is 0.0301, corresponding to 


a critical value of z* = 1.88. Tech: invNorm(area: 0.03, mean: 0, SD: 1) = —-1.881. 
So z* = 1.881. (b) df = 57; using Table B and 50 degrees of freedom, t* = 2.678. 
Tech: invT(area: 0.005, df: 57) = —2.665, so t* = 2.665. 


— _ 4304470 __ 
R8.2 (a) The point estimate x _=430+4702=450" ~~ 2—O 450 minutes. The 


margin of error = 470 — 450 = 20; df = 29 and t* = 2.045. Thus, 20=20 =2.045 


Sy a 20 


2.0 z.. _— =—_= 
5x30 v30 and the standard error = sx30=202.045= v30 2-945. 780 


Sx 


= = 9.780 
9.780. Finally, because sx30=9.780 v0 , SX=9.78030= 


Sz = 9.78030 =53 5753.57, (b) Incorrect; the confidence interval provided 
gives an interval estimate for the mean lifetime of batteries produced by this 


company, not individual lifetimes. (c) No; any given confidence interval either 
captures the population mean (probability = 1) or it doesn’t (probability = 0). (d) If 
we were to take many samples of 30 batteries and compute 95% confidence 
intervals for the mean lifetime, about 95% of these intervals will capture the true 
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mean lifetime of the batteries. 


R8.3 (a) p = the proportion of all adults aged 18 and older who would say that 


football is their favorite sport to watch on television. (b) The point estimate 
oceans 
pA=3701000=0.37" 1000 sail (c) Because of sampling variability, I doubt 


that the value of the point estimate is exactly equal to the value of p. However, 
because the data were collected from a large random sample from the population of 
interest, we have reason to believe that the value of the point estimate will be 
relatively close to the value of p. 

R8.4 (a) STATE: p = the true proportion of all drivers who have run at least one 
red light in the last 10 intersections they have entered. PLAN: One-sample z 
interval. Random: Drivers were selected at random. 10%: n = 880 is less than 10% 


of all drivers. Large Counts: np\p = 171> 10 and n(1>p)" (1 > P) = 709 > 10. 


ae 
DO: p\=171880=0.194? = 380 = 9-194. (9 168 0.290) (0-168, 0.220) . 


CONCLUDE: We are 95% confident that the interval from 0.168 to 0.220 captures 
the true proportion of all drivers who have run at least one red light in the last 10 
intersections they have entered. (b) It is likely that more than 171 respondents have 
run red lights. We would not expect very many people to claim they have run red 
lights when they have not, but some people will deny running red lights when they 
have. The margin of error does not account for these sources of bias, only sampling 
variability. 

R8.5 (a) STATE: ps = the true mean measurement of the critical component for the 
engine crankshafts produced in one day. PLAN: One-sample t interval. Random: 
The crankshafts are randomly selected. 10%: n = 16 is less than 10% of all 
crankshafts produced in one day. Normal/Large Sample: The histogram shows no 
strong skewness or outliers. 


Frequency 
oOorNnNwWwWeUuUaw~ we © 
SS Se 


223,90 223.95 224.00 224.05 224.10 


Component measurement (mm) 


DO: x =224,002% = 224.002, sx=0,061882 = 9-0618 and n=16n = 16. Thus, 


df = 15 and te=2.131.t" = 2.131. (223.969,224.035),(223-969, 224.035) . 
CONCLUDE: We are 95% confident that the interval from 223.969 to 224.035 
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mm captures p/ = the true mean measurement of the critical component for engine 
crankshafts produced on this day. (b) Because 224 is in this interval, it is a 
plausible value for the true mean. We don’t have convincing evidence that the 
process mean has drifted. 


9 576 J 0.5(0.5) 


< 0.01 


(2) < 1000 
R8.7 Solving 1.96(3000n)<1000 vin 
fir. 


R8.6 Solving 2.5760.5(0.5)n<0.01 ,n = 16,590 adults. 


,n = 35 pieces of Douglas 


R8.8 (a) If we increase the confidence level, then the margin of error must get 
larger to increase the capture rate of the intervals. (b) If we quadruple the sample 
size, the margin of error will decrease by a factor of 2. 


R8.9 (a) We use a ¢ critical value, rather than a z critical value, when constructing a 
confidence interval for a population mean when using the sample standard 
deviation s, to estimate the population standard deviation o@. (b) As the degrees of 
freedom increase, the t critical values decrease and become closer and closer to the 
z critical value for a particular level of confidence. 


Answers to Chapter 8 AP® Practice Test 
T8.1a 
T8.2 d 
T8.3 Cc 
T8.4d 
T8.5 b 
T8.6a 
TG.7 < 
T8.8 d 
T8.9 e 
T8.10d 


T8.11 (a) STATE: p = the true proportion of all visitors to Yellowstone who would 
say they favor the restrictions. PLAN: One-sample z interval. Random: The visitors 
were selected randomly. 10%: n = 150 is less than 10% of all visitors to 
Yellowstone National Park. Large Counts: =g9>10Counts : = 89 > 10 and 


~ 89 


n(1-p)=61210"(1 — P) = 61 = 10. Ho; pr=g91s0=” ~ 150 ~0,5930.593; 
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(0.490,0.696).(9-499, 0.696) . CONCLUDE: We are 99% confident that the 
interval from 0.490 to 0.696 captures p = the true proportion of all visitors who 
would say that they favor the restrictions. (b) Because there are values less than 
0.50 in the confidence interval, the U.S. Forest Service cannot conclude that more 
than half of visitors to Yellowstone National Park favor the proposal. It is plausible 
that only 49% favor the proposal. 


T8.12 (a) We are 95% confident that the interval from 317.64 to 394.56 captures 


= the true mean number of licks it takes to get to the center of a Tootsie Pop. (b) 


317.64+394.56 _ gre 4 
The point estimate is 317.64+394.562=356.1 2 7 ™,. The margin of 


error is 394.56 — 356.1 = 38.46. (c) The researcher could reduce the margin of 
error by decreasing the confidence level. The drawback is that we can’t be as 
confident that our interval will capture the true mean. The researcher could also 
reduce the margin of error by increasing the sample size. The drawback is that 
larger samples cost more time and money to obtain. 

T8.13 STATE: p = the true mean number of bacteria per milliliter in raw milk 
received at the factory. PLAN: One-sample t interval. Random: The data come 
from a random sample. 10%: n=10n = 10 is less than 10% of all one-milliliter 
specimens that arrive at the factory. Normal/Large Sample: The dotplot shows that 
there is no strong skewness or outliers. 


e e ¢@ an e 6 » a 
! ' 
4560 4680 4800 4920 5040 5160 5280 5400 
Number of bacteria (per ml) in raw milk 


DO: x =4950.0% = 4950.0, sx=268.58r = 268.5, and n=10n = 10; df = 9 and 
pe=t* = 1,833,1.833. (4794.37,5105.63), (4794-37, 5105.63) . CONCLUDE: We 
are 90% confident that the interval from 4794.37 to 5105.63 bacteria per milliliter 
captures = ~ the true mean number of bacteria in the milk received at this 
factory. 


Chapter 9 

Section 9.1 

Answers to Check Your Understanding 

page 556: 1. HO: p=0.85440 : Pp = 9.85 and Ha: pz0.8540 : P # 9.85 where p= 
P ~ proportion of all students at Jannie’s high school who get fewer than 8 hours 
of sleep at night. 2. HO: u=10H0 ‘= 10 and Ha: u>10La ‘> 10 where 
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u=truel4 = true 


mean amount of time that it takes to complete the census form. 
page 562: 1. Type I error: The manager finds convincing evidence that less than 
63% of the drive-thru wait times are longer than 2 minutes, when the true 
proportion really is 0.63. Type II error: The manager does not find convincing 
evidence that less than 63% of the drive-thru wait times are longer than 2 minutes, 
when the true proportion really is less than 0.63. 

2. In this case, a Type I error is more serious because the manager will believe that 
the additional employee reduces the proportion of drive-thru customers who have 
to wait longer than 2 minutes to receive their food, when that is not the case. 

3. No; if the null hypothesis is true, a significance level of a=0.10@ = 0.10 will 
result in a Type I error 10% of the time just by chance. Since a Type I error is more 
serious in this case, it would be better to pick a smaller value of a@, such as 
a=0.01a = 0.01, 

4. Assuming that the true proportion of all drive-thru customers who have to wait 
longer than 2 minutes to receive their food after placing an order is 0.63, there is a 
0.0385 probability of getting a sample proportion of 0.576 or less who have to wait 
longer than 2 minutes just by chance in a random sample of 250 drive-thru 
customers. 


Answers to Odd-Numbered Section 9.1 Exercises 

9.1 HO: p=0.75/40 : p = 9.75. Ha: p<0.75Ha : Pp < 9-75, where p = the true 
proportion of the students at Mr. Tabor’s school who completed their math 
homework last night. 

9.3 HO: u=180/40 --}= 180. Ha: 141804 iF 180° where p= > the true 
mean volume of liquid dispensed by the machine. 

9.5 HO: o=340 9 = 3 ; Ha: o>3Ha io > 3. where o=0 = the true standard 
deviation of the temperature in the cabin. 

9.7 (a) The null hypothesis is always that there is “no difference” or “no change” 
and the alternative hypothesis is what we suspect is true. These ideas are reversed 
in the stated hypotheses. Correct: HO: p=0.3740 “p= 0.37. Ha: p>0.37 

H, : p > 0.37 (b) Hypotheses are always about population parameters. However, 
the stated hypotheses are in terms of the sample statistic. Correct: HO: p=3000 
Ho : w = 3000 grams; Ha: 1<3000/a > pe < 3000 grams. 

9.9 (a) If HO: p=0.754 0: p = 0.79 is true, then the proportion of all students at 
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Mr. Tabor’s school who completed their homework last night is 0.75. (b) 
Assuming that the proportion of all students at Mr. Tabor’s school who completed 
their homework last night is 0.75, there is a 0.1265 probability of getting a sample 
proportion of 0.68 or less just by chance in a random sample of 50 students at the 
school. 


9.11 Assuming the true mean volume of liquid dispensed by the machine is 180 ml, 
there is a 0.0589 probability of getting a sample mean at least as far from 180 as 
179.6 (in either direction) just by chance in a random sample of 40 bottles filled by 
the machine. 

9.13 The student forgot to include the conditions and the direction in the 
interpretation. Assuming the null hypothesis is true, there is a 0.044 probability of 
getting the sample result I did or one even larger by chance alone. 

9.15 Because the PP-value of 0.1265 is greater than a=0.05a = 0.05, we fail to 
reject Hoo. We do not have convincing evidence that the true proportion of 
students at Mr. Tabor’s school that completed their math homework last night is 
less than 0.75. 

9.17 (a) Because the PP-value of 0.0589 is less than a=0.10@ = 0.10, we reject 
Hoo. We have convincing evidence that the true mean volume of liquid 
dispensed by the machine is different from 180 ml. (b) Yes; because the PP-value 
of 0.0589>0=0.050.0589 > a = 0.05, we would fail to reject HO40. We do not 
have convincing evidence that the true mean volume of liquid dispensed by the 
machine is different from 180 ml. 


9.19 It is never correct to “accept the null hypothesis.” If the P-value is large, the 
data do not provide convincing evidence that the alternative hypothesis is true. 
However, lacking evidence for the alternative hypothesis does not provide 
convincing evidence that the null hypothesis is true. 

9.21 (a) HO: u=1Ho SM= 1. Ha: <1 ‘He <1 where p=! = the true mean 
weight (in pounds) of bread loaves produced at the bakery. (b) There is some 
evidence for the alternative hypothesis because the mean weight of an SRS of 
bread loaves is only 0.975 pound, which is less than what the mean is supposed to 
be (1 pound). (c) Assuming that the true mean weight of bread loaves produced at 
the bakery is 1 pound, there is a 0.0806 probability of getting a sample mean of 
0.975 pound or less just by chance in a random sample of 50 bread loaves. (d) 
Because the P-value of 0.0806>a0=0.010.0806 > a = 0.01, we fail to reject HO 


Ho. We do not have convincing evidence that the true mean weight for all loaves 
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of bread produced at the bakery is less than one pound. 

9.23 Type I: You find convincing evidence that the mean income of all residents 
near the restaurant exceeds $85,000 when in reality it does not. Consequence: You 
will open your restaurant in a location where the residents will not be able to 
support it, so your restaurant may go out of business. Type IT: You do not find 
convincing evidence that the mean income of all residents near the restaurant 
exceeds $85,000 when in reality it does. Consequence: You will not open your 
restaurant in a location where the residents would have been able to support it and 
you lose potential income. 

9.25 (a) A Type I error would be finding convincing evidence that the proportion 
of all calls in which first responders took more than 8 minutes to arrive had 
decreased when it really hadn’t. A Type II error would be not finding convincing 
evidence that the proportion of all calls in which first responders took more than 8 
minutes to arrive decreased when it really had. (b) A Type I error would be worse 
because the city would overestimate the ability of the emergency personnel to get 
to the scene quickly and people may end up dying. (c) The probability of a Type I 
error is a @ 0.05 @ 0.05; because the consequence may be the difference 
between life and death, the manager should use a value for o@ that is lower, such 
as a=0.01a = 0.01. 

9.27 (a) HO: p=0.10/40 : p = 9.10. Ha: p>0.104e : Pp > 9.10 where p=P = the 
true proportion of all students at Simon’s school that are left-handed. (b) Based on 
the simulation results, 24 of the 100 simulated trials yielded a sample proportion of 
0.16 or greater, so the PP-value is approximately 24/100=0.2424/ 100 = 0.24. 
Assuming that the true proportion of all students at Simon’s school that are left- 
handed is 0.10, there is a 0.24 probability of getting a sample proportion of 0.16 or 
greater just by chance in a random sample of 50 students. (c) Use a=0.05@ = 0.05 
. Because the PP-value of 0.24>0=0.050.24 > a = 0.05, we fail to reject HOMO . 
We do not have convincing evidence that the true proportion of all students at 
Simon’s school that are left-handed is greater than 0.10. 

9.29 d 

se 


9.33 (a) P(degree earned by a woman)=0.4168 


P (degreeearnedbyawoman) = 0.4168. 


; approximately (24,611)\24, 611) 


(0.4168)=10,258(9-4168) = 10, 258 mathematics degrees were awarded to 
women. (b) Not independent; P(woman) = 0.4168, which is not equal to P(woman 
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| bachelors) = 0.43. (c) P(at least 1 of the 2 degrees earned by a woman) = 1 
—P(neither degree is earned by a woman) = 1-(14,35324,611)(14,35224,610) 


_ (Sa) (Sa) 
24,611 24,610 } -9 6599= 0.6599. 


Section 9.2 

Answers to Check Your Understanding 

page 576: 1. STATE: HO: p=0.20Ho : p = 9.20. Ha: p>0.20Ha : p > 0.20, 
where pP is the true proportion of all teens at the school who would say they have 
electronically sent or posted sexually suggestive images of themselves, using 
a=0.05a = 0.05, PLAN: One-sample z2 test for p. Random: We have a random 
sample of 250 students from the school. 10%: 250 is less than 10% of the 2800 
students at the school. Large Counts: np0=250(0.2)=50>10 

npp = 250(0.2) = 50 > 10 and n(1—p0)=250(0.8) @” (1 — po) = 250(0.8) @ 


— oo 
200$10200$10. DO: p=63250=0.252? ~ 250 aoe. z=0.252-0.200.20( 
x — 0:252-0.0 _ 9 06 
| 0.20(0.80) 
0.80)250=2.06 \V 20 ; P-value = 0.0197. CONCLUDE: Because 


the PP-value of 0.0197<0.0197 <=0.05a = 0.05, we reject HOM0. We have 
convincing evidence that more than 20% of the teens in her school would say they 
have electronically sent or posted sexually suggestive images of themselves. 

page 580: 1. STATE: HO: p=0.7540 : p = 9-75; Ha: pz0.75Ha : Pp # 0.75, 
where pP is the true proportion of all restaurant employees at this chain who would 
say that work stress has a negative impact on their personal lives using a=0.05 

a = 0.05, PLAN: One-sample z2 test for p. Random: We have a random sample 
of 100 employees from the large restaurant chain. 10%: Assume the sample size 
(100) is less than 10% of all employees at this large restaurant chain. Large 


Counts: np0="P0 = 100(0.75)=75$10100 (0.75) = 75810 ang n(1-poy" (1 — Po) 


=100(0.25)== 100 (0.25) = 531925 > 10, Do: pr=68100=0.68 
0.68—0.75 
/0.75(0.25) 


z ==0.68-0.750.75( 0.25)100 V1 =-1,62= ~1.62; p- 


~ 68 


p= 8 =0.68 


100 
value = 0.1052. 


CONCLUDE: Because the PP-value of 0.1052>a=0.100.1052 > a = 0.10, we 
fail to reject HO. We do not have convincing evidence that the true proportion 
of all restaurant employees at this large restaurant chain who would say that work 
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stress has a negative impact on their personal lives is different from 0.75. 

2. The confidence interval gives the values of p? that are plausible based on the 
sample data; 0.75 is a plausible value because it is within the 90% confidence 
interval. So based on the confidence interval, we would fail to reject p=0.75 

p = 0.795 at the a=0.10a = 0.10 significance level. The 90% confidence interval 
provided gives an approximate set of p0P0’s that would not be rejected by a two- 
sided test at the a=0.10@ = 0.10 significance level. A two-sided test only allows 
us to reject (or fail to reject) a hypothesized value for a particular population 
parameter, where a confidence interval provides a set of hypothesized values that 
would not be rejected based on a two-sided test. 


Answers to Odd-Numbered Section 9.2 Exercises 

9.35 Random: SRS of 60 students from a large rural high school. 10%: 60 is less 
than 10% of all students at this large high school. Large Counts: 
np0=60(0.80)=48>10"Po = 60 (0.80) = 48 > 10 ang n(1-po)=60(0.20)=12>10 
n(1—po) = 60(0.20) = 12 >10 


9.37 The expected number of successes np0=10(0.5)=5"Po = 10 (0.5) = 5 ana 
failures n(1—p0)=10(0.5)=5” (1 — po) = 10(0.5) = 5 are both less than 10, so the 
Large Counts condition is not met. 


9.39 (a) The sample result gives some evidence for Ha:Ha : p<0.80P = 0.80 


p = + = 0.683 
because p\=4160=0.683 60 : , which is less than 0.80. 
oie 0.683-0.80 — 997 
/ 0.80(0.20) 
(b) z=0.683-0.800.80( 0.20)60=-2.27 \ 60 ; P-value=0.0116 


P-value = 0.0116. (c) Because the P-value of 0.0116<a=0.05 

0.0116 < a = 0.05, we reject HO#0. We have convincing evidence that the true 
proportion of all students at this large rural high school who have a computer at 
home is less than 0.80. 


9.41 (a) The P-value is P(z>2.19)=0.0143"° (2 2 2.19) = 0.0143 4 ccuming that 
the true population proportion is 0.5, there is a 0.0143 probability of getting a 
sample proportion as large as or larger than the one observed just by chance in a 
random sample of size n=200" = 200, (b) a=0.01@ = 0.01; because the P-value 
of 0.0143>a=0.010.0143 > a = 0.01, we fail to reject HOM0 . There is not 
convincing evidence that p>0.5? > 9-5, For a=0.05@ = 0.05, yes! The P-value of 
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0.0143<a=0.050.0143 < a = 0.05, so this time we will reject HO/40. There is 
convincing evidence that p>0.5? > 9-5. (c) Solve for pAP; 2.19=p-0.50.5( 


2.19 = 2 -, § = 0.5774 


/0.5(0.5) 
0.5)200 + p\=0.5774 \ “200 
9.43 STATE: HO: p=0.75/40 : p = 9.75, Ha: p>0.75Ha : Pp > 9.75 where p=P = 
the true proportion of all middle school students who engage in bullying behavior, 
using a=0.05@ = 0.05, PLAN: One-sample z test for p. Random: We have a 
random sample of 558 middle school students. 10%: 558 is less than 10% of all 


middle school students. Large Counts: np0=558(0.75)=418.5>10 

npo = 558 (0.75) = 418.5 > 10 and n(1-p0)=558(0.25)=139.5210 

n (1 — po) = 558 (0.25) = 139.5 > 10 po; 7=0,797-0.750.75( 0.25)558=2.56 
= 0.797—0.75 — 256 


\ ~ 58 ; P-value=0.0052P-value = 0.0052, CONCLUDE: 
Because the P-value of 0.0052<a=0.050.0052 < a@ = 0.05, we reject Holo. We 
have convincing evidence that the true proportion of all middle school students 


yA 
“a 


who engage in bullying behavior is greater than 0.75. 

9.45 (a) Type I: Finding convincing evidence that more than 37% of students were 
satisfied with the new parking arrangement, when in reality only 37% were 
satisfied. Consequence: The principal believes that students are satisfied and takes 
no further action. Type I: Failing to find convincing evidence that more than 37% 
are satisfied with the new parking arrangement, when in reality more than 37% are 
satisfied. Consequence: The principal takes further action on parking when none is 
needed. (b) STATE: HO: p=0.3740:P = 9.37. Ha: p>0.37Ha: P > 9-37, where 
p=the true? ~ the true proportion of all students who are satisfied with the 
parking situation after the change using a=0.05a = 0.05, PLAN: One-sample z2 
test for p. Random: We have an SRS of 200 students from the school. 10%: 200 is 
less than 10% of the population of size 2500. Large Counts: 


np0=200(0.37)=74>10"Po = 200 (0.37) = 74 > 10 ang 


n(1—p0)=200(0.63)=126>10" (1 — Po) = 200 (0.63) = 126 > 10 po. 
= 0.415—0.37 — 1.32 
| 0.37(0.63) 


z=0.415-0.370.37( 0.63)200=1.32 ¥ 200 ; P-value=0.0934 
P-value = 0.0934, 


CONCLUDE: Because the PP-value of 0.0934>0=0.050.0934 > a = 0.05, we 
fail to reject Hoo . We do not have convincing evidence that the true proportion 


yl 
“~ 
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of all students who are satisfied with the parking situation after the change is 
greater than 0.37. 

9.47 A lower-tailed test (HO: p=0.50; Ha: p<0.50)(40 : p = 0.50; Ha : p < 0.50) 
will never reject the null hypothesis when the sample proportion is greater than the 
hypothesized value. 

9.49 (a) HO: p=0.75; Ha: p#0.750 : p = 0.75; Ha : p # 0.75 Where p=the true 
p = the true proportion of peas that will be smooth using «=0.05a = 0.05, 


p= > = B=0.761 
(b) p\=423423+133=423556=0.761" ~ 4234133 ~ 556 ~ OT!" 
Po 0.761—0.75 — 0.60 


| 0.75(0.25) 
z=0.761-0.750.75( 0.25)556=0.60 \ 6 — 
P-value=0.5568P-value = 0.5568. (c) Assuming that the true proportion of 
smooth peas is 0.75, there is a 0.5568 probability of getting a sample proportion as 
different from 0.75 as 0.761 (in either direction) just by chance in a random sample 
of 556 peas. CONCLUDE: Because the P-value of 0.5568>a=0.05 
0.5568 > a = 0.05, we fail to reject HO0. We do not have convincing evidence 
that the true proportion of peas that are smooth is different from 0.75. 
9.51 STATE: HO: p=0.60; Ha:p~0.60/40:? = 9.60; Ha : p # 0.60 Where 
p=the true? — the true the true proportion of teens who pass their driving test on 
the first attempt using a=0.05@ = 0.05, PLAN: One-sample z2 test for p. 
Random: We have an SRS of 125 teens. 10%: 125 is less than 10% of all teens 
who take the driving test. Large Counts: np0O=125(0.60)=75210 
npo = 125 (0.60) = 75 > 10 ang n(1—po)=125(0.40)=50>10 
n(1— po) = 125 (0.40) = 50 > 10 Ho: 7=0,688-0.600.60( 0.40)125=2.01 
> — 0.688-0.60 _ 5 9] 


\ 2 
P-value=0.0444P-value = 0.0444, CONCLUDE: Because the PP-value of 
0.0444<a=0.050.0444 < a = 0.05, we reject Ho40 . There is convincing 
evidence that the true proportion of teens who pass the driving test on their first 
attempt is different from 0.60. 
9.53 (a) STATE: p=the true? = the true proportion of teens who pass their 
driving test on the first attempt. PLAN: One-sample z interval for p. The conditions 
have been met. DO: (0.607,0.769)(9-607, 0.769) CONCLUDE: We are 95% 
confident that the interval from 0.607 to 0.769 captures the true proportion of teens 
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who pass the driving test on the first attempt. (b) The confidence interval gives the 
values of p that are plausible based on the sample data; 0.60 is not a plausible value 
because it falls outside the 95% confidence interval. So based on the confidence 
interval, we would reject p=0.60? = 9-69 at the a=0.05a = 0.05 significance 
level. The 95% confidence interval provided gives an approximate set of p0P0’s 
that would not be rejected by a two-sided test at the a=0.05@ = 0.05 significance 
level. A two-sided test only allows us to reject (or fail to reject) a hypothesized 
value for a particular population parameter. 

9.55 No; because the value 0.17 is included in the confidence interval, it is a 
plausible value for the true proportion of U.S. adults who would say they use 
Twitter. In other words, we do not have convincing evidence that the true 
proportion of U.S. adults who would say they use Twitter differs from 0.17. 

9.57 (a) p=the true? = the true proportion of undergraduates at this large 
university who would be willing to report cheating by other students. (b) Random: 
We have an SRS of 172 undergraduate students. 10%: 172 is less than 10% of all 
undergraduates at this large university. Large Counts: npO=172(0.15)=25.8>10 
npo = 172 (0.15) = 25.8 > 10 ang n(1—po)=172(0.85)=146.2>10 

n(1—po) = 172 (0.85) = 146.2 > 10) (c) The P-value is 0.146. Assuming that 
the true proportion of undergraduates at this large university who would report 
cheating is 0.15, there is a 0.146 probability of getting a sample proportion that is 
at least as different from 0.15 (in either direction) as p\=0.11P = 0.11, (d) No; 
because the P-value of 0.146>a=0.050.146 > a = 0.05, we fail to reject HOMO . 
We do not have convincing evidence that the true proportion of undergraduates at 
this large university who would report cheating differs from 0.15. 

vee ie 

9.61b 

9.63 (a) x-YX = ¥ has a Normal distribution with mean UX-Y=pX 
—pY=5.3-5.26=0.04HxX-Y¥ = Mx — Hy = 5.3 — 5.26 = 0.04 and standard 
deviation oX-Y=o0X2+o0Y2=(0.01)2+(0.02)2=0.0224 


a /% +02, = \/ (0.01)” + (0.02)? = 0.0224 i OF ct 


on a positive number. (b) We want to find P(x-Y>0)P (X-Y> 0). 


(i) z=-1.792 = —1.79. p(z>-1.79)=0.9633" (2 > —1.79) = 0.9633 (iy 
normalcdf (lower: 0, upper: 1000, mean: 0.04, SD: 0.0224)=0.9629 
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normalcdf (lower: 0, upper: 1000, mean:0.04,SD:0.0224) = 0.9629 tore isa 
0.9629 probability that a randomly selected CD will fit in a randomly selected 


100 
case. (c) P(all fit)=(0.9629)100=0.0228"? (allfit) = (0.9629) = 0.0228. there 
is a 0.0228 probability that all 100 CDs will fit in their cases. 


Section 9.3 
Answers to Check Your Understanding 


page 591: 1. HO: 1=320; Ha: p#32040 : # = $20; Ha : wh # 320 Where 


eft = the true mean amount of active ingredient (mg) in Aspro tablets 


=the tru 
from this batch of production using a=0.05@ = 0.05, 
2. Random: We have a random sample of 30 tablets. 10%: Assume the sample of 
size 30 is less than 10% of the population of all tablets in this batch. Normal/Large 
Sample:n=30230" = 30 > 30. All conditions are met. 

+ — 319-320 _ _1 93 
3. t=319-320330=-1.83 v% ; P-value=0.0775 
P-value = 0.0775 
4. Because the P-value of 0.0775>a0=0.050.0775 > a@ = 0.05, we fail to reject HO 


A. There is not convincing evidence that the true mean amount of the active 


ingredient in Aspro tablets from this batch of production differs from 320 mg. 


page 595: STATE: HO: p=8; Ha: u<g/o >= 8;Ha > M@ <8 where u=the true 
2 = the true mean amount of sleep that students at the teacher’s school get each 
night using a=0.05a = 0.05, PLAN: One-sample ¢ test for np’. Random: The 
teacher selected a random sample of 28 students. 10%: Assume the sample size 
(28) is less than 10% of the population of students at this school. Normal/Large 
Sample: There were only 28 students, so we need to examine the sample data. The 
histogram indicates that there is not much skewness and no outliers, so it is 


reasonable to use a t procedure. 


B44 
m 34 
2- 
i 
0 T T 


T T T T T T 
30 45 60 75 90 105 12.0 
Amount of sleep (h) 
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= —3.625 

DO: t=6.643-81.98128=-3.625 v2 ; P-value=0.0006 
P-value = 0.0006. 

CONCLUDE: Because the P-value of 0.0006<a=0.059.0006 < a = 0.05, we 
reject Hoo. There is convincing evidence that students at this university get less 


than 8 hours of sleep, on average. 


page 597: 1. HO: p=128440 * # = 128. Ha: y2128Ho : MF 128 Where 


eft = the true 


=the tru mean systolic blood pressure for the company’s middle- 


aged male employees. 


2. Random: The director examines the medical records of a random sample of 72 
male employees in this age group. 10%: The sample size (72) is less than 10% of 
the population of middle-aged male employees at this large company. 
Normal/Large Sample:n=72>30" = 72 = 30, 

3. The 95% confidence interval does include 128 as a plausible value, so we fail to 
reject HO/0 at the a=0.05a = 0.05 significance level. We do not have convincing 
evidence that the true mean systolic blood pressure for the company’s middle-aged 
male employees is different from 128. 


page 603: 1. If the true mean percent change in TBBMC during the exercise 


1 


program is p=1/ ~ “, there is a 0.80 probability that the researchers will find 


convincing evidence for Ha: u>oHa sp> 0. 


2. P(Type I error)=a=0.05" (Typelerror) = a = 0.05, P(Type II 
error)=1-Power=1-0.80=0.20" (Typellerror) = 1 — Power = 1 — 0.80 = 0.20 


3. The researchers could increase the power of the test in Question 1 by increasing 


the significance level (a) (@) or increasing the sample size (n)(2), 


Answers to Odd-Numbered Section 9.3 Exercises 

9.65 Random: The teacher selected an SRS of 45 students. 10%: 45 is less than 
10% of the population size of 1000. Normal/Large Sample:n=45>30" = 45 > 30. 
9.67 (a) HO: p=10.5; Ha: p<10.540 : # = 10.5; Ha sw < 10.5 Where p=the true 
pe = the true mean battery life when playing videos for all tablets. (b) Random: A 
random sample of 20 tablets was selected. 10%: 20 is less than 10% of the 
population of all tablets. Normal/Large Sample: The sample size is less than 30 
and the dotplot of the distribution of battery life is strongly skewed to the right. 
This condition is not met. 
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9.69 (a) The sample result gives some evidence for the alternative hypothesis 
because x =62.8%642 = 62.8 # 64 (4) t=62.8-645.3625=-1.12 
+ — 828-64 _ 4 49 


P-value: ; Table B: df=25-1=244f = 25 — 1 = 24p_vyalue is between 0.20 and 
0.30. Tech: 2: tcdf(lower: —1000, upper: —1.12, df: 24)=0.2738 
2 - tcdf (lower: — 1000, upper: — 1.12,df : 24) = 0.2738 


t= SO — 2.41 


9.71 (a) t=125.7-11529.845=2.41 vB (b) P-value: df=45-1=44 
df = 45 — 1 = 44. Table B: Using df=40df = 40, the P-value is between 0.01 and 


0.02. Tech: (c) tcdf(lower: 2.41, upper:1000, df: 44)=0.0101 


tedf(lower:2.41,upper:1000,df : 44) = 0.0101 pocanse the P-value of 


0.0101<a=0.0101 < a =0,050.05, we reject HO/0. We have convincing 
evidence that the true mean SSHA score in the population of students at her college 
who are at least 30 years old is greater than 115. 

9.73 (a) STATE: HO: p=25/40 + # = 25; Ha: p>25/4a +H > 25, where p=the true 
je = the true the true mean speed of all drivers in a construction zone using 
a=0.01@ = 0.01, PLAN: One-sample tt test for p. Random: The 10 drivers were 
randomly selected. 10%: 10 is less than 10% of all drivers in a construction zone. 
Normal/Large Sample: Because the sample is small, we need to graph the sample 
data. There is no strong skewness or outliers in the sample, so it is reasonable to 
use a tt procedure. 


e . . ee eee 
rh. ot  & ck «©. & © wo 
20 «22 24 «26 28 #4300 «632 3436 
Speed (mph) 


DO: x =28.8% = 28.8 and 

; df=9df = 9 and P-value=0.0069P-value = 0.0069 

CONCLUDE: Because the P-value of 0.0069<a=0.01 

P-value of 0.0069 < a = 0.01, we reject HO/40. We have convincing evidence 
that the true mean speed of all drivers in the construction zone is greater than 25 
mph. (b) Because we rejected Hoo , it is possible we made a Type I error— 
finding convincing evidence that the true mean speed is greater than 25 mph when 
it really isn’t. 

9.75 STATE: HO: p=5/40 ? # = 5; Ha: p<5Ha tH <5 where p= = the true 
mean reading level of all pages in this novel using a=0.05@ = 0.05, PLAN: One- 
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sample t test for p’. Random: Random sample of 40 pages from this novel. 10%: 
Assume the sample size (40) is less than 10% of all pages in this novel. 
Normal/Large Sample:n=40>30" = 40 > 30 po: x =4.8% = 4.8, sx=0.8 

Sz = 0.8; j=-1 5gt = —1.58. gf=39df = 39 and P-value =0.0610= 0.0610, 
CONCLUDE: Because the P-value of 0.0610>a0=0.050.0610 > a = 0.05, we fail 
to reject Hoo. There is not convincing evidence that the true mean reading level 
for this novel is less than 5. 

9.77 STATE: HO: p=11.540 * # = 11.5; Ha: 411.540 + HF 11-5, where 
=the true = the true mean hardness of the tablets using a=0.05a = 0.05, 
PLAN: One-sample tt test for pu’. Random: Random sample of 20 tablets from 
one large batch. 10%: 20 is less than 10% of all tablets in the batch. Normal/Large 
Sample: Because the sample is small, we need to graph the sample data. There is 
no strong skewness or outliers in the sample, so it is reasonable to use a t 
procedure. 


Frequency 


11.35 11.40 11.45 11.50 11.55 11.60 11.65 11.70 
Hardness 


DO: x =11.5164% = 11.5164 and sx=0.095; t=0.778« = 9-095; t = 0.77. qf=19 
df = 19 and P-value=0.4494P-value = 0.4494 

. CONCLUDE: 0.4494>0=0.050.4494 > a = 0.05 Because the P-value of HO/Zo , 
we fail to reject p= ~. We do not have convincing evidence that the true mean 
hardness of these tablets is different from 11.5. 


9.79 (a) STATE: p= ~ the true mean hardness for this type of pill. PLAN: One- 
sample tt interval for p’. The conditions are met. DO: x _=11.516% = 11.516, 
sx=0,0958 = 9-095 and n=20n = 20; df=19df = 19; tx=2.093t" = 2.093, 
(11.472,11.561)(11.472, 11.561) CONCLUDE: We are 95% confident that the 
interval from 11.472 to 11.561 captures the true mean hardness measurement for 
this type of pill. (b) The confidence interval agrees with the test done in Exercise 
77. Both give 11.5 as a plausible value for the true mean hardness pi’. The 
confidence interval, however, gives other plausible values as well. 


9.81 (a) HO: p=200 milliseconds/40 : # = 200 milliseconds, ty4.1,4200 


H, : p # 200 milliseconds = the true 


milliseconds , where pi=the true mean 
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response time of European servers. (b) Random: Random sample of 14 servers in 
Europe. 10%: 14 is less than 10% of all servers in Europe. Normal/Large Sample: 
The sample size is small, but the problem states that a graph of the data reveals no 
strong skewness or outliers, so it is reasonable to use a t€ procedure. (c) Because 
the 95% confidence interval does not contain 200, we reject Hoo at the a=0.05 
a = 0.05 significance level. We have convincing evidence that the mean response 
time of European servers is different from 200 milliseconds. (d) No! We cannot 
draw any conclusions about the mean response time of servers in the United States 
because we only collected information from a random sample of servers in Europe. 
9.83 (a) Yes; because the PP-value of 0.06>a=0.050.06 > a@ = 0.05, we fail to 
reject HO: p=10/40 : # = 10 at the 5% level of significance. The 95% confidence 
interval will include 10. (b) No; because the P-value of 0.06<a=0.10 

0.06 < a = 0.10, we reject HO: u=1040 > = 10 at the 10% level of 
significance. The 90% confidence interval would not include 10 as a plausible 
value. 

9.85 If the true proportion of potatoes with blemishes in a given truckload is 
p=0.11P = 0.11 there is a 0.764 probability that the company will find convincing 
evidence for Ha: p>0.08/4a: p > 0.08, 

9.87 (a) Power will increase. Using a larger significance level makes it easier to 
reject Hoo when Hae is true. (b) Power will decrease. A smaller sample size 
gives less information about the true proportion p. (c) Power will decrease. It is 
harder to detect a smaller difference between the null and alternative parameter 
value. 

9.89 (a) A disadvantage is that the larger significance level will increase the 
probability of a Type I error. (b) A disadvantage is that the larger sample size 
would require more time and money. 

9.91 (a) If the true mean breaking strength of this company’s classroom chairs is 
y=2944 = 294 pounds, there is a 0.71 probability that I will find convincing 
evidence for Ha:<3004a p< 300. (b) P(Type I error)=a=0.05 

P (Type Lerror) = a = 0.05 and pcType II error)=1-0.71=0.29 

P (Type Il error) = 1 — 0.71 = 0.29. (c) The power would increase by 
increasing the sample size or using a larger significance level. 

9.93 (a) Power=1-P(Type II error)=1—0.14=0.86 

Power = 1 — P(Type II error) = 1 — 0.14 = 0.86 
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(b) P(Type I error)=a=0.01P (Typelerror) = a = 0.01 

9.95 (a) No; in a sample of size n=5007 = 500, we expect to see about (500) 
(0.01)=5(500) (0.01) =5 people who do better than random guessing, with a 
significance level of 0.01. These four might have ESP, or they may simply be 
among the “lucky” ones we expect to see just by chance. (b) The researcher should 
repeat the procedure on these four to see if they again perform well. 

9.97 Although the hypothesis test shows that the results are statistically significant 
(we have convincing evidence that 1>128/! ~ 128), this is not practically 
significant. A test with such a large sample size will often produce a significant 
result for a very small departure from the null value. There is little practical 
significance to an increase in average SAT score of only 2 points. 

9.99 It would not be wise to use these data to carry out a significance test about the 
mean amount spent by all shoppers at the supermarket because this sample wasn’t 
randomly selected—it was a convenience sample. Depending on the time of day or 
the day of the week, certain types of shoppers may be underrepresented. 

9.101 (a) Shape: Because np=500(0.08)=40>10"P = 500 (0.08) = 40 > 10 ang 
n(1—p)=500(0.92)=460>10" (1 — p) = 500 (0.92) = 460 > 10 the sampling 
distribution of p\P is approximately Normal. Center: The mean of the sampling 
distribution of p\P is equal to the population proportion; p/=p=0.08 

Hp ~ P= seal Variability: The sample size (500) is less than 10% of the 
population of all potatoes, so the 10% condition has been met. 
op=0.08(0.92)500=0.0121°? — V— 500 ~ 99121 4) 0 5 area to the 
right of zz=1.6452 — z = 1.645, Solving 1.645=p-0.080.0121 


_ p—0.08 . 
1.645 = “TIT gives pA=0.0999P = 0.0999. 


(ii) invNorm(area: 0.95, mean: 0.08, SD: 0.0121)=0.0999 
invNorm (area:0.95,mean:0.08,5D:0.0121) = 0.0999 


(c) Shape: Because np=500(0.11)=55$108P = 500 (0.11) = 55$10 ang 
n(1—p)=500(0.89)=445>10" (1 — p) = 500 (0.89) = 445 > 10 the sampling 
distribution of p\P is approximately Normal. Center: The mean of the sampling 
distribution of p\p is equal to the population proportion; p/=p=0.11 

Hp ~ P= sal Variability: The sample size (500) is less than 10% of the 
population of all potatoes, so the 10% condition has been met. 
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/0.11(0.89) 
o- — } 
op/=0.11(0.89)500=0.0140 ” V ~~ 500 
» = 9.0999-0.11 _ _g 79 
(d) (i) z=0.0999-0.110.0140=—-0.72" 0.0140 “**. B(z>-0.72)=0.7642 


P(z > —0.72) = 0.7642 


= 0.0140 


(ii) P(p4=0.0999)=normalcdf(lower: 0.0999, upper: 1000, pmean: 0.11, SD: 


0.014)=0.7647. 
P(p > 0.0999) = normalcdf(lower:0.0999,upper:1000, pmean:0.11,SD:0.014) = 0.7647. 


The power of the test to detect p=0.11P = 0.11 is 0.7647, 

9.103 a 

O05 c 

9.107 a 

9.109 (a) Not included; the margin of error does not account for undercoverage. 


(b) Not included; the margin of error does not account for nonresponse. (c) 
Included; the margin of error does account for sampling variability. 


Answers to Chapter 9 Review Exercises 


R9.1 (a) HO: p=64.240 : # = 64.2. pa: 1464.2Ha : HF 64.2 Where =the true 
pe = the true mean height of this year’s female graduates from the local high 
school. The sample data give some evidence for Hala because x =63.5% 

= = 63.5 #64 964.2. (b) HO: p=0.25H0:P = 0.25; Ha: p>0,.25Ha : p > 0.25, 
where p=the true? = the true proportion of all students at this school who have 
played/danced in the rain. The sample data give some evidence for Hata because 
pr=2880=” = ~ 0.35>0.250.35 > 0.25, 

R9.2 (a) Appropriate test: One-sample t test for p’. Random: We have an SRS of 
48 female graduates from this high school. 10%: Assume the sample size (48) is 
less than 10% of all female graduates from this large high school. Normal/Large 
Sample:n=48>30" = 48 > 30. (b) Appropriate test: One-sample z2 test for p. 
Random: We have a random sample of 80 students from this school. 10%: Assume 
the sample size (80) is less than 10% of all students at this school. Normal/Large 
Sample: np0=80(0.25)=20>10"Po = 80 (0.25) = 20 > 10 ang 
n(1-p0)=80(0.75)=60>10" (1 — po) = 80 (0.75) = 60 > 10 

R9.3 (a) t=-1.31! = —1.31, p-value=0.1963P-value = 0.1963 CONCLUDE: 
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We will use a=0.05a = 0.05, Because the P-value of 0.1963>a=0.05 

0.1963 > a = 0.05, we fail to reject HO/0. There is not convincing evidence that 
the true mean height for all female graduates from this high school differs from the 
national average of 64.2 inches. 


(b) z=2.072 = 2.07; P-value = 0.0194P-value = 0.0194 CONCLUDE: We will 
use a=0.05 = 0.05, Because the P-value of 0.0194<a=0.050.0194 < a = 0.05, 
we reject Ho0. We have convincing evidence that the true proportion of all 
students at this school who have played/danced in the rain at some point in their 
lives is greater than 0.25. 

R9.4 STATE: HO: p=22; Ha: p>22Mo : M = 22; Ha : > 22 where =the mean 
4 = the mean amount of time (in seconds) it takes for adults to read four 
paragraphs of text in the ornate font Gigi. We will use a=0.05@ = 0.05, PLAN: 
One-sample t test for p’. Random: We have a random sample of 24 adults. 10%: 
24 is less than 10% of the population of adults. Normal/Large Sample: Because the 
sample size is small, we need to graph the sample data. The histogram below 
shows that the distribution is roughly symmetric with no outliers, so using a t 
procedure is appropriate. 


0 T T T T T T T T 
15 18 21 24 27 30 33 36 


Time (sec) 


DO: x =26.496% = 26.496, sx=4.72882 = 4.728; t=4.66t = 4.66; df=23 

df = 23 and P-value=0.000054P-value = 0.000054, CONCLUDE: Because the 
P-value of 0.000054<a=0.050.000054 < a = 0.05, we reject HO/0. There is 
convincing evidence that the mean amount time it takes to read four paragraphs of 
text in the ornate font Gigi is greater than 22 seconds. 

R9.5 (a) HO: p=0.0540 : p = 9.05, Ha: p<0,054a : Pp < 9.05 where pP is the 
true proportion of adults who will get the flu after using the vaccine. We will use 
a=0.05@ = 0.05. (b) Type I: Finding convincing evidence that less than 5% of 
patients would get the flu, when in reality at least 5% would. Consequence: The 
company might get sued for false advertisement. 

Type II: Failing to find convincing evidence that less than 5% would get the flu, 
when in reality less than 5% would. Consequence: Loss of potential income. (c) 
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Because a Type I error is more serious in this case, and P(Type I error)=a 
P (Typelerror) = @ | would recommend a significance level of a=0.01a = 0.01 
to minimize the possibility of making this type of error. (d) If the true proportion of 
adults who will get the flu after using the vaccine is p=0.03P? — 0.03 there is a 
0.9437 probability that the researchers will find convincing evidence for 

Ha: p<0.054 ai p < 0.05, (e) The power could be increased by increasing the 
sample size or increasing the significance level a@. 

R9.6 STATE: HO: p=0.0540 : p = 9.05. ta: p<o,05He : p < 0.05 where 

p=the true? = the true proportion of adults who will get the flu after using the 
vaccine. We will use a=0.05@ = 0.05, PLAN: One-sample z test for p. Random: 
We have a random sample of 1000 adults. 10%: 1000 is less than 10% of the 
population of adults. Large Counts: np0=1000(0.05)=50>10 

npo = 1000 (0.05) = 50 > 10 ang n(1-po)=1000(0.95)=950$10 

n(1— po) = 1000 (0.95) = 950810 po. ,=-1.922 = 1.02. p-yalue=0.1549 
P-value = 0.1549, CONCLUDE: Because the P-value of 0.1549>a=0.05 

0.1549 > a = 0.05, we fail to reject HO0. We do not have convincing evidence 
that the true proportion of adults who will get the flu after using the vaccine is less 
than 0.05. 

R9.7 (a) Assuming that the roulette wheel is fair, there is a 0.0384 probability that 


p = 31/50) at least this 


we would get a sample proportion of reds (pA=3 1/50)\ 
different from the expected proportion of reds (18/38) by chance alone. (b) 
Because the P-value of 0.0384<a=0.050.0384 < a = 0.05, we reject HO/40. We 
have convincing evidence that the true proportion of reds is not equal to 
18/38=0.47418/38 = 0.474 (¢) Because 18/38=0.47418/38 = 0.474 ic one of 
the plausible values in the interval, this interval does not provide convincing 
evidence that the wheel is unfair. It does not, however, prove that the wheel is fair 
as there are many other plausible values in the interval that are not equal to 18/38. 
Also, the conclusion here is inconsistent with the conclusion in part (b) because the 
manager used a 99% confidence interval, which is equivalent to a test using a=0.01 
a = 0.01. If the manager had used a 95% confidence interval, 18/38 would not be 
considered a plausible value. 


Answers to Chapter 9 AP® Practice Test 
T9.1b 
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T3.2¢ 

T9.3.c 

T9.4e 

T9.5 b 

T3.6c¢c 

13.7 e 

T9.8 d 

T9.9a 

T9.10e 

T9.11 (a) STATE: HO: p=0.20/0:P = 9.20. ta: p>0.20Ma‘p > 0.20) where 
p=the true? — the true proportion of customers who would pay $100 for the 
upgrade using a=0.05@ = 0.05, PLAN: One-sample z test for p. Random: We 
have a random sample of 60 customers. 10%: 60 is less than 10% of this 
company’s customers. Large Counts: np0=60(0.20)=12>10 

npo = 60 (0.20) = 12 > 10 ang n(1—po)=60(0.8)=48>10 

n(1— po) = 60 (0.8) = 48 > 10 Ho. 7=1.292 = 1.29; P-value=0.0985 
P-value = 0.0985 

CONCLUDE: Because the P-value of 0.0985>a0=0.050.0985 > a = 0.05, we fail 
to reject Hoo. We do not have convincing evidence that the true proportion of 
customers who would pay $100 for the upgrade is greater than 0.20. (b) A Type I 
error would be finding convincing evidence that more than 20% of customers 
would pay for the upgrade, when in reality they would not. A Type II error would 
be not finding convincing evidence that more than 20% of customers would pay 
for the upgrade, when in reality more than 20% would. For the company, a Type I 
error is worse because they would go ahead with the upgrade and lose money. (c) 
If the true proportion of customers who would pay $100 for the upgrade is p=0.30 
P= 0.30 there is a 0.60 probability that you will find convincing evidence for 
Ha: p>0.204a: p> 0.20. 

T9.12 (a) HO: u=41.9Ho:H = 41.9. Ha: u<41.9Ha Sf < ALD Where u=the true 
js = the true mean age of all employees of a large technology company. (b) 
Random: We have a random sample of 12 employees from this company. 10%: 12 
is less than 10% of all employees of this large technology company. Normal/Large 
Sample: Because the sample is small, we need to graph the sample data. There is 
no strong skewness or outliers in the sample, so it is reasonable to use a t 
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procedure. 
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(c) Assuming that the true mean age of all employees at this large technology 
company is 41.9 years, there is a 0.003 probability of getting a sample mean of 

x =34.833£ = 34.833 or less just by chance in a random sample of 12 employees. 
CONCLUDE: We will use a=0.05a = 0.05; because the P-value of 0.003<a=0.05 
0.003 < a = 0.05, we reject HO. We have convincing evidence that the true 
mean age of all employees of this large technology company is less than 41.9 
years. 

T9.13 STATE: HO: p=$158/0 ? # = $158. pa: 12915gHa:H# # $158 where 

u =the true/4 = the true mean amount spent on food by households in this city. 
We will perform the test at the a=0.05a@ = 0.05 significance level. PLAN: One- 
sample t test for np’. Random: We have a random sample of 50 households. 10%: 
50 is less than 10% of households in this small city. Normal/Large 
Sample:n=50>30" = 50 > 30. po: t=2.47; df=49t = 2-47; df = 49 and p- 
value=0.0170P-value = 0.0170, CONCLUDE: Because the P-value of 0.0170 is 
less than a=0.05@ = 0.05, we reject HO0. We have convincing evidence that the 
true mean amount spent on food per household in this city is different from the 
national average of $158. 

pale pi?! 


. /0.08(0.92) 


(P= Vn = 0.0121 


opA=0.08(0.92)500=0.0121 pp’ opr?» 
Chapter 10 

Section 10.1 

Answers to Check Your Understanding 

page 630: STATE: 95% CI for p1—p2P1 — P2 , where p1=trueP1 = true 
proportion of working women who would say that job security was very or 
extremely important and p2=trueP2 = 'UE proportion of working men who would 
say that job security was very or extremely important. PLAN: Two-sample z 
interval for p1—p2P1 ~ P2. Random: The data come from independent random 
samples of 806 working women and 944 working men. 10%: n1 = 806<10% 
n, = 806 < 10% of the population of all working women and n2=944<10% 
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nz = 944 < 10% of the population of all working men. Large Counts: 709, 
806-709=97806 — 709 = 97, g02, and 944-802=142944 — 802 = 142 are all 
$10. 


5, = “2 — 0.880 
DO: p1=709806=0.880"! ~ %06 , p\2=802944=0.850 


a 802 
Po = 044 = 0.850 


(0.88-0.85)+1.960.88(0.12)806+0.85(0.15)944=0.03+0.032 


[O88(O-12) 0850.15) _ : 
(0.88 — 0.85) + 1. 964/ 56 a 0.03 + 0.032 =(-0.002, 0.062) 


= (—0.002, 0.062) Tech; 2-PropZInt gives (-0.002, 0.062) 
(—0.002, 0.062) 


CONCLUDE: We are 95% confident that the interval from —0.002~—9-002 
to 0.062 captures p1—p2=theP ~ P2 = the difference in the true 
proportions of all working women and all working men who would say 
that job security is very or extremely important. 


page 637: 1. STATE: HO: p1—p2=040 > Pi — P2 = 0 pa: p1-p2<0 

Hq: pi ~ P2 <9) where p1=the?1 = the true proportion of children like the 
ones in the study who attend preschool that use social services later and p2=the 

po = the tre proportion of children like the ones in the study who do not attend 
preschool that use social services later using a=0.05@ = 0.05. PLAN: Two-sample 
z test for p1—p2P1 ~ P2. Random: The children were randomly assigned to attend 
or not attend preschool. Large Counts: 38, 62-38=2462 — 38 = 24 49 and 
61-49=1261 — 49 = 12 are 


2 = 0.8033 


= 0.6129 .9=4961=0.803372 — 


DO: p\1=3862=0.6129?! = 
~ 38449 
and aol ~ 62461 
87123=0.7073 is = O- us 
ee We a 
= ) (0.6129—0.8033)—0 _ 2.32 
} 0.7073(0.2927) 0.7073(0.2927) 
i a 


P-value=0.0102P-value = 0.0102, CONCLUDE: Because the P-value of 
0.0102<a=0.050.0102 < a = 0.05, we reject HO#0. There is convincing 
evidence that the true proportion of children like the ones in the study who do 
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attend preschool who use social services later is less than the true proportion of 
children like the ones in the study who do not attend preschool who use social 
services later. In other words, children like those in this study who participate in 
preschool are less likely to use social services later in life. 


2. Because we rejected Hoo , we may have made a Type | error: finding 
convincing evidence that the true proportion of children like the ones in the study 
who attend preschool who use social services later is less than the true proportion 
of children like the ones in the study who do not attend preschool who use social 
services later, when in reality the true proportions are equal. 

3. No; the children who participated in the study were recruited from low-income 
families in Michigan, not randomly selected, so the results cannot be generalized to 
all children from low-income families. 


Answers to Odd-Numbered Section 10.1 Exercises 
10.1 (a) Approximately Normal because nCpC=15, nC(1—pC)=35, nApA=15 
ncpo = 15,nc (1 — pc) = 35, napa = 15 and nA(1-pA)=85 
na (1— pa) = 85 are >10> 10, (b) -=0.30-0.15=0.15 
— = 0.30 — 0.15 = 0.15. (¢) Because 50<1050 < 10, of the jelly beans in the 
Child mix and 100<10%100 < 10% of the jelly beans in the Adult mix, opC 
—pA=0.3(0.7)50+0.15(0.85)100=0.0740 

/0.3(0.7) 0.15(0.85) 
?rc-Pa = V “50 10000 ial The difference 
(Child mix—Adult mix)(Childmix ~ Adultmix) in the sample proportions of red 
jelly beans typically varies by about 0.0740 from the true difference in proportions 
of 0.15. 
10.3 (a) P(p1-p/2<0)=P(z<0—-0.150.0740)=0.0212 


“ “ - 0-0.15 
P (pi — po < 0) = P(z Ss ioe = 0.0212 


(b) Yes, we might doubt the 
company’s claim. There is only a 2% chance of getting a proportion of red jelly 
beans in the Child sample less than or equal to the proportion of red jelly beans in 
the Adult sample if the company’s claim is true. 

10.5 Random: Not met because these data do not come from independent random 
samples or two groups in a randomized experiment. 10%: Since no sampling took 
place, the 10% condition does not apply. Large Counts: Not met because there 
were fewer than 10 successes (3) in the group from the west side of Woburn. 


2038 


10.7 Random: Cockroaches were randomly assigned to one of the two treatment 
groups. 10%: Since no sampling took place, the 10% condition does not apply. 
Large Counts: 25, 11, 18, and 18 are $10. The conditions are met. 


10.9 (a) STATE: p1=theP?1 ~ the true proportion of young men who live in their 
parents’ home and p2=theP?2 ~ the tre proportion of young women who live in 
their parents’ home. PLAN: Two-sample z interval for p1-p2P1 ~ P2. Random: 
Independent random samples. 10%: n1=2253<10%"1 = 2253 < 10% of all 
young men and n2=2629<10%!"2 = 2629 < 10% of all young women. Large 
Counts: 986, 1267, 923, and 1706 are >10= 10. po: (0.051,0.123)(9-951, 0.123) 
CONCLUDE: We are 99% confident that the interval from 0.051 to 0.123 captures 
p1—p2=theP1 ~ P2 = the true difference in the proportions of young men and 
young women who live in their parents’ home. (b) Because the interval does not 
contain 0, there is convincing evidence that the true proportion of young men who 
live at their parents’ home is different from the true proportion of young women 
who live in their parents’ home. 

10.11 STATE: p1=theP1 = the true proportion of people like the ones in this study 
who would say they support President Obama’s decision when asked by Hassan 
and p2=theP2 ~ the true proportion of people like the ones in this study who 
would say they support President Obama’s decision when asked by Ken. PLAN: 
Two-sample z-interval for p1—p2P?1 ~ P2. Random: Two groups in a randomized 
experiment. 10%: Does not apply. Large Counts: 11, 39, 21, and 23 are >102 10, 
DO: (-0.414, -0.100)(—9-414, — 0.100) CONCLUDE: We are 90% confident 
that the interval from -0.414~9-414 to -0.100~-9-100 captures p1—p2=the 

Pi ~ p2 = the true difference in the proportions of people like the ones in this 
study who would say they support President Obama’s decision when asked by 
Hassan or by Ken. 

10.13 No; if the two population proportions were equal, the true difference in 
proportions would be 0. Because 0 is captured in the interval, we think it is 
plausible that the population proportions are equal, but this does not provide 
convincing evidence that the two proportions are equal. 

10.15 (a) HO: pi-p2=040 : Pi — P2 = 9, Ha: pi-p2<0Ha : Pi — p2 <0, 
where p1=the?1 = the true proportion of 4- to 5-year-olds who would sort 
correctly and p2=theP2 ~ the true proportion of 6- to 7-year-olds who would sort 
correctly. (b) Random: Independent random samples. 10%: n1=50<10, 

ny = 90 < 10, of all 4- to 5-year-olds and n2=53<1082 = 53 < 10 of all 6- to 7- 
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year olds. Large Counts: 10, 40, 28, and 25 are all >102 10, 

10.17 (a) HO: p1-p2=040 : pi — P2 = 9, Ha: p1-p2>0Ha : Pi — p2 > 0, 
where p1=the?! =~ the true proportion of shrubs that would resprout after being 
clipped and bummed and p2=theP2 = the true proportion that would resprout after 
being clipped. (b) Random: Two groups in a randomized experiment. Large 
Counts: Not met because there were fewer than 10 failures (0) in the treatment 
group, fewer than 10 successes (8) in the control group, and fewer than 10 failures 
in the control group (4). 

10.19 (a) Yes; pA1—pA2=-0.328P lia 0.328 which gives some evidence 
in favor of Ha: pl-p2<0Ma : Pi ~ P2 < 9, () z=-3.452 = —3.45; p 
value=0.0003P-value = 0.0003 (c) Because the P-value of 0.0003<a=0.05 
0.0003 < a = 0.05, we reject Hoo. We have convincing evidence that the true 
proportion of 4- to 5-year-olds who would sort correctly is less than the true 
proportion of 6- to 7-year-olds who would sort correctly. 

10.21 (a) STATE: HO: p1-p2=0/0 : Pi — p2 = 9 Ha: p1—p2<0 

Hy : pi ~ p2 <0 where pl=the?1 =~ the trye proportion of sophomores who 
bring a bag lunch and p2=theP2 = the true proportion of seniors who bring a bag 
lunch. PLAN: Two-sample z test for p1—p2P1 ~ P2. Random: Independent 
random samples. 10%: n1=80<10,"1 = 80 < 10, of all sophomores and 
n2=104<10"2 = 104 < 10 of all seniors. Large Counts: 52, 28, 78, and 26 are 
>102 10, DO: z=-1.48; P-value=0.06997 = —1.48; P-value = 0.0699 
CONCLUDE: Because the P-value of 0.0699>0=0.050.0699 > a = 0.05, we fail 
to reject Ho/40 . There is not convincing evidence that the true proportion of 
sophomores who bring a bag lunch is less than the true proportion of seniors who 
bring a bag lunch. (b) If there is no difference in the true proportion of sophomores 
and seniors who bring a bag lunch, there is a 0.0699 probability of getting a 
difference in the proportions as large as or larger than the one observed 
(0.65-0.75=-0.10)(9-65 — 0.75 = —0.10) by chance alone. 

10.23 STATE: HO: p1-p2=0M0 : pi — p2 = 9 Ha : pi-p20 

Ha : pi ~ p2 #0 where p1=theP1 = the true proportion of children like the 
ones in this study who are exposed to peanut butter as infants who are allergic to 
peanuts at age 5 and p2=theP2 ~ the tue proportion of children not exposed to 
peanut butter who are allergic at age 5, using a=0.05a@ = 0.05. PLAN: Two- 
sample z test for p1—p2P1 ~ P2. Random: Two groups in a randomized 
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experiment. Large Counts: 10, 297, 55, and 266 are $10310. DO: z=-5.71; 
P{values0? = —5.71; P{value ~ 0 CONCLUDE: Because the P-value of 
approximately 0<a=0.050 < a = 0.05, we reject HO/0. There is convincing 
evidence that the true proportion of children like the ones in this study who are 
exposed to peanut butter as infants who are allergic to peanuts at age 5 is different 
from the true proportion of children like the ones in this study who are not exposed 
to peanut butter as infants who are allergic to peanuts at age 5. (b) Because we 
rejected Hoo, we may have made a Type I error. (c) No; this experiment likely 
had parents who volunteered their infants as subjects. When subjects are not 
randomly selected, we should not generalize the results of an experiment to some 
larger population of interest. 


10.25 (a) STATE: p1=the?! ~ the true proportion of children like the ones in this 
study who are exposed to peanut butter as infants who are allergic to peanuts at age 
5 and p2=theP2 ~ the trye proportion of children not exposed to peanut butter 
who are allergic at age 5. PLAN: Two-sample z interval for p1—p2P1 ~ P2. The 
conditions are met. DO: (-0.185, -0.093)(—9-185, — 0.093) ConcLUDE: We 
are 95% confident that the interval from —0.185~9-185 to -0.093 captures 
pl—p2=theP1 ~ P2 = the trye difference between the proportions of infants like 
these who become allergic to peanut butter if they consume a baby-food form of 
peanut butter versus if they avoid peanut butter. (b) The confidence interval gives 
the values of p1—p2P1 ~ P2 that are plausible based on the sample data. The two- 
sided test only allows us to reject (or fail to reject) a difference of 0, where a 
confidence interval provides a set of plausible values for the true difference. 

10.27 (a) Two-sample z test for p1—p2?1 ~ P2. Random: Two groups ina 
randomized experiment. Large Counts: 44, 44, 21, and 60 are >10= 10, (b) If 
there is no difference in the true pregnancy rates of women who are being prayed 
for and those who are not, there is a 0.0007 probability of getting a difference in 
pregnancy rates as large or larger than the one observed in the experiment 
(0.500-0.259=0.241) (0-500 — 0.259 = 0.241) by chance alone. (c) Because the 
P-value of 0.0007 is less than a=0.05 = 0.05, we reject HO/0 . There is 
convincing evidence that the pregnancy rates among women like these who are 
prayed for is higher than the pregnancy rates for those who are not prayed for. (d) 
It is important that the women who were trying to get pregnant didn’t know 
whether they were being prayed for or not. This knowledge might have affected 
their behavior in some way (even unconsciously) that would have affected whether 
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they became pregnant or not. Then we wouldn’t know if it was the prayer or the 
other behaviors that caused the higher pregnancy rate. 

10.29 (a) HO : p1-p2=0, Ha: p1-p2z0/40 : Pi — P2 = 9, Ha : pi — p2 FO, 
where p1=the?1 the true proportion of subjects like these who would admit to 
texting and driving when asked Version A of the question and p2=the?2 ~ the 
true proportion of subjects like these who would admit to texting and driving when 
asked Version B. (b) Conditions not satisfied; the number of subjects who did not 
admit to texting and driving when asked Version A (25-16=9)(25 — 16 =9) is 
less than 10. (c) p1-p2=0.64—0.48=0.16P1 ~ P2 = 0.64 — 0.48 = 0.16. there 
are 21 trials of the simulation with a difference of proportions less than or equal to 
-0.16—9-16 and 14 trials of the simulation with a difference of proportions greater 
than or equal to 0.16. Estimated P-value=35/100=0.35/ ~Value = 35/100 = 0.35. 
(d) Because the estimated P-value of 0.35>a=0.050.35 > a@ = 0.05, we fail to 
reject Hoo. We do not have convincing evidence that there is a difference 
between the proportions of subjects like these who would admit to texting and 
driving when asked Version A versus when asked Version B of the question. 

10.31 a 

10.33.¢ 

10.35 (a) yA=-13,832+14,954x¥ = —13, 832 + 14, 954% where yA=they = the 
predicted mileage and x=thet = the age in years of the cars. (b) The predicted 
number of miles goes up by 14,954 for each increase of 1 year in age. (c) yA= 
~13,832+14,954(10)=135,7089 = —13, 832 + 14,954 (10) = 135, 708 
Residual=110,000-135,708=—25,708 

Residual = 110,000 — 135, 708 = —25, 708 miles. The student’s car had 25,708 


fewer miles than predicted, based on its age. 


Section 10.2 

Answers to Check Your Understanding 

page 655: 1. STATE: 95% CI for 1-2/1 ~ #2, where p1=thel4i = the tre 
mean change in pulse rate for students like these after drinking 12 ounces of cola 
with caffeine and :2=the#2 ~ the true mean change in pulse rate for students like 
these after drinking 12 ounces of caffeine-free cola. PLAN: Two-sample t interval 
for 11-21 ~ #2, Random: The volunteers were randomly assigned to drink 
either cola with caffeine or caffeine-free cola. Normal/Large Sample: The sample 
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sizes are small, but the dotplots do not show any outliers or strong skewness. 


Caffeine 


e 
J J 
. 
No Caffeine --——}—__—__$ ° 
2 -l 0 1 3 4 5 6 7 8 
Change (Final — Initial) in 
pulse rate (beats/min) 


n -e 


2.62, n2=82 = 10 


Option 1: 2-Samp TInt gives (-1.302, 3.702)(—1-302, 3.702) using df=17.986 
df = 17.986, 


Option 2: df=9, t*=2.262df = 9, t" = 2.262 
(3.2-2)+2.2622.70210+2.62210=1.2+2.691=(-1.491, 3.891) 


/2.702 2.622 


= 1.2 + 2.691 = (—1.491, 3.891) 
CONCLUDE: We are 99% confident that the interval from -1.491~1-491 
to 3.891 beats per minute captures p1—p2=the#1 ~ #2 ~ the difference in 
the true mean change in pulse rate for students like these after drinking 12 
ounces of cola with versus without caffeine. 


2. The interval in Question 1 does not suggest that caffeine increases the average 
pulse rate of subjects like these. Because the interval contains 0, it is plausible that 
the true mean change in pulse rate for all students like these after drinking 12 
ounces of cola with versus without caffeine may equal 0, indicating no difference. 
page 663: STATE: HO: y1-p2=0/L0 aS i ial 0 Ha : p1-p2>0 

Ha : 4 — H2 > 0 where 1=theM1l = the true mean breaking strength of 
polyester fabric buried for 2 weeks and p2=the2 = the tue mean breaking 
strength of polyester fabric buried for 16 weeks. PLAN: Two-sample t test for 
1-21 ~ #2, Random: The strips were randomly assigned to the treatments. 
Normal/Large Sample: The dotplots show no strong skewness and no outliers. 


2 weeks oe Ss 
e 


Pk —— 


102 108 114 120 126 132 138 
Breaking strength (Ib) 


DO: x 1=123.8, s1=4.60, n1=5, x 2=116.4, s2=16.09 
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Z, = 123.8, 8; = 4.60, n) = 5, Fz = 116.4, s2 = 16.09. 


n2=5"2 = 9; t= 0.99. Option 1: P-value=0.1857P-value = 0.1857 using 
df=4.65df = 4.65, Option 2: P-value between 0.15 and 0.20 using df = 4. 
CONCLUDE: Because the P-value of 0.1857>a=0.050.1857 > a = 0.05, 
we fail to reject HO/0. We do not have convincing evidence that the true 
mean breaking strength of polyester fabric that is buried for 2 weeks is 
greater than the true mean breaking strength for polyester fabric that is 
buried for 16 weeks. 


Answers to Odd-Numbered Section 10.2 Exercises 
10.37 (a) Normal because both population distributions are Normal. (b) px M 
-xB=188-170=1842u—ep = 188-170 = 18 mg/dl (c) Because 25<10% 
25 < 10% of all 20- to 34-year-old males and 36<10%,36 < 10%, of all 14-year- 
old boys, 

~ - <= [ae 4 30% = 9.60 mg/dl 
ox M-x B=41225+30236=9.60 mg/dl *“~78 = -V 25 * 36 , 
The difference (20- to 34-year- 
old males — 14-year-old boys) in the sample mean blood cholesterol levels 
typically varies by about 9.60 mg/dl from the true difference in means of 18 mg/dl. 
10.39 (a) P(x M-x B<0)=P(z<0-189.60)=0.0304 


P(Sy —Es <0)=P (2 < t) = 0.0304 
tu Fe< 0) 9.60 (b) Yes; the likelihood that the 


sample mean cholesterol level of the boys is greater than the sample mean 
cholesterol level of the men is only 3%. 

10.41 Random: Met because these are two independent random samples. 10%: Met 
because 20<10%20 < 10%, of all males at the school and 20<10%20 < 10%, of 
all females at the school. Normal/Large Sample: Not met; there are fewer than 30 
observations in each group and a dotplot for males shows several outliers. 


TA. 
eee 
ese 
Males eceee ose © > oe 


Females > T 2+ 2 sie 8 er | T ~ ro 


0 5 10 15 20 25 3# 35 40 45 SO 55 60 


Number of pairs of shoes 


10.43 Random: Met because these are two independent random samples. 10%: Met 
because 50<10%90 < 10%, of students in the United Kingdom and 50<10% 
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50 < 10%, of students in South Africa. Normal/Large Sample: Met because 
50>3099 = 30 and 50230909 = 30, even though there is an outlier in the South 
African distribution. 

10.45 (a) The distributions of percent change are both slightly skewed to the left 
with no apparent outliers. The centers of the two distributions seem to be quite 
different, with people drinking red wine generally having more polyphenols in 
their blood. The distribution of percent change for the white wine drinkers is a bit 
more variable. (b) STATE: p1=the/! = the tue mean percent change in 
polyphenol level in the blood of people like those in the study who drink red wine 
and y.2=the#2 = the true mean percent change in polyphenol level in the blood of 
people like those in the study who drink white wine. PLAN: Two-sample t interval 
for p1-p2!1 ~ #2, Random: Two groups in a randomized experiment. 
Normal/Large Sample: The dotplots show no strong skewness and no outliers. DO: 
x 1=5.5, s1=2.51721 = 5-5, 81 = 2.517, 1 =9, x-2=0.23, s2=3.292 

nm = 9, 2 = 0.23, 82 = 3.292) and n2=9"2 = 9. Using df=14.97df = 14.97, 
(2.845, 7.689); using df=8,(2.701, 7.839)4f = 8; (2-701, 7.839) CONCLUDE: 
We are 90% confident that the interval from 2.845 to 7.689 captures 11-p2=the 

}41 — Ha = the trye difference in mean percent change in polyphenol level for 
men like these who drink red wine and men like these who drink white wine. 

10.47 (a) Skewed to the right because the earnings cannot be negative, yet the 
standard deviation is almost as large as the distance between the mean and 0. The 
use of the two-sample t procedures is justified because the sample sizes are both 
very large (675230 and 621>30)(675 = 30 and 621 > 30) () STATE: 11=the 
/41 = the tye mean summer earnings of male students and p2=thel42 = the tye 
mean summer earnings of female students. PLAN: Two-sample t interval for 
1-21 ~ #2, Random: It is reasonable to consider the two samples independent 
because knowing the response of a male shouldn’t help us predict the response of a 
female. 10%: n1=675<10%"1 = 675 < 10% of male students and n2=621<10% 
nz = 621 < 10% of female students. Normal/Large Sample: n1=675>30 

nm = 675 > 30 and n2=621230"2 = 621 > 30. Using df=1249.21df = 1249.21, 
(413.62, 634.64); using df=620, (413.54, 634.72)" = 620, (413.54, 634.72). 
CONCLUDE: We are 90% confident that the interval from $413.62 to $634.64 
captures }11-p1.2=the/#1 ~ #2 = the true difference in mean summer earnings of 
male students and female students at this large university. (c) If we took many 
random samples of 675 males and 621 females from this university and each time 
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constructed a 90% confidence interval in this same way, about 90% of the resulting 
intervals would capture the true difference in mean earnings for males and females. 
10.49 (a) No, because our interval includes a difference of 0 (no difference) as a 
plausible value. (b) No; instead, we don’t have convincing evidence that the mean 
reaction times are different. Zero is a plausible value for the difference in means, 
but there are many other plausible values besides 0 in the confidence interval. 
10.51 (a) HO: pi-p2=00 : Hi ~ #2 = 9 Ha: p1—p2%0He Mi — Me FO, 
where p1=thel = the true mean time it would take to sort the cards for students 
like these while listening to music and ,12=the/2 = the true mean time it would 
take to sort the cards for students like these while not listening to music. (b) 
Random: Two groups in a randomized experiment. Normal/Large Sample: Not 
met; the sample sizes are small. The distribution of time to sort for those who listen 
to music shows no strong skewness and no outliers, but the distribution of time to 
sort for those who listen to no music is skewed right with one upper outlier. 

10.53 (a) HO: p1-p2=00 : Hi — He = 9 a: pi—poz0He * Hi — Me FO 
where p1=thel = the true mean reliability rating of Anglo customers and 12=the 
Ha = the trye mean reliability rating of Hispanic customers. (b) Random: 
Independent random samples. 10%: n1=92<10%"1 = 92 < 10% of Anglo 
customers and n2=86<10%"2 = 86 < 10% of Hispanic customers. Normal/Large 
Sample: n1=92>30" = 92 = 30 and n2=86>30" = 86 = 30, 

10.55 (a) x 1-x 2=6.37-5.91=0.4621 — 2 = 6.387 — 5.91 = 0.46 points, which 
gives some evidence in favor of Ha: y1-p2204a > fi — M2 FO. (b) t=3.89 

t = 3.89: using df=143.69, P-value=0.00024f = 143.69, P-value = 0.0002. 
Using df=85, P-value=0.00024f = 85, P-value = 0.0002  (¢) Because the P- 
value of 0.0002<a=0.050.0002 < a = 0.05, we reject HO/0. We have 
convincing evidence that the true mean reliability rating for all Hispanic customers 
is different from the true mean reliability rating for all Anglo customers. 

10.57 (a) STATE: HO: 1-2-0440 ae ia a 0. Ha : p1-p2>0 

Ay : 1 — 2 > 0 j4, = true 


mean reduction in blood pressure 
of2 = true 


, where p11=true 
for subjects like these who take fish oil and p2=tru mean reduction in 
blood pressure for subjects like these who take regular oil. PLAN: Two-sample t 
test for 11-21 ~ #2, Random: Two groups in a randomized experiment. 


Normal/Large Sample: The dotplots show no strong skewness and no outliers. 
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Change in blood pressure 


DO: x 1=6.571, s1=5.855, nl=7, x 2=-1.143, s2=3.185 

E, = 6.571, s; = 5.855, ny = 7, F, = —1.143, 82 = 3.185 y9—-7n2 =7. 
t=3.06t = 3.06; using df=9.264df = 9.264, P-value=0.0065 

P-value = 0.0065, Using df=6df = 6, P-value=0.0111P-value = 0.0111, 
CONCLUDE: Because the P-value of 0.0065<a=0.050.0065 < a = 0.05, 
we reject HO0. We have convincing evidence that fish oil helps reduce 
blood pressure more, on average, than regular oil for subjects like these. 
(b) Assuming that the true mean reduction in blood pressure is the same 
regardless of whether the individual takes fish oil or regular oil, there is a 
0.0065 probability that we would observe a difference in sample means of 
7.714 or greater by chance alone. 


10.59 STATE: HO: p1-p2=0/0 + Hi ~ #2 = 9 Ha: 1-120 
Hq = #1 ~ M2 #9 where p1=the4l = the true mean number of words spoken 
per day by female students and 12=the#2 ~ the true mean number of words 
spoken per day by male students, using a=0.05@ = 0.05, PLAN: Two-sample t 
test for p1-2M1 ~ #2, Random: Independent random samples. 10%: n1=56<10% 
nm, = 56 < 10% of females at a large university and n2=56<10%"2 = 56 < 10% 


of males at a large university. Normal/Large Sample: n1=56>30"1 = 56 > 30 and 
n2=56>30% = 56 > 30, 


DO: t=-0.25t = —9.29; using df=106.195, P-value=0.8043 

df = 106.195, P-value = 0.8043 Using df=55df = 55, P-value=0.8035 
P-value = 0.8035, CONCLUDE: Because the P-value of 0.8043>a=0.05 
0.8043 > a = 0.05, we fail to reject HOM0. We do not have convincing 
evidence that the true mean number of words spoken per day by female 
students is different from the true mean number of words spoken per day 
by male students at this university. 


10.61 (a) DO: x 1=16,177%1 = 16,177 51-759081 = 7520 y1=567m1 = 96, 
x 2=15,569%2 = 15,569 <9-919882 = 9108 n2=-56"2 = 96. Using df=106.2, 
(-3521, 2737)4f = 106.2, (—3521, 2737). using df=55, (-3555, 2771) 

df = 55, (—3555, 2771) CONCLUDE: We are 95% confident that the interval 
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from —3521 to 2737 words captures p1—12=the/1 ~ #2 = the true difference in 
mean number of words spoken per day by female students and the mean number of 
words spoken per day by male students at this university. (b) The confidence 
interval gives the values of 1-2/1 ~ #2 that are plausible based on the sample 
data. The two-sided test only allows us to reject (or fail to reject) a difference of 0, 
where a confidence interval provides a set of plausible values for the true 
difference in means. 


10.63 (a) The score distribution for the activities group is slightly skewed to the 
left while the score distribution for the control group is slightly skewed to the right. 
Neither distribution has any clear outliers. The center of the activities group is 
higher than the center of the control group. The scores in the activities group are 
less variable than the scores in the control group. Overall, it appears that scores are 
typically higher for students in the activities group. (b) Because the P-value of 
0.013<a0=0.050.013 < a = 0.05, we reject HOM. We have convincing evidence 
that the true mean DRP score for third-grade students like the ones in the 
experiment who do the activities is greater than the true mean DRP score for third- 
grade students like the ones in the experiment who don’t do the activities. (c) Yes, 
because this was a randomized controlled experiment. (d) Because we rejected the 
null hypothesis, we could have committed a Type I error. 

10.65 (a) STATE: HO: y1-p2=10/40 as a! aa a 10 Ha : p1-p2>10 

Hy : [4 — f2 > 10° where p1=thel = the true mean cholesterol reduction for 
people like the ones in the study when using the new drug and p2=the#2 = the 
true mean cholesterol reduction for people like the ones in the study when using 
the current drug, using a=0.05@ = 0.05, PLAN: Two-sample ¢ test for p1-12 

#1 ~ #2. Random: The subjects were assigned at random to the new or current 
drug. Normal/Large Sample: The graphs of the data show no strong skewness or 
outliers. DO: t=0.98t = 0.98; using df=26.96, P-value=0.1675 

a= 26.96, P-value = 0.1675. Using df=13, P-value=0.1725 

df = 13, P-value = 0.1725 CONCLUDE: Because the P-value of 
0.1675>0=0.050.1675 > a = 0.05, we fail to reject HO/40. We do not have 
convincing evidence that the true mean cholesterol reduction is more than 10 mg/dl 
greater for the new drug than for the current drug. (b) Because we failed to reject 
the null hypothesis, we could have committed a Type II error. 


10.67 (a) The researchers randomly assigned the subjects to create two groups that 
were roughly equivalent at the beginning of the experiment. (b) The difference in 
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the sample means is 4.14. Based on the dotplot, only 1 out of the 100 differences 
were that great, meaning that the P-value is approximately 0.01. Assuming that the 


true difference (Intrinsic-Extrinsic) Untrinsic ~ Extrinsic) 


in mean creativity 
rating is 0, there is a 0.01 probability that we would observe a difference in sample 
means of 4.14 or greater by chance alone. (c) Because the P-value of 0.01<a=0.05 
0.01 < a = 0.05, we have convincing evidence that the true mean rating for 
students like these who are provided with internal reasons is higher than the true 
mean rating for students like these who are provided with external reasons. (d) 
Because we found convincing evidence that the mean is higher for students with 
internal reasons when it is possible that there is no difference in the means, we 
could have made a Type I error. 

10.69 d 

10.71 b 


10.73 (a) By the 68—95—99.7 rule, about 5% of all observations will fall outside of 
this interval. P(at least one mean outside interval)=1—(0.95)2= 0.0975 
P(at least one mean outside interval) = 1 — (0.95)? = 0.0975 (b) By the 68- 


95-99.7 rule, about 2.5% of all observations will be greater than px +20x 
Hz +202 Let X=theX = the number of samples that must be taken to observe 
one greater than px +2ox Mz 4 202 Xisa geometric random variable with 


p=0.025P = 9.025, 

P(X=4)=(1-0.025)3(0.025)=0,0232 (X = 4) = (1 — 0.025)’ (0.025) = 0.0232 
. (c) By the 68—95-99.7 rule, the probability of any one observation falling within 
the interval px -ox Mz ~ %% to px tox Mz + % is about 0.68. Let X=the 

X = the number of sample means out of 5 that fall outside this interval. Assuming 
the samples are independent, X is a binomial random variable with n=5" = 5 and 
p=0.32P = 9.32. p(x>4)=1-binomcdf(trials: 5, p: 0.32, x value: 3)=0.039 

P(X > 4) = 1—binomedf (trials : 5, p : 0.32, rvalue : 3) = 0.039 There is a 


0.039 probability that at least 4 of the 5 sample means fall outside of this interval. 
This is a reasonable criterion, because when the process is under control, we would 
get a “false alarm” only about 4% of the time. 


Section 10.3 
Answers to Check Your Understanding 
page 678: 1. STATE: 90% CI for pidiff=theMdiff = the true mean difference 
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(Music—Without music) (Music ~ Without music) in time for students like the 
ones in this study to complete the arithmetic test. PLAN: One-sample t interval for 
ydiffMdiff, Random: The volunteers were randomly assigned the order of the music 
and no-music treatments. Normal/Large Sample: ndiff=30>30"dift = 30 > 30. 
DO: x diff="aitt = 2.8, sdiff=7.4908aitt = 7-490 ndiff=30"ait = 30, With 90% 
confidence and df=30-1=29, t*=1.6994f = 30 — 1 = 29, 4° = 1.699, 
2.8+1.6997.49030=2.842.323=(0.477, 5.123) 

2.8 + 1.699 = 2.8 + 2.323 = (0.477, 5.123) 

Tech: The Tinterval function gives (0.477, 5.123) with df=29df = 29, 
CONCLUDE: We are 90% confident that the interval from 0.477 to 5.123 captures 
Music — Without music) 


the true mean difference (Music—Without music)| in 


time for students like these to complete the arithmetic test. 


2. The 90% confidence interval in Question 1 suggests that students are between 
0.477 and 5.123 seconds slower, on average, when completing the arithmetic test 
while music is playing. 

page 682: STATE: HO: pdiff=0/0 * Mait =9 Ha: pdift-oHa * Maite > 0, 
where jdiff=diff = the true mean difference (Air—-Nitrogen)(Air — Nitrogen) jy 
pressure lost using a=0.05@ = 0.05, PLAN: Paired t test for pdiffHdiff. Random: 
Tires in each pair are randomly assigned to be filled with air or nitrogen. 
Normal/Large Sample: ndiff=31230"aitt = 31 = 30. po; x diff=1.252 

Gait = 1.252, cqitf=1.202S8aitt = 1.202, and ndiff=31"ait = 31, 

t= 2 = 5.80 

t=1.252-01.20231=5.80 vil ; df=30df = 30 and P-value~0 
P-value ~ 0, CONCLUDE: 


Because the P-value of approximately 0<a=0.050 < a = 0.05, we reject HOM. 
We have convincing evidence that the true mean difference (Air—Nitrogen) 

(Air — Nitrogen) j, pressure lost is greater than 0. In other words, we have 
convincing evidence that tires lose less pressure when filled with nitrogen than 


when filled with air, on average. 


Answers to Odd-Numbered Section 10.3 Exercises 
10.75 (a) 
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Difference (Weight — Groove) in the 
estimate of total distance traveled 
(1000s of mi) 


(b) Most of the differences are positive, meaning that the estimates of distance 
traveled as gauged by the weight tend to be greater than the estimates of distance 
traveled as gauged by the grooves of the tire. (c) x diff=4.556 aiff = 4.556, 
sdiff=3.226Sditt = 3.226. The estimate of the number of miles driven using the 
“weight” method is 4.556 thousand miles greater, on average, than the estimate of 
the number of miles driven using the “groove” method. 

10.77 (a) The pilot recorded two values for each individual (each of 12 randomly 
selected days). (b) Most of the differences are positive, meaning that the outbound 
flights (Dubai to Doha) tend to take longer. (c) x diff=10.083* aiff = 10.083 | 


sdiff=10.766Sditt = 10.766. the difference (Outbound-Return) 


(Outbound — Return) ;, flight times typically varies by about 10.766 minutes 


from the mean of 10.083 minutes. 

10.79 STATE: pdiff=theMdift = the true mean difference in the estimates from 
these two methods in the population of tires. PLAN: One-sample t interval for 
udiff4diff, Random: Random sample of 16 tires. 10%: 16<10%16 < 10% of all 
tires. Normal/Large Sample: The dotplot in Exercise 75 does not show any strong 
skewness or outliers. 

DO: x diff=4.556%aitt = 4.556 cqitf=3,2268aitt = 3-226, and ndiff=16; df=15 
naitt = 16; df = 15. (2.837, (2-837, 6.275)6-275)_ CONCLUDE: We are 95% 
confident that the interval from 2.837 to 6.275 thousand miles captures the true 
mean difference in the estimates from these two methods in the population of tires. 


10.81 (a) STATE: diff=the/aitt = the trye mean difference (After-Before) 


(After — Before) ;, reasoning scores for all preschool students who take six 


months of piano lessons. PLAN: One-sample t interval for pdiffHdiff . Random: 
Random sample of 34 preschool children. 10%: 34<10%,34 < 10%, of all 
preschool children. Normal/Large Sample: ndiff=34230"aitt = 34 = 30 po: 

x diff=3.6187 ait = 3-618  caiff=3.055Saitt = 3-055 ndiff=34nair = 34; f= 
43, (0731, 4.505) 44 = 33, (2.731, 4.505) 

CONCLUDE: We are 90% confident that the interval from 2.731 to 4.505 captures 
ydiff=Mdiff ~ the true mean difference (After-Before)(After ~ Before) jy 
reasoning scores for all preschool students who take six months of piano lessons. 
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(b) No; a randomized experiment is needed to show causation. 


10.83 The dotplot of the distribution of difference (Outbound—Return) 


(Outbound — Return) ;, flight times is skewed right with a possible high outlier, 


violating the Normal/Large Sample condition. 

10.85 STATE: HO : pdiff=0, Ha: pdiffzo40 : Maite = 9,Ha : Maite #0 where 
udiff=theMdift = the tue mean difference (Inside — Drive-thru) in time it would 
take to receive an iced coffee at this Dunkin’ Donuts restaurant after placing the 
order, using a=0.05@ = 0.05, PLAN: Paired t test for pdiffMaitt , 


aT: ia = ‘hg 
-5 0 5 10 15 
Difference (Inside — Drive-thru) 
in time (sec) 


Random: The times were randomly selected. Normal/Large Sample: The dotplot 
does not show any strong skewness or outliers. 

DO: x diff=3.9% ait = 3-9 seconds, sdiff=5.646Saitt = 5-646 | and ndiff=10 

naistt = 10, ¢=2,1g¢ = 2.18; df=9df = 9; P{value=0.0568/ {value = 0.0568, 
CONCLUDE: Because the P-value of 0.0568>a0=0.050.0568 > a = 0.05, we fail 
to reject HoZ0. We do not have convincing evidence that the true mean difference 


(Inside — Drive-thru) Hae: 


(Inside—Drive-thru) 
10.87 (a) Yes! The data arose from a randomized comparative experiment, so if the 
result of this study is statistically significant, we can conclude that the difference in 
the ability to memorize words was caused by whether students were performing the 
task in silence or with music playing. (b) STATE: HO: pdiff=0, Ha : pdiff+0 

Ho : bait = 0, Ha : bait #9 where ydiff=Mdiff = the true mean difference 
(Music—Silence)(Music — Silence) in the average number of words recalled by 
students at this school. a=0.01@ = 0.01, PLAN: Paired t test for pdiffMdiff , 
Random: The treatments were assigned in a random order. Normal/Large Sample: 
ndiff="aiff = 30 > 30. DO: x diff=" ait! = 1.57, sdiff=Sdiff = 2.70, and ndiff= 
Ndiff = 30; t=3.18t = 3.18; df = 29; P-value = 0.0034. CONCLUDE: Because the 
P-value of 0.0034 < a=0.01a = 0.01, we reject HOM0, We have convincing 


Music — Silence) j, the 


evidence that the true mean difference (Music—Silence)! 
average number of words recalled by students at this school is different from 0. (c) 
Because we rejected Hoo , we may have made a Type I error. 

10.89 (a) STATE: pdiff=Mdiff — the true mean difference (Music—Silence) 


(Music — Silence) jn the average number of words recalled by students at this 
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school. PLAN: One-sample t interval for pdiffHdiff. Random: The treatments were 
assigned in a random order. Normal/Large Sample: ndiff=30>30"diff = 30 > 30. 
DO: x diff=1.57% ait = 1-57, saitf=2.708ait = 2-70, ndiff=30aitt = 30; af= 29 
df = 29, (0,211, 2.929)(9-211, 2.929) CONCLUDE: We are 99% confident that 
the interval from 0.211 to 2.929 captures the true mean difference (Music—Silence) 
(Music — Silence) jy the number of words recalled by students at this school. (b) 
The confidence interval gives the values of jdiffdiff that are plausible based on 
the sample data. The two-sided test only allows us to reject (or fail to reject) a 
difference of 0, where a confidence interval provides a set of plausible values for 
the true mean difference. 

10.91 (a) Two-sample ¢ test; the data are being produced using two distinct groups 
of cars in a randomized experiment. (b) Paired t test; this is a matched pairs 
experimental design where both treatments are applied to each subject in a random 
order. (c) Paired t test; the data were collected from the male and female partners in 
AO couples. 

10.93 (a) The table shows two values for each individual (each student). (b) Most 
of the differences (Baseball—Softball)( Baseball — Softball) a1 positive, meaning 
the students tend to throw the baseball farther than the softball. (c) 

HO: diff=0, Ha: pdiff>o0 * Hai = 0, Ha : Maitt > 0 where pdiff=the 

Haitt = the tye mean difference (Baseball—Softball)( Baseball — Softball) jn 
distance thrown. (d) ndiff=24<30"diff — 24 < 30 and the boxplot of the 
differences has an outlier. (e) A mean difference of 6.54 or larger happened in 0 of 
the 100 simulation trials, so the estimated P-value is 0. Because the P- 
value<a=0.05P-value < a = 0.05, we reject HOM0. These data provide 
convincing evidence that students like these can throw the baseball farther, on 
average, than the softball. 

10.95 a 

10.97 e 

10.99 (a) One-sample z interval for a proportion. (b) Paired t test for the mean 
difference. (c) Two-sample z interval for the difference in proportions. (d) Two- 
sample t test for a difference in means. 

10.101 (a) Paired t test; we have paired data (two scores for each student). (b) 
STATE: HO: pdiff=00 * Maitt = 9 Ha: paitt>oM%e * Maitt > 9 where 


udiff=theMdift = the true mean increase in SAT verbal scores of students who 


2053 


were coached, using a=0.05@ = 0.05 PLAN: Paired t test for pdiffHdiff, Random: 
Random sample of 427 students. 10%: ndiff=427"diff = 427 is less than 10% of 
students who are coached. Normal/Large Sample: ndiff=427>30"dift — 427 > 30. 
DO: t=10.16t = 10.16; df=426; df = 426; pyaluexoP-value ~ 0. 
CONCLUDE: Because the P-value of approximately 0<a=0.050 < a = 0.05, we 
reject Hoo. There is convincing evidence that students who are coached increase 


their scores on the SAT verbal test, on average. 


Answers to Chapter 10 Review Exercises 


R10.1 (a) Approximately Normal because 80, 20, 42, and 28 are >10= 10. (b) 
up\N-pAK=0.80-0.60=0.20" PPK i= An Gaieeead (c) Because 
100<10%,100 < 10%, of all registered cars in Nathan’s state and 70<10% 


70 < 10% of all registered cars in Kyle’s state, op\N 
/0.8(0.2) 0.60.4) 


= Vv 00 * 7 = 0.071 


—~p\K=0.8(0.2)100+0.6(0.4)70=0.071° > PK 
The difference (Nathan—-Kyle)(Nathan ~ Kyle) in the sample proportions of 
registered cars that are made by American manufacturers typically varies by about 
0.071 from the true difference in proportions of 0.20. 

R10.2 STATE: pT=true proportion of teens?7? = true proportion of teens who 
use Facebook and pA=trueP?4A = tUE proportion of adults who use Facebook. 
PLAN: Two-sample z interval for pT-pAPT ~ PA. Random: Independent random 
samples. 10%: nT=1060<10%,"7 = 1060 < 107%, of all U.S. teens and 
nA=2003<10%tA = 2003 < 10% of all U.S. adults. Large Counts: 753, 307, 
1162, and 841 are >10> 10%, p0:0.084, 0.176)DO : (0.084, 0.176) 
CONCLUDE: We are 95% confident that the interval from 0.084 to 0.176 captures 
pT-pA=theP? ~ PA = the qifference in the true proportions of all U.S. teens and 
all U.S. adults who use Facebook. 

R10.3 (a) STATE: HO : p1-p2=0/0 : pi — P2 = 9) Ha : p1—p2<0 

H, : pi — p2 < 0, where pl=the?1 the trye proportion of patients like these 
who take AZT and develop AIDS and p2=theP?2 ~ the true proportion of patients 
like these who take placebo and develop AIDS, using a=0.05@ = 0.05, PLAN: 
Two-sample z test for pl1—p2P?1 ~ P2. Random: The subjects were assigned at 
random to take AZT or a placebo. Large Counts: 17, 418, 38, and 397 are >10 

2 10, DO:z=-2.93D0 : z = —2.93; p-vyalue=0.0017P-value = 0.0017 
.CONCLUDE: Because the P-value of 0.0017<a=0.050.0017 < a = 0.05, we 
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reject Hoo. We have convincing evidence that taking AZT lowers the proportion 
of patients like these who develop AIDS compared to a placebo. (b) Type I: 
Finding convincing evidence that AZT lowers the risk of developing AIDS, when 
in reality it does not. Consequence: Patients will pay for a drug that doesn’t help. 
Type II: Not finding convincing evidence that AZT lowers the risk of developing 
AIDS, when in reality it does. Consequence: Patients won’t take the drug when it 
could delay the onset of AIDS. 


R10.4 Shape: Because n1=49>30 = 49 = 30 and n2=49>30" = 49 = 30 the 
shape of the distribution of x 1-x 2%1 — 2 is approximately Normal. Center: 
ux 1-x2=15-15=0M#1-#2 = 15 -15=0 0 Variahitity: Because 49<10% 
49 < 10% of all candles made by Machine 1 and 49<10%49 < 10% of all 
candles made by Machine 2, ox 1-x 2=0.15249+0.10249=0.026 


/ 0.152 0.10? 


wd 49 


© ,—-T92 


R10.5 (a) STATE: p1=the! = the true mean NAEP quantitative skills test score 
for young men and p2=thel42 = the true mean NAEP quantitative skills test score 


~ #2. Random: It is 


for young women. PLAN: Two-sample t interval for p1—p21 
reasonable to consider the two samples independent because knowing the response 
of a male shouldn’t help us predict the response of a female. 10%: n1=840<10%, 
n, = 840 < 10%, of all young men and n2=1077<10%!"2 = 1077 < 10% of all 
young women. Normal/Large Sample: n1=840>30" = 840 > 30 and 
n2=1077230"2 = 1077 > 30, po; x1=272.4081 = 272.40, 5j=59,281 = 59.2, 
n1=840, x 2=274.73, s2=57.5'1 = 840, £2 = 274.73, 82 = 57.5, y9=1077 

nz = 1077, Using df=1777.52df = 1777.52, (-6.76, 2.10)\—-76; 2-10). using 
df=839, (-6.76, 2.10)¢f = 839, (—6.76, 2.10) CONCLUDE: We are 90% 
confident that the interval from —6.76 to 2.10 captures 11-p2=the/1 ~ #2 — the 
true difference in the mean NAEP quantitative skills test score for young men and 
the mean NAEP quantitative skills test score for young women. (b) Because 0 is in 
the interval, it is plausible that the true difference is 0. That is, we do not have 
convincing evidence of a difference in mean score for male and female young 
adults. 


R10.6 (a) The difference data come from two groups in a randomized experiment. 
(b) The distribution of differences for the control group is slightly skewed to the 
right while the distribution of differences for the treatment group is roughly 
symmetric. Neither distribution has any clear outliers. The center for the treatment 
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group is greater than the center for the control group. The differences in the control 
group are more variable (based on the JQR) than the differences in the treatment 
group. Overall, it appears that students in the treatment group had bigger 
improvements, on average. (c) STATE: HO : u1-12=0440 :p — pa = 0 

Ha: p1-p2>0/4a ‘Hi — H2 > O where p1=the!1 = the true mean difference in 
test scores for students like these who get the treatment message and 12=the 

}42 = the tye mean difference in test scores for students like these who get the 
neutral message, using a=0.05@ = 0.05, PLAN: Two-sample t test for 11-12 

#1 ~ #2. Random: The students were assigned at random to the treatment and 
control groups. Normal/Large Sample: The boxplots show no strong skewness and 
no outliers. DO: x 1=11.4, s1=3.169, n1=10, x 2=8.25, s2=3.69 

f, = 11.4, 8; = 3.169, n; = 10, Fz = 8.25, 82 = 3.69 yo=gn2 = 8. t=1.91 
t = 1.91. Using df=13.919df = 13.919, P-value=0.0382/ {value = 0.0382, 
using df=7df = 7, P-value=0.0489P-value = 0.0489, CONCLUDE: Because the 
P-value of 0.0382<a=0.050.0382 < a = 0.05, we reject HO/0. There is 
convincing evidence that the true mean difference in test scores for students like 
these who get the treatment message is greater than the true mean difference in test 
scores for students like these who get the neutral message. (d) We cannot 
generalize to all students who failed the test because our sample was not a random 
sample of all students who failed the test. It was a group of students who agreed to 
participate in the experiment. 


R10.7 (a) 


2 2 ee 
T T T T T Li T 
0.1 -0,05 0 005 O.1 015 02 025 03 


Difference (Standing — Blocks) in time (sec) 


(b) x diff=0.1317 aitt = 9-131, sqitt=9,119Saitt = 9-119 For the 8 sprinters in the 
sample, the 50-meter dash time averages 0.131 second longer with the standing 
start than with the blocks. This gives some evidence that the 50-meter dash takes 
longer with a standing start, on average, than with the blocks. (c) STATE: 

HO: uditt=040 ‘ Maitt = 9 Ha: udift>oZa : Maite > 9 where udiff=Mdiff = the 
true mean difference (Standing — Blocks) in 50-meter run time, using a=0.05 

a = 0.05, PLAN: Paired t test for pdiffHdiff. Random: The order was randomized. 
Normal/Large Sample: The dotplot shows no strong skewness and no outliers. DO: 
x diff=0.131 Fait = 0.131) caitt=0.1198aitt = 9-119 and ndiff=B"ait = 8; 
t=3.11t = 3.11. Using df=7df = 7, P-value=0.0085P-value = 0.0085, 
CONCLUDE: Because the P-value of 0.0085<a=0.050.0085 < a = 0.05, we 
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reject Hoo. We have convincing evidence that the true mean difference 
(Standing—Blocks)(Standing ~ Blocks) in 50-meter run time for sprinters like 
these is greater than 0. (d) STATE: diff=the trueMaitt = the true jean difference 
(Standing—Blocks)(Standing ~ Blocks) jn 50-meter run time. PLAN: One- 
sample t interval for j:diffMdiff, Conditions were checked in part (c). DO: 

(0.0513, 0.2112)(9-0518, 0.2112) CONCLUDE: We are 90% confident that the 
interval from 0.051 to 0.211 captures jdiff=the true/aite = the true mean 


Standing — Blocks) 


difference (Standing—Blocks)\ in 50-meter run time for 


sprinters like these. The confidence interval gives the values of pdiffMdiff that are 
plausible based on the sample data. The test allows us to reject (or fail to reject) a 
difference of 0, where a confidence interval provides a set of plausible values for 
the true mean difference. 


Answers to Chapter 10 AP® Practice test 

10.1 

T10.2 b 

TIG.3 a 

T10.4a 

T10.5e 

T10.6e 

2107 Cc 

T10.8 ¢c 

TIO c 

T10.10 a 

T10.11 (a) STATE: 95% CI for p1-p2/41 ~ #2, where p1=the/41 = the true mean 
hospital stay for patients like these who get heating blankets during surgery and 
.2=the true#2 = the true trye mean hospital stay for patients like these whose 
temperatures were reduced during surgery. PLAN: Two-sample t interval for 


1-p2M1 ~ #2. Random: Two groups in a randomized experiment. Normal/Large 
Sample: n1=104>30% = 104 > 30 and n2=96>30"2 = 96 = 30. po: x1=12.1 
7) = 12.1 s1=4.4, n1=10451 = 4.4,n, = 104. x 2=14,7£2 = 14.7, 

52=6.5, n2=9682 = 6.5, m2 = 96, Using df=165.12df = 165.12, (-4,16, -1.04) 
(—4.16, —1.04). using df=95df = 95, (4.17, -1.03)(—4-17, —1.03) 
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(CONCLUDE: We are 95% confident that the interval from -4.16~4-16 to -1.04 
1.04 captures j1-p12=the true! ~ #2 = the true tue difference in mean 
length of hospital stay for patients like these who get heating blankets during 
surgery and those who have their core temperatures reduced during surgery. (b) 
Yes; because 0 is not in the interval, we have convincing evidence that the true 
mean hospital stay for patients like these who get heating blankets during surgery 
is different from the true mean hospital stay for patients like these who have core 
temperatures reduced during surgery. (c) If we were to repeat this experiment many 
times and calculate 95% confidence intervals for the difference in mean length of 
hospital stay each time, about 95% of the intervals would capture the true 
difference in mean hospital stay for patients like these who get heating blankets 
during surgery and mean hospital stay for patients like these who have core 
temperatures reduced during surgery. 
T10.12 (a) STATE: HO: p1—p2=0/40: P: — P2 = 9 Ha: p1—p2>0 
H, : p, — p2 > 0 where pl=the?1 the trye proportion of cars that have the 
brake defect in last year’s model and p2=theP2 = the true proportion of cars with 
the defect in this year’s model, using a=0.05@ = 0.05. PLAN: Two-sample z test 
for p1—p2P1 ~ P2. Random: Independent random samples. 10%: n1=100<10% 
n, = 100 < 10% of last year’s model and n2=350<10%"2 = 350 < 10% of this 
year’s model. Large Counts: 20, 80, 50, and 300 are >102 10. DO: DO:z=1.39 
DO:z = 1.39; P-value=0.0822P-value = 0.0822 CONCLUDE: Because the P- 
value of 0.0822>a=0.050.0822 > a = 0.05, we fail to reject HoL0. We do not 
have convincing evidence that the true proportion of brake defects is smaller in this 
year’s model compared to last year’s model. (b) Because we failed to reject the 
null hypothesis, we may have made a Type II error. The automaker may take other 
action to improve the production process, even though it is working better, which 
could be costly. 
T10.13 (a) Students may improve from Monday to Wednesday just because they 
have already done the task once. Then we wouldn’t know if the experience with 
the test or the caffeine is the cause of the difference in scores. A better way to run 
the experiment would be to randomly assign half the students to get 1 cup of coffee 
on Monday and the other half to get no coffee on Monday. Then have each person 
do the opposite treatment on Wednesday. (b) STATE: HO: pdiff=0/40 * Mais = 9 
, Ha: udift<o. a? Maite <0 where ydiff=theMdift = the trye mean difference 
(No coffee—Coffee)(Nocoffee ~ Coffee) in the number of words recalled by 
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students like the ones in this study without coffee and with coffee, using a=0.05 
a = 0.05. PLAN: Paired t test for pdiffHdiff. Random: The treatments were 
assigned in a random order. Normal/Large Sample: The dotplot shows no strong 
skewness and no outliers. 


e e e 
T T T T T 
-2 -15 -1 -0.5 0 


Difference (No coffee — Coffee) 
in number of words recalled 


DO: x diff=-1%aitt =—1, saitt=0.816%aitt = 9-816 ngitt=10"aie = 10, ~=-3.87 
t =—3.87, qf=odf = 9; P-value=0.0019P-value = 0.0019, CONCLUDE: 
Because the P-value of 0.0019<a=0.050.0019 < a = 0.05, we reject HOM0. We 
have convincing evidence that the true mean difference (No coffee—Coffee) 
(Nocoffee — Coffee) 


than 0. 


in word recall for students like the ones in this study is less 


Answers to Cumulative AP® Practice Test 3 
AP3.1e 
AP3.2 b 
AP3.3 d 
AP3.4 ¢ 
AP3.5 d 
AP3.6d 
AP3.7 Cc 
AP3.8 a 
AP3.9 d 
AP3.10 c 
AP3.11b 
AP3.12 c 
AP3.13 ¢ 
AP3.14d 
AP3.15d 
AP3.16 c 
AP3.17 b 
AP3.18 b 
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AP3.19 e 

AP3.20 c 

AP3.21a 

AP3.22 d 

AP3.23 b 

AP3.24 e 

AP3.25 a 

AP3.26 b 

AP3.27 c 

AP3.28 d 

AP3.29 a 

AP3.30 b 

AP3.31 STATE: HO: pdiff=040 * Maite = 9) Ha : wdift<oMa : Haitt <9; where 
udiff=the true/aite = the true tye mean change in weight (After—Before) 


(After — Before) ;, pounds for people like these who follow a five-week crash 
diet, using a=0.05a = 0.05, PLAN: Paired t test for pdiffMdiff, Random: Random 
sample of dieters. 10%: 15<10%15 < 10% of all dieters. Normal/Large Sample: 
The boxplot shows no strong skewness and no outliers. 


T T T T T T T T 
-25 -20 -15 -10 -5 0 5 10 15 


Difference (After - Before) in weight (Ib) 
DO: x diff=-3.62aitt =—3-6 caiff=11.53Saitf = 11-53) naiff=15Maite = 19. p= 
-1,21t =—1.21; df=14df = 14; p-value=0.1232P-value = 0.1232 
.CONCLUDE: Because the P-value of 0.1232>a=0.050.1232 > a = 0.05, we fail 
to reject HOM. We do not have convincing evidence that the true mean change in 
weight (After—Before)( After ~ Before) fo, people like these who follow a five- 
week crash diet is less than 0. 

AP3.32 (a) This is an observational study. No treatments were imposed. (b) 
STATE: HO: p1-p2=0/0 : pi — P2 = 9 Ha: p1—p2<o0Ha : Pi — P2 < 0, where 
pl=the?1 = the true proportion of VLBW babies who graduate from high school 
by age 20 and p2=theP2 = the true proportion of non-VLBW babies who graduate 
from high school by age 20, using a=0.05a = 0.05, PLAN: Two-sample z test for 
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p1—p2P1 ~ P2. Random: Independent random samples. 10%: n1=242<10% 

my = 242 < 10% of all VLBW babies and n2=233<10%"2 = 233 < 10% of all 
non-VLBW babies. Large Counts: 179, 63, 193, and 40 are >10= 19, Do: z= 
-2,33D0: z =—2.33; p-yalue=0.0095P-value = 0.0095, CONCLUDE: Because 
the P-value of 0.0095<a=0.050.0095 < a = 0.05, we reject HOM#0. We have 
convincing evidence that the true proportion of VLBW babies who graduate from 
high school by age 20 is less than the true proportion of non-VLBW babies who 
graduate from high school by age 20. 

AP3.33 (a) Because there is no leftover curved pattern in the residual plot. The 
residuals look randomly scattered around the residual=Oresidual = 0 line. (b) y= 
~73,.64+5.7188xY =—73.64 + 5.71882 where y/=predicted distance 

y = predicted distance anq x=temperature® = temperature (ec)! *C) (c) The 
predicted distance from the nearest fish to the outflow pipe increases by 5.7188 
meters for each additional 1-degree-Celsius increase in water discharge 
temperature. (d) y4=—73.64+5.7188(29)=92.21 

y = Zz = 73.64 + 5.7188(29) = 92.21 eters; the residual=78-92.21=-14.21 
residual = 78 — 92.21 =—14.21 meters. The actual distance from the nearest 
fish to the outflow pipe on this afternoon was 14.21 meters closer than the distance 
predicted by the regression line with x=29°C2 = 29°C, 

AP3.34 Define W=theW = the weight of a randomly selected gift box. Then, 
uw=27HW = 27 and ow=2.019W = 2-01. (b) The distribution of W will also be 


Normal. We want to find P(w>30)P (W > 30). 
30—27 


(i) 2=30-272.01=1.49% ~ 201 ~ 149. pzs1.49)=0.0681 
P(z > 1.49) = 0.0681 (ii) P(W>30)=normalcdf! (W > 30) = normalcdf 
(lower: 30, upper: 1000, mean: 27mean:27, SD: 2.01)=0.0678. 


SD:2.01)= 0.0678. There is a 0.0678 probability of randomly selecting a box that 


weighs more than 30 ounces. (c) PP? (at least one box is greater than 30 ounces) 


1-(1-0.0678)5=0.296030 ounces) 1 — (1 — 0.0678)" = 0.2960; there is a 0.2960 
probability of selecting a random sample of 5 boxes and having at least one box 


weigh more than 30 ounces. (d) The distribution of W W will also 


= pw = 27 


be Normal, with pW =)W=274 7 and oW =2.015= 0.899 


2.01 
vo . We want 


to find P(W->30)P (W > 30). 
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_ 30—27 
(i) z=30-270.899=3.34" —0.899 
P(z > 3.34) = 0.0004 


I. P(z>3.34)=0.0004 


(ii) PEW >30)=normalcdf (lower: 30, upper: 1000, 
P(W > 30) = normalcdf (lower: 30,upper: 1000, oan. 97. mean: 27, sp- 


0.899)=0.0004,9D: 9.899) = 0.0004. There is a 0.0004 probability of randomly 
selecting 5 boxes that have a mean weight of more than 30 ounces. 

AP3.35 (a) Assuming that the true difference in the mean annualized daily return 
for Stock A and Stock B is 0, there is a 0.042 probability that we would observe a 
difference in sample means of 4.7 or greater by chance alone. CONCLUDE: 
Because the P-value of 0.042<a=0.050.042 < a = 0.05, we reject HOM0. We 
have convincing evidence that the true mean annualized return for stock A is 
different from the true mean annualized return for stock B. (b) 

Ho : cA2-0B2=0, Ha: cA2-0B2>04%40 ? [4 —%e = 9, Ha : 04 — oR > 0. 


= the true 


2 
where oA2=the true? A variance of returns for stock A and oB2=the 


o% = the tue variance of returns for stock B. (c) F=(12.9)2(9.6)2=1.806 


p= 2 = 1,806 
(9.6 


» 


(9.6) . Values of F that are greater than 1 indicate 


that the price volatility for stock A is higher than that for stock B. The statistic 
provides some evidence for the alternative hypothesis because 1.806>11.806 > 1, 
(d) A test statistic of 1.806 or greater occurred in only 6 out of the 200 trials. Thus, 
the approximate P-value is 6/200=0.039/200 = 0-03 Because 0.03<a=0.05 

0.03 < a = 0.05, we reject HO. There is convincing evidence that the true 
variance of returns for stock A is greater than the true variance of returns for stock 
B. 


Chapter 11 

Section 11.1 

Answers to Check Your Understanding 

page 714: 1. Hoo: The company’s claimed color distribution for its Peanut 
M&M’S® is correct. Hata: The company’s claimed color distribution is not 
correct. 

2. There were 65 candies in the sample from the bag. The expected count of blue, 


365 (0.20) = 13 


orange, green, and yellow candies is 65(0.20)=1 , and the expected 
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count of red and brown is 65(0.10)=6.5°° (0.10) = 6.5, 

9 (14-13)? (9-13)? 
3. y2=(14-13)213+(9-13)213+-=2.3847% ~~ 13 3 
page 717: 1. The expected counts (13, 13, 13, 13, 6.5, 6.5) are all at least 5. We 
should use df=6-1=5df = 6-1 = 5, 


+--+ = 2.3847 


2. Table C: The P-value is greater than 0.25; Tech: P-value=y2?’ -value = x? 
cdf(lower: 2.3847, upper: 10000, df: 5) =0.7938= 0.7938, 

3. Because the P-value of 0.7938>a=0.050.7938 > a = 0.05, we fail to reject HO 
Ho | There is not convincing evidence that the color distribution of M&M’S Peanut 
Chocolate Candies is different from what the company claims. 


page 721: 1. STATE: Ho/40 : The distribution of car colors in Oro Valley is the 
same as the distribution of car colors across North America. 


Hata: The distribution of car colors in Oro Valley is not the same as the 
distribution of car colors across North America. We’ll use a=0.050 = 0.05, 
PLAN: Chi-square test for goodness of fit. Random: The data come from a random 
sample of 300 cars in Oro Valley. 10%: n=300n = 300 is less than 10% of all cars 
in Oro Valley. Large Counts: All expected counts (69, 54, 48, 45, 30, 27, 6, 21) are 
at least 5. 

DO: Test statistic: y2=(84—-69)269+(38-54)254+--=29,921 


_9 _ (84-69)? (38-54)? _ 
=-—3 7 FC atl 29.921 


P-value: df=8-1=74f = 8 — 1 = 7; Table C: P-value<0.0005P-value < 0.0005; 
Tech: yox” cdf(lower: 29.921, upper: 10000, df: 7) 0 0 CONCLUDE: Because 
the P-value of approximately 0<a=0.050 < a = 0.05, we reject HO/#0. We have 
convincing evidence that the distribution of car colors in Oro Valley is not the 
same as the distribution of car colors across North America. 


2. The table of chi-square contributions is shown below. 


Color Observed Expected O-E (O-E)7/E 
White 84 69 15 3.2609 
Black 38 54 —16 4.7407 
Gray 31 48 -17 6.0208 
Silver 46 45 1 0.0222 
Red 27 30 3 0.3 
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Blue 29 27 2 0.1481 
Green 6 6 0 0 
Other 39 21 18 15.429 


The two biggest contributions to the chi-square statistic came from gray and other 
colored cars. There were fewer gray cars than expected and more other-colored 
cars than expected. As for Cass’s question: It does seem that drivers in Oro Valley 
prefer lighter- colored cars as there were more white cars than expected and fewer 
black cars than expected, so it seems Cass might be onto something! 


Answers to Odd-Numbered Section 11.1 Exercises 
11.1 (a) Hoo : The company’s claimed distribution for its deluxe mixed nuts is 
correct. Ha/ta: The company’s claimed distribution is not correct. (b) Cashews: 
150(0.52)=78150 (0.52) = 78, almonds: 150(0.27)=40.5150 (0.27) = 40.5, 
Macadamia nuts: 150(0.13)=19.510 (0-18) = 19.5 and Brazil nuts: 
(c) x2=(83-78)278+(29-40.5)240.5+=6.599 

(83—78)" (29—40.5)* 


x" = 7a te +++: = 6.599 


11.3 (a) The P-value (0.061) is between 0.05 and 0.10. (b) The P-value (0.0003) is 
less than 0.0005. 

11.5 (a) All expected counts (see Exercise 11.1) are >5; df=32 9; df= 3. 

(b) The P-value (0.0858) is between 0.05 and 0.10. (c) Because the P-value of 
0.0858>a=0.050.0858 > a = 0.05, we fail to reject HO0. We do not have 
convincing evidence that the company’s claimed distribution for its deluxe mixed 
nuts is not correct. 

11.7 Time spent doing homework is quantitative. Chi-square tests for goodness of 
fit should only be used for distributions of categorical data. 

11.9 STATE: HOO: Froot Loops contain an equal proportion of each color. Ha 
Ha: Froot Loops do not contain an equal proportion of each color. a=0.05 

a = 0.05 PLAN: Chi-square test for goodness of fit. Random: Random sample of 
Froot Loops. 10%: n=120<10%n = 120 < 10% of all Froot Loops. Large 
Counts: All expected counts = 120(1/6) = 20 > 5. DO: y2=7.9X" = 7-9: af=s 

df = 5. The P-value (0.1618) is between 0.15 and 0.20. CONCLUDE: Because the 
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P-value of 0.1618>a=0.050-1618 > a = 0.05, we fail to reject HO0. We do not 
have convincing evidence that Froot Loops do not contain an equal proportion of 
each color. 

11.11 STATE: HOO: The distribution of eye color and wing shape is the same as 
what the biologists predict. Hae: The distribution of eye color and wing shape is 
not the same as what the biologists predict. 

a=0.05a@ = 0.05 PLAN: Chi-square test for goodness of fit. The conditions are 
met. DO: y2=6.187X = 6-187. gf=3df = 3. The P-value (0.1029) is between 
0.10 and 0.15. CONCLUDE: Because the P-value of 0.1029>a=0.05 

0.1029 > a = 0.05, we fail to reject HO/0. We do not have convincing evidence 
that the distribution of eye color and wing shape is different from what the 
biologists predict. 

11.13 (a) STATE: Ho/40 : Nuthatches do not prefer particular types of trees when 
searching for seeds and insects. Ha/ta: Nuthatches do prefer particular types of 
trees. a=0.05@ = 0.05 PLAN: Chi-square test for goodness of fit. Random: 
Random sample of 156 red-breasted nuthatches. 10%: n=156<10%," = 156 < 10, 
of all nuthatches. Large Counts: 84.24, 62.4, and 9.36 are all > 5. DO: y2=7.418 
x’ = 7.418. af=odf = 2. The P-value (0.0245) is between 0.02 and 0.025. 
CONCLUDE: Because the P-value of 0.0245<a=0.050.0245 < a = 0.05, we 
reject Hoo. There is convincing evidence that nuthatches prefer particular types 
of trees when they are searching for seeds and insects. (b) The breakdown of the 
chi-square statistic is y2=2.407+4.416+0.595X = 2.407 + 4.416 + 0.595. The 
largest contributors are the Douglas firs and the ponderosa pines. There are fewer 
red-breasted nuthatches observed in Douglas firs (70-84.24=—14.24) 


(70 — 84.24 = —14.24) and more red-breasted nuthatches observed in ponderosa 


pines (79-62.4=16.6)\79 — 62.4 = 16.6) than we would expect, so the nuthatches 
seem to prefer the ponderosa pines the most and the Douglas firs the least. 

11.15 (a) STATE: Ho40: Mendel’s 3:1 genetic model is correct. Ha/ta: Mendel’s 
3:1 genetic model is not correct. a=0.05@ = 0.05 PLAN: Chi-square test for 
goodness of fit. The conditions are met. DO: y2=0.3453X" = 0.3453 . af=1df = 1, 
The P-value (0.5568) is greater than 0.25. CONCLUDE: Because the P-value of 
0.5568>a=0.050.5568 > a = 0.05, we fail to reject HO0. We do not have 
convincing evidence that Mendel’s 3:1 genetic model is not correct. (b) HO: 
p=0.75440 7p =0.75 Ha: pz0.75 4a ‘PF 0.75 where p=? > the true proportion 
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of peas that will be smooth. a=0.05a = 0.05; z=0.58762 = 0.5876; p- 
value=0.5568P-value = 0.5568, CONCLUDE: Because the P-value of 
0.5568>a0=0.050.5568 > a = 0.05, we fail to reject Hoo. We do not have 
convincing evidence that the true proportion of peas that are smooth is different 
than 0.75. In both cases, we fail to reject the null hypothesis. The P-value for each 
test has the same value, and 72=y2" =’ ! 

11.17 (a) Ho : The true distribution of flavors for Skittles candies is the same as 
the company’s claim. Hae: The true distribution of flavors for Skittles candies is 
not the same as the company’s claim. 

(b) Each expected count=60(0.2)=12>5C0Unt = 60 (0.2) =12>5 (c) Using 
df=4df = 4 and Table C, the value for «=0.05a = 0.05 js 9.49 and for o=0.01 

a = 0.01 is 13.28. So yoX” statistics greater than 9.49 would provide significant 
evidence at the a=0.05@ = 0.05 level and x2X ” values greater than 13.28 would 
provide significant evidence at the a=0.01@ = 0.01 level. (d) One possibility is 6 
lemon, 6 green apple, 16 orange, 16 strawberry, and 16 grape. y2=10X" al 10, 
which is between 9.49 and 13.28. 


11.19 d 
100 - 
t : | 
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70 - | 
BB Doesn't play video games 
40 [] Plays video games 
30 4 
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0 T T T T 


121 ¢ 
18-29 30-49 50-64 65+ 


11.23 (a) 


Percent 
33 
1 


Age group 


(b) There is an association between age group and playing video games for the 
subjects in the study. As age increases, the proportion of adults that play video 
games decreases. 

11.25 (a) The conditions are met. Random: It is reasonable to consider the two 
samples independent because knowing the response of a heavy reader shouldn’t 
help us predict the response of a light reader. 10%: n1=47<10%!"1 = 47 < 10% 
of heavy readers and n2=32<10%"2 = 32 < 10% of light readers at this large 
school. Normal/Large Sample: n1=47>30"1 = 47 > 30 and n2=32>30 

ng = 32 > 30. (b) STATE: 95% CI for p1-p2/1 ~ #2 where p1=1 = the true 
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mean English grade of heavy readers and 12=M2 ~ the true mean English grade of 


light readers. PLAN: Two-sample t interval for p1—-p21 ~ #2, DO: df=31 

df = 31; Table B and df=30df = 30, (0.1163, 0.4517); Tech: (0.1197, 0.4483) 
with df=59.46df = 59.46 

CONCLUDE: We are 95% confident that the interval from 0.1197 to 0.4483 
captures the true difference in the mean English grade of heavy and light readers. 
(c) No; even though 0 is not in the confidence interval, this was an observational 
study so no conclusion of cause and effect can be made. 


Section 11.2 
Answers to Check Your Understanding 


page 732: 1. Ho40: There is no difference in the true distributions of superpower 
preference for all survey takers from the United Kingdom and from the United 


States. Ha/fa: There is a difference in the true distributions of superpower 


preference. 
2. The expected count for the U.K./Fly cell is: (99)(200)415=47.711 
<i ~ eo The rest of the expected counts are shown in the table: 
Country 
U.K. U.S. | Total 
Superpower preference Fly 91.289 99 


Freeze time 46.265 | 49.735 96 
Invisibility 32.289 | 34.711 67 


Super strength 20.723 | 22.277 | 43 
Telepathy 53.012 | 56.988 | 110 
Total 200 215 415 


3. y2=(54-47.711)247.711+(45-51.289)251.289+"-=6.29 
_9 _ (84-47.711)?  (45—51.289)? : ay 

~ #7711 51.289 +++ = 6.29 
page 737: 1. Random: Independent random samples of 200 survey takers from the 
U.K. and 215 survey takers from the U.S. 10%: n=200n = 200 is less than 10% of 
all U.K. survey takers and n=215” = 215 is less than 10% of all U.S. survey 
takers. Large Counts: All expected counts (47.7, 51.3, 46.3, 49.7, 32.3, 34.7, 20.7, 


22.3, 53.0, 57.0) are at least 5. 
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2. The test statistic is y2=6.29X = 6.29. p-value: df=(5-1)(2-1)=4 
df = (5-1)(2-1)=4 


Table C: The P-value is between 0.15 and 0.20. 
Tech: y2X cdf(lower: 6.29, upper: 10000, df: 4) =0.1785= 0.1785 


3. Assuming that the distribution of superpower preference is the same for all U.K. 
and U.S. survey takers, there is a 0.1785 probability of observing differences in the 
distributions as large as or larger than the ones in this study by chance alone. 

4. Because the P-value of 0.1785>a0=0.050.1785 > a@ = 0.05, we fail to reject HO 
Ho We do not have convincing evidence that there is a difference in the true 
distributions of superpower preference for all survey takers from the United 
Kingdom and the United States. 


page 740: STATE: Ho: There is no difference in the distribution of quality of 
life for patients who have suffered a heart attack in Canada and the U.S. Halla: 
There is a difference in the distribution of quality of life. We will use a=0.01 

a = 0.01, PLAN: Chi-square test for homogeneity. Random: The data came from 
independent random samples from Canada and the United States. 10%: n1=311 
nm = 311 jg Jess than 10% of all Canadian heart attack patients and n2=2165 

nz = 2165 jg Jess than 10% of all U.S. heart attack patients. Large Counts: All 
expected counts (77.37, 538.63, 71.47, 497.53, 109.91, 765.09, 41.70, 290.30, 
10.55, 73.45) are at least 5. DO:yx2=(75-77.37)277.37+°+ 

HO w= (75—77.37)" Fecal (65—73.45)" — 11.725 


77,37 73,45 


(65-73.45)273.45=11.725 
P-value: df=(5-1)(2-1)=44f = (5 — 1) (2-1) = 4. Table C: The P-value is 
between 0.01 and 0.02. Tech: yox” cdf(lower: 11.725, upper: 10000, df: 4) 
=0.0195= 0.0195 CONCLUDE: Because the P-value of 0.0195>a=0.01 

0.0195 > a = 0.01, we fail to reject HO0. There is not convincing evidence that 
there is a difference in the distribution of quality of life for heart attack patients in 
Canada and the United States. 

page 746: STATE: Ho/Z0 : There is no association between gender and perceived 
body image in the population of U.S. college students. Hala: There is an 
association between gender and perceived body image in the population of U.S. 
college students. We will use a=0.010 = 0.01, PLAN: Chi-square test for 
independence. Random: Random sample of 1200 U.S. college students. 10%: 
n=1200n = 1200 is less than 10% of the population of all college students. Large 
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Counts: All expected counts (541.50, 313.50, 148.83, 86.17, 69.67, 40.33) are at 
least 5. DO:y2=(560—-541.5)2541.5+(295-313.50)2313.50+-=47.176 


9 (560—541.5)" (295—313.50)" 
DO: x" = 541.5 313.50 +--+ = 47.176 


P-value: df=(3-1)(2-1)=24F = (3—1)(2-1)= 2. Table C: The P-value is less 
than 0.0005. Tech: y2x” cdf(lower: 47.176, upper: 10000, df: 2) x0* 0 
CONCLUDE: Because the P-value of approximately 0<a=0.010 < a = 0.01, we 
reject Hoo. There is convincing evidence that there is an association between 
gender and perceived body image in the population of U.S. college students. 


Answers to Odd-Numbered Section 11.2 Exercises 

11.27 Those who completed the red survey were more likely to choose a chocolate 
candy in a red wrapper, those who completed the blue survey were more likely to 
choose a chocolate candy in a blue wrapper, and those who completed the control 
survey had a slight preference for chocolate candy in a blue wrapper. 

11.29 (a) Ho/40 : The distribution of candy choice is the same for subjects like 
these who receive the red survey, the blue survey, and the control survey. Halla: 
The distribution of candy choice is not the same. 

(b) Expected count = (20)(26)/60 = 8.67. Remaining expected counts are 8.67, 
8.67, 11.33, 11.33, 11.33. (c) x2=6.65X = 6.65 

11.31 (a) Random: Treatments were randomly assigned. 10%: Not needed because 
the volunteers were not randomly selected from some population. Large Counts: 
All expected counts 8.67, 8.67, 8.67, 11.33, 11.33, and 11.33 are = 5. (b) y2=6.65 
x” = 6.65, Using df=(2-1)(3-1)=24F =e ed) = 2 the P-value 
(0.0359) is between 0.025 and 0.05. (c) Assuming the distribution of candy choice 
is the same for subjects like these who receive the red survey, the blue survey, and 
the control survey, there is a 0.0359 probability of observing differences in the 
distributions as large as or larger than the ones in this study by chance alone. (d) 
Because the P-value of 0.0359>a=0.010.0359 > a = 0.01, we fail to reject HO 
Ho. We do not have convincing evidence that the distribution of candy choice is 
different for subjects like these who receive the red survey, the blue survey, and the 
control survey 

11.33 Because we do not have the actual counts of the travelers in each category. 
We also do not know if the sample was taken randomly or if the samples are 
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independent. 

11.35 STATE: HO/0: There is no difference in the distribution of color for name- 
brand and store-brand gummy bears. Hata: There is a difference in the 
distribution of color. a=0.05a = 0.05 PLAN: Chi-square test for homogeneity. 
Random: Independent random samples. 10%: n1=373"1 = 373 is less than 10% 
of all name-brand gummy bears and n2=622!2 = 622 is less than 10% of store- 
brand gummy bears. Large Counts: All expected counts 130.83, 218.17, 58.86, 
98.14, 50.61, 84.39, 77.97, 130.03, 54.73, and 91.27 are > 5. DO: y2=1.81 

x’ = 1.81. Using df=(5-1)(2-1)=44f =i ed = 4 the P-value 
(0.7698) is greater than 0.25. CONCLUDE: Because the P-value of 
0.7698>a=0.050.7698 > a = 0.05, we fail to reject HO0. There is not 
convincing evidence of a difference in the distribution of color for name-brand and 
store-brand gummy bears. 

11.37 (a) 


Treatment 
Nicotine patch | Drug | Patch plus drug | Placebo | Total 
Outcome Success 
Failure 
Total 


(b) STATE: Hoo: The true proportions of smokers like these who quit for a year 
are the same for the four treatments. Ha/4e: The true proportions of smokers like 
these are not the same for the four treatments. a=0.050 = 0.05 PLAN: Chi-square 
test for homogeneity. Random: 4 groups in a randomized experiment. Large 
Counts: All expected counts 61.75, 61.75, 62, 40.49, 182.25, 182.25, 183, and 
119.51 are > 5. DO:y2=34.937D0 : x” = 34.937. Using df=(2-1)(4-1)=3 

df = (2-—1)(4-1)= 3 the P-value<0.0005(#0)/ -value < 0.0005 (= 0). 
CONCLUDE: Because the P-value of approximately 0<a=0.050 < a = 0.05, we 
reject Hoo. There is convincing evidence that the true proportions of smokers 
like these who are able to quit for a year are not the same for the four treatments. 


2 
11.39 The chi-square test statistic is calculated: y2=7.662+X = 7-662 
2.596 + 2.430 + 0.823 + 10.076 + 3.414 + 5.928 + 2.008 = 34.937. The largest 


component comes from those who had success using both the patch and the drug. 
Far more people fell in this group than would have been expected (87—62=25) 
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(87 — 62 = 25) making it the most effective treatment. The next largest 


component comes from those who had success using just the patch. Far fewer 
people were in this group than would have been expected (40-61.75=—21.75) 


(40 — 61.75 = ~21.75) | making it the least effective treatment. 


11.41 (a) STATE: Ho/40 : There is no association between weekly sauna frequency 
and suffering from sudden cardiac death in the population of middle-aged men 
from eastern Finland. Ha/4e: There is an association between weekly sauna 
frequency and sudden cardiac death in this population. (b) The expected counts are 
49.326, 124.177, 16.497, 551.674, 1388.823, 184.503. (c) x2=6.032X = 6.032, 
Using df=(2-1)(3-1)=24f = (2 — 1) (3 — 1) = 2 the P-value (0.0490) is 
between 0.025 and 0.05. (d) CONCLUDE: Because the P-value of 0.0490<a=0.05 
0.0490 < a = 0.05, we reject HO0. There is convincing evidence that there is 
an association between weekly sauna frequency and suffering from sudden cardiac 
death in the population of middle-aged men from eastern Finland. 

11.43 STATE: HO: There is no association between gender and relative finger 
length in the population of U.S. high school students who completed the survey. 
Ha/fa: There is an association between gender and relative finger length in this 
population. a=0.10@ = 0.10 PLAN: Chi-square test for independence. Random: 
Random sample of U.S. high school students. 10%: n=460n = 460 is <10,< 10, 
of the population of all U.S. high school students who completed the survey. Large 
Counts: All expected counts 77.97, 80.03, 42.44, 43.56, 106.59, and 109.41 are > 
5. DO: The calculator’s y2-TestX ~Lest gives y2=2.065X = 2-065 and p- 
value=0.3561P-value = 0.3561 using df=2df = 2. CONCLUDE: Because the P- 
value of 0.3561>c=0.100.3561 > a = 0.10, we fail to reject HO/0. There is not 
convincing evidence of an association between gender and relative finger length in 
the population of U.S. high school students who completed the survey. 

11.45 STATE: HOO: There is no association between age and opinion about 
loan-free tuition in the population of U.S. adults. Hala: There is an association 
between age and opinion in this population. a=0.05a = 0.05 

PLAN: Chi-square test for independence. Random: Random sample of U.S. adults. 
10%: n=1509<10%n = 1509 < 10% of all U.S. adults. Large Counts: All 
expected counts (68.07, 140.68, 285.33, 361.91, 44.93, 92.86, 188.33, 238.88, 
7.00, 14.46, 29.33, 37.21) are > 5. DO: The calculator’s y2-TestX” — Test gives 
x2=39.755X = 39.755 and p-values0P-value ~ 0 using df=6df = 6. 
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CONCLUDE: Because the P-value of approximately 0<a=0.050 < a = 0.05, we 
reject Hoo. We have convincing evidence that there is an association between 
age and opinion about loan-free tuition in the population of U.S. adults. 


11.47 (a) Chi-square test for homogeneity; the data came from two independent 
random samples. (b) Chi-square test for independence; the data came from a single 
random sample (n=1480n = 1480 U.S. adults), with the individuals classified 
according to two categorical variables (age and whether or not they are 
vegan/vegetarian). 

11.49 (a) Chi-square test for independence; the data came from a single random 
sample (n=4854n = 4854 young adults aged 19 to 25 years), with the individuals 
classified according to two categorical variables (gender and “Where do you live 
now?”). (b) Ho/40 : There is no association between gender and where people live 
in the population of young adults. Ha‘fa : There is an association between gender 
and where people live in this population. (c) Random: Random sample of young 
adults. 10%: n=4854<10%n = 4854 < 10% of all young adults. Large Counts: 
The expected counts in the Minitab output are all > 5. (d) Interpretation: Assuming 
there is no association between gender and where people live in the population of 
young adults, there is a 0.012 probability of getting a random sample of 4854 
young adults with an association as strong as or stronger than the one found in this 
study by chance alone. Conclusion: Because the P-value of 0.012<a=0.05 

0.012 < a = 0.05, we reject HO#0. There is convincing evidence that there is an 
association between gender and where people live in the population of young 
adults. 

11.51 y2=3.147+3.309+0.045+0.048+2.181+2.294+ 

x” = 3.147 + 3.309 + 0.045 + 0.048 + 2.181 + 2.294+ 


0.007+0.007=11.0380-007 + 0.007 = 11.038. the largest component comes from 
males who currently live in their parents’ homes. There were more males living in 


their parents’ homes than would have been expected (986—930.51=55.49) 


(986 — 930.51 = 55.49) The next largest component comes from females who 


currently live in their parents’ homes. There were fewer females living in their 
parents’ homes than would have been expected (923-978.49=-55.49) 
(923 — 978.49 = —55.49) 


11.53 (a) Hoo : There is no difference in the improvement rates for patients like 
these who receive gastric freezing and those who receive the placebo. Halla: 
There is a difference in the improvement rates for patients like these. (b) Assuming 
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there is no difference in the improvement rates between patients like these who 
receive gastric freezing and those who receive the placebo, there is a 0.570 
probability of observing a difference in improvement rates as large as or larger 
than the difference observed in the study by chance alone. CONCLUDE: Because 
the P-value of 0.570>a=0.050.570 > a = 0.05, we fail to reject HO/0 . There is 
not convincing evidence of a difference in the improvement rates for patients like 
these who receive gastric freezing and those who receive the placebo. (c) The P- 
value for this test is identical to the P-value for the test in part (a). Also, z2= 


(-0.57)2=0.3249%y2=0.3222 = ( —0.57)" = 0.3249 = x? = 0.322, 

11,55-e 

1157.0 

11.59 ¢ 

11.61 (a) One-sample t interval for p (b) Two-sample z test for the difference 
between two proportions 


11.63 (a) This was an experiment because a treatment (type of rating scale) was 
deliberately imposed on the students who took part in the study. (b) Several of the 
expected counts are less than 5. 


Answers to Chapter 11 Review Exercises 

R11.1 STATE: HO: The proposed 1:2:1 genetic model is correct. 

Ha!fa: The proposed 1:2:1 genetic model is not correct. a=0.01a = 0.01 

PLAN: Chi-square test for goodness of fit. Random: Random sample of 84 pairs of 
yellow-green parent plants. 10%: n=84<10%n = 84 < 10% of all yellow-green 
parent plants. Large Counts: The expected counts 21, 42, and 21 are all > 5. 
DO:x2=6.476D0 : x” = 6.476 Using df=3-1=20f = 3 - 1 = 2) the P-value 
(0.0392) is between 0.025 and 0.05. CONCLUDE: Because the P-value of 
0.0392>a=0.010.0392 > a = 0.01, we fail to reject HOM0. We do not have 
convincing evidence that the proposed 1:2:1 genetic model is not correct. 

R11.2 The result is not valid because several of the expected counts are less than 5. 
The expected counts are 12.5, 0.5, 4.5, 1.5, 5, 12.5, 0.5, 4.5, 1.5, and 5. 

R11.3 (a) Chi-square test for homogeneity. The data came from two independent 
random samples. (b) The value 252 is the expected count for the “2009/None” cell. 
It is calculated (366)(14,557)/21,177. The value 14.113 is the component of the 
chi-square test statistic that comes from the cell 2009/None. It is calculated 
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(192—251.587)? 14.113 
(192-251.587)2251.587=14.113 251.587 ~ *"". (©) The three cells that 


contribute most to the chi-square test statistic are: 2015/None, which had more 
households with no TV than expected (174-114=60)(174 — 114 = 60) : 
2015/One, which had more households with one TV than expected 
(1680-1494=186)(1680 — 1494 = 186). 44 2015/Five or more, which had fewer 


households with five or more TVs than expected (392-511=-119) 
(392 — 511 = —119) 


R11.4 (a) 


Treatment 
Stress management | Exercise | Usual care | Total 


Cardiac event? 


(b) The success rate was highest for stress management (30/33=0.909) 

(30/33 = 0.909) | followed by exercise (27/34=0.794)(27/34 = 0.794) and usual 
care (28/40=0.70)(28/40 = 0.70) (c) STATE: Ho/Z0: The true success rates for 
patients like these are the same for all three treatments. Hata: The true success 
rates for patients like these are not the same. a=0.05@ = 0.05 

PLAN: Chi-square test for homogeneity. Random: 3 groups in a randomized 
experiment. Large Counts: The expected counts (6.79, 6.99, 8.22, 26.21, 27.01, 
31.78) are all > 5. DO:y2=4,84D0 : x” = 4.84. Using df=(2-1)(3-1)=2 

df = (2—1)(3—1) =2 the p-value (0.0889) is between 0.05 and 0.10. 
CONCLUDE: Because the P-value of 0.0889>a0=0.050.0889 > a = 0.05, we fail 
to reject Hoo. We do not have convincing evidence that the true success rates for 
patients like these are not the same for all three treatments. 

R11.5 STATE: HO/0 : There is no association between gender and goals for 4th, 
oth, and 6th grade students in Michigan. Ha/a : There is an association between 
gender and goals for these students. a=0.05a@ = 0.05 


PLAN: Chi-square test for independence. Random: Random sample of students. 
10%: n=478<10%N = 478 < 10% of all 4th, Sth, and 6th grade students in 
Michigan. Large Counts: All of the expected counts 129.7, 117.3, 74.04, 66.96, 
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47.26, and 42.74 > 5. DO: The calculator’s y2-TestX ~Test gives y2=21.455 

x” = 21.455 and P-values0.00002P-value ~ 0.00002 using df=2df = 2. 
CONCLUDE: Because the P-value of 0.00002<a=0.050.00002 < a = 0.05, we 
reject Hoo. There is convincing evidence that there is an association between 
gender and goals for 4th, 5th, and 6th grade students in Michigan. 


Answers to Chapter 11 AP® Practice Test 

Tiiic 

Tite 

Tliga 

T11.4¢c 

THSd 

T11.6d 

T11.7b 

T11.8a 

T11.9d 

TIiLIod 

T11.11 STATE: HOO : The distribution of gas types among customers at these 
service stations is the same as the distributor’s claim. Ha/4a: The distribution at 
these service stations is not the same. a=0.05a@ = 0.05 

PLAN: Chi-square test for goodness of fit. Random: Random sample of drivers. 
10%: n=400<10%n = 400 < 10% of all customers at this distributor’s service 
stations. Large Counts: The expected counts 240, 80, and 80 = 5. DO:y2=13.15 
DO: x’ = 13.15. Using df=3-1=2df = 3 — 1 = 2) the P-value (0.0014) is 
between 0.001 and 0.0025. CONCLUDE: Because the P-value of 0.0014<a=0.05 
0.0014 < a = 0.05, we reject HOMO. There is convincing evidence that the 
distribution of gas type among drivers at these service stations is not the same as 
the distributor claims. 

T11.12 (a) Random assignment was used to create three roughly equivalent groups 
at the beginning of the study. (b) Ho/40: The true distributions of subsequent arrest 
status are the same for all three police responses for subjects like these. Ha/ta: The 
true distributions of subsequent arrest status are not the same for all three police 
responses for subjects like these. (c) If the true distributions of subsequent arrest 
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status are the same for all three police responses for subjects like these, there is a 
0.0796 probability of getting differences between the three distributions as large as 
or larger than the ones observed by chance alone. (d) Because the P-value of 
0.0796 is larger than o=0.050 = 0.05, we fail to reject HO/#0. There is not 
convincing evidence that the true distributions of subsequent arrest status are not 
the same for all three police responses for subjects like these. 


T11.13 STATE: HO/0 : There is no association between smoking status and 
educational level among French men aged 20 to 60 years. 

Ha/4a: There is an association between smoking status and educational level 
among these men. a=0.05@ = 0.05 PLAN: Chi-square test for independence. 
Random: Random sample of French men. 10%: n=459<10%" = 459 < 10% of 
all French men aged 20 to 60 years. Large Counts: The expected counts (59.48, 
44.21, 42.31, 50.93, 37.85, 36.22, 42.37, 31.49, 30.14, 34.22, 25.44, 24.34) are all 
> 5, DO: The calculator’s y2-TestX’-Lest gives y2=13.305X° = 13.305 and p- 
value=0.0384P-value = 0.0384 using df=6df = 6. CONCLUDE: Because the P- 
value of 0.0384<a=0.050.0384 < a = 0.05, we reject HO0. There is convincing 
evidence of an association between smoking status and educational level among 
French men aged 20 to 60 years. 


Chapter 12 

Section 12.1 

Answers to Check Your Understanding 

page 781: STATE: B1=the slope?1 = the slope of the true regression line relating 
y=scoreY = SCOT€ tg x=row£ = Ow for students like these. PLAN: t interval for 
the slope. Linear: The scatterplot shows a linear pattern, and the residual plot 
shows no leftover curved patterns. Independent: Knowing one student’s score 
shouldn’t help predict the score of another student. Normal: There is no strong 
skewness or outliers in the dotplot of the residuals. Equal SD: The amount of 
scatter of points around the residual =O= 9 line varies somewhat, but it is still 
plausible that o7 is the same for all x-values and that the differences are due to 
sampling variability. Random: Students were randomly assigned a seat location 
within the classroom. DO: With df=30-2=28, t*=2.048 

df = 30 — 2 = 28, f° = 2.048 _1.117+2.048(0.947)=(—3.056,0.822) 

—1.117 + 2.048(0.947) = ( —3.056, OSeel Tochs The calculator gives (-3.057, 
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0.823)( 3-057, 0.823) using af=28df = 28. CONCLUDE: We are 95% confident 
that the interval from —3.056~3-056 to 0.822 captures the slope of the true 
regression line relating y=scoreY ~ S©°T© to x=row# = row for students like 
these. 

page 786: 1. STATE: We want to test HO: 81=0/40 : 41 = 9, Ha: B1<0 

H, : 8; <0 where B1= the slope?1 = the slope of the true regression line 
relating y=scoreY ~ SCOT to x=row® = row for students like these using a=0.05 
a = 0.05, PLAN: t test for the slope. The conditions for regression inference are 
met. DO: According to the output, the standardized test statistic is t=-1.18 

t = —1.18. Because the computer output includes a two-sided P-value and the 
sample slope is consistent with Ha,P-value=0.248/2=0.124 

H,, P-value = 0.248/2 = 0.124 CONCLUDE: Because the P-value of 
0.124>a=0.050.124 > a = 0.05, we fail to reject HO/0. There is not convincing 
evidence of a negative linear relationship between row and score for students like 
these. 

2. Type I error: The teacher finds convincing evidence of a negative linear 
relationship between row and score, when the true slope is 0. Type II error: The 
teacher does not find convincing evidence of a negative linear relationship between 
row and score, when the true slope is <0< 0. Because we failed to reject the null 
hypothesis, it is possible that we could have made a Type II error. 


Answers to Odd-Numbered Section 12.1 Exercises 


12.1 (a) py=105+4.2(15)=168H4y = 105 + 4.2 (15) = 168 oy (hy Because the 
conditions were met, the distribution of heights for 15-year-old students follows a 
Normal distribution with a mean of 168 cm and a standard deviation of 7 cm. Let 


X=the height* = the height of 15-year-old students at this school. We want to 


_ 180-168 _ 
find P(X>180)P (X > 180) (jy 2=180-1687=1.717 7s the 


proportion of z-scores above 1.71 is 0.0436. (ii) normalcdf (lower: 180, upper: 
1000, mean: 168, SD: 7) =0.0432= 0.0432, About 4.32% of 15-year-old students 
are taller than 180 cm. (c) Probably not; the slope of the sample regression line 


would almost certainly differ from 4.2 due to sampling variability. We would, 
however, expect the slope of the sample regression line to be close to 4.2. 


12.3 The Equal SD condition is not met because the standard deviation of the 
residuals clearly increases as the laboratory measurement (x) increases. 
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12.5 Linear: There is no leftover curved pattern in the residual plot, indicating that 
a linear model is appropriate. Independent: Knowing the BAC for one subject 
should not help us predict the BAC for another subject. Normal: The histogram of 
the residuals shows no strong skewness or outliers. Equal SD: The residual plot 
shows a similar amount of scatter about the residual=Oresidual = 0 line for each 
x=number® = number of beers. Random: The students were randomly assigned a 
certain number of cans of beer to drink. 


12.7 (a) bO=-0.0127010 = —9.012701 If a student drinks 0 beers, the BAC will 
be -0.012701~9-012701 | on average. Of course, with 0 beers, BAC should be 0. 
(b) b1=0.017964"1 = 0.017964. for each increase of 1 beer consumed, the 
average BAC increases by about 0.017964. (c) s=0.02048 = 0.0204; the actual 
BAC amounts typically vary by about 0.0204 from the amounts predicted with the 


least-squares regression line using x=number® = number of beers. (d) 


SEb1=0.00249Eh = 0.0024. if we repeated the random assignment many times, 


the slope of the sample regression line would typically vary by about 0.0024 from 
the slope of the true regression line for predicting BAC from number of beers. 


12.9 (a) df=14df = 14, t=2.977t* = 2.977: (0.011, 0.025)(9-011, 0.025) q) 
We are 99% confident that the interval from 0.011 to 0.025 captures the slope of 
the true regression line relating y=-BACY ~ BAC to x=numbert = number of 
beers consumed. (c) If we repeated the experiment many times and computed a 
confidence interval for the slope each time, about 99% of the resulting intervals 
would contain the slope of the true regression line relating y=-BACY ~ BAC to 
x=number£ = number of beers consumed. 


12.11 STATE: B1=the slope?1 = the slope of the true regression line relating 
y=amount¥ = amount expelled to x=tapping? = tapping time. PLAN: t interval 
for the slope. Linear: The scatterplot shows a linear pattern. Also, the residual plot 
shows no leftover curved patterns. Independent: Knowing the amount expelled by 
one can does not help us predict the amount expelled by another can. Normal: 
There is no strong skewness or outliers in the histogram of the residuals. Equal SD: 
The amount of scatter of points around the residual=Oresidual = 0 line appears to 
be about the same at all x values. Random: Cans were randomly assigned a tapping 
time. DO: df=38df = 38, using df=30df = 30, t*=2.042t" = 2.042; (-2.9962, 
~2,2738)(— 2-9962, —2.2738) CONCLUDE: We are 95% confident that the 
interval from —2.9962~2-9962 to -2.2738~2-2738 captures the slope of the true 


regression line relating y=amountY = 2mMount expelled to x=tapping® = tapping 
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time. 


12,13 STATE: g1=P 1 = the slope of the population regression line relating 
y=number¥Y = number of clusters of beetle larvae to x=number® = number of 
stumps. PLAN: t interval for the slope. The conditions are met. DO: df=21df = 21 
, t=2.831t" = 2.831, (8.678,15.11)(8-678, 15-11) CONCLUDE: We are 99% 
confident that the interval from 8.678 to 15.11 captures the slope of the population 
regression line relating y=number¥Y = Number of clusters of beetle larvae to 
x=number® = number of stumps. 

12.15 STATE: Ho: 81=0/0 : 81 = 9, Ha: B1<0He : P1 < 0, where B1=1 = the 
slope of the true regression line relating y=cornY ~ ©°!™ yield to x=weeds 

x = weeds per meter; a=0.05a@ = 0.05. PLAN: t test for slope. Linear: There is 
no leftover curved pattern in the residual plot. Independent: Knowing the yield for 
one meter does not help us predict the yield for another meter. Normal: The dotplot 
of the residuals shows no strong skewness or outliers. Equal SD: The residual plot 
shows a similar amount of scatter about the residual=Oresidual = 0 line for all x 
values. Random: The plots were randomly assigned how many weeds were 
allowed to grow. DO: t=-1.92t = —1-92; P-value=0.075/2=0.0375 

P-value = 0.075/2 = 0.0375 CONCLUDE: Because the P-value of 
0.0375<a=0.050.0375 < a = 0.05, we reject HOM. There is convincing 
evidence that the slope of the true regression line relating corn yield to weeds per 
meter is negative. In other words, we have convincing evidence that having more 
weeds reduces corn yield. 

12.17 STATE: HO: 1-0/0 : 61 = 9, Ha: g1<0Me : 1 < 0 where B1=°1 = the 
slope of the population regression line relating y=heart¥ = heart qisease death 
rate to x=wine® = wine consumption in the population of countries; a=0.05 

a = 0.05, PLAN: t test for the slope. Linear: There is no leftover curved pattern in 
the residual plot. Independent: The sample size n=19<10%" = 19 < 10% of all 
countries. Normal: The histogram of residuals shows no strong skewness or 
outliers. Equal SD: The residual plot shows that the standard deviation of the death 
rates might be a little smaller for large values of wine consumption x, but it is hard 
to tell with so few data values. Random: The data come from a random sample. 
Because the Equal SD condition is questionable, we will proceed with caution. 
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Do: Using tech, t=-6.46¢ = —6.46 qf=17df = 17, and P-value ~~ 0. 
CONCLUDE: Because the P-value of approximately 0<a=0.050 < a = 0.05, we 
reject Hoo. There is convincing evidence of a negative linear relationship 
between wine consumption and heart disease death rate in the population of 
countries. 

12.19 STATE: Ho: 81=0/0 : 1 = 9, Ha: B1<0He : 61 < 0, where B1=the slope 
5, = the slope of the true regression line relating y=¥ = performance on the 10- 
question math test to x=® = volume of the music for students like these; a=0.05 
a = 0.05, PLAN: t test for the slope. The conditions are met. DO: t=-4.163 

t = —4.163, p-value=0.0003/2=0.00015/-value = 0.0003 /2 = 0.00015, 
CONCLUDE: Because the P-value of 0.00015<a=0.050.00015 < a = 0.05 we 
reject Hoo. There is convincing evidence of a negative linear relationship 
relating y=performanceY = performance on the 10-question math test to x=£ = 
volume of the music for students like these. 

12.21 (a) With df=19df = 19, t*=2.093t" = 2.093; (9016.4, 14,244.8) 

(9016.4, 14, 244.8) (b) Because the automotive group claims that people drive 
15,000 miles per year, it says that for every increase of 1 year, the mileage would 
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increase by 15,000 miles. (c) t=11,630.6-15,0001249=—2.70 


a 11,630.6—15,000 _2.70 
= 1249 ~  “""" df=19df = 19, the P-value is between 2(0.005)=0.01 


2 (0.005) = 0.01 ang 2(0.01)=0.022 (0-01) = 0.02. Tech-P-value=0.0142 
P-value = 0.0142, Because the P-value of 0.0142<a=0.050.0142 < a = 0.05, 
we reject Hoo. We have convincing evidence that the slope of the population 
regression line relating miles to years is not equal to 15,000. (d) Yes; because the 
interval in part (a) does not include the value 15,000, the interval also provides 
convincing evidence that the slope of the population regression line relating miles 
to years is not equal to 15,000. 

12,23 

2250 

12.27b 

12.29 (a) This was an experiment because the two treatments (say the color of the 
printed word and read the word) were deliberately assigned to the students. (b) He 
used a randomized block design where each student was a block. He did this to 
help account for the different abilities of students to read the words or to say the 
color they were printed in. (c) The random assignment was used to help average 
out the effects of the order in which people did the two treatments. For example, if 
every subject said the color of the printed word first and was frustrated by this task, 
the times for the second treatment (reading the word) might be worse. Then we 
wouldn’t know the reason the times were longer for the second treatment—because 
of frustration or because the second method actually takes longer. 


12.31 (a) There appears to be a moderately strong, positive linear association 
between the length of time to read the word and length of time to identify the color. 
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(b) y\=4.887+1.1321x¥ = 4.887 + 1.13212 Where yA=the predicted time 
y = the predicted time to jdentify the color and x=the amount of time 


x = the amount of time to read the words. (c) y\=4.887+1.1321(9)=15.076 


y = 4.887 + 1.1321 (9) = 15.076 seconds, so the residual is y-y\=13-15.076= 


-2.076Y — ¥ = 138 — 15.076 = —2.076 seconds. The actual time to complete the 
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color task was 2.076 seconds less than the time predicted by the regression line 
with x=92 = 9 seconds to complete the word task. (d) If the true slope of the 
regression line relating time to identify the colors and time to read the words is 0 
and this experiment were repeated many times, there is a 0.0215 probability of 
getting an observed slope of 1.1321 or larger by chance alone. 


Section 12.2 

Answers to Check Your Understanding 

page 810: 1. Option 1: premium/=—343+8.63(58)=$157.54 

premium = —343 + 8.63 (58) = $157.54 

Option 2: In(premium)/=—12.98+4.416(In 58)=4.9509 — y=e4.9509=$141.30 
In (premium) = —12.98 + 4.416 (In58) = 4.9509 > g = e+ = $141.30 


Option 3: In(premium)\=—0.063+0.0859(58)=4.9192 — y\=e4.9192=$136.89 
In (premium) = —0.063 + 0.0859 (58) = 4.9192 > 9 = e**!®? = $136.89 


2. The exponential model (Option 3) best describes the relationship because the 
scatterplot showing In(premium) versus age was the most linear and this model had 
the most randomly scattered residual plot. 


Answers to Odd-Numbered Section 12.2 Exercises 


12.33 (a) y\=-0.08594+0.21xY = —9.08594 + 0.21% where y is the period and 
x is the length. (b) y\=-0.08594+0.2180=1.792 
g = —0.08594 + 0.21,/80 = 1.792 seconds 


12.35 (a) yA2=-0.15465+0.0428xY" = —0.15465 + 0.04282 where y is the 
period and x is the length. (b) yA2=—0.15465+0.0428(80)=3.269 

y” = —0.15465 + 0.0428 (80) = 3.269 5 ya=3,269=1.808 

= V3.269 = 1.808 seconds. 

12.37 (a) The scatterplot of log(period) versus log(length) is roughly linear. Also, 
the residual plot shows no obvious 

leftover curved patterns. (b) log y=—0.73675+0.51701 log(x) 

log y = —0.73675 + 0.51701 log (x) 


where y is the period and x is the length. (c) log y=—-0.73675+0.51701 


2082 


$0 y\=100.24717=1.77 seconds = 10°47" = 1.77 seconds, 

12.39 log y=1.01+0.72 log(127)=2.525!08y = 1.01 + 0.72 log (127) = 2.525 
y\=102.525=334.97 grams = 10°°*° = 334.97 grams js the predicted brain 
weight of Bigfoot. 

12.41 (a) In distance=—3.1099+2.0361 : In(speed) 

Indistance = —3.1099 + 2.0361 - In (speed) 


(b) In distance=—3.1099+2.0361 : In(48)=4.772 
Indistance = —3.1099 + 2.0361 - In (48) = 4.772 so distance=e4.772=118.16 


distance = e* "7 = 118.16 feet. Residual=110.33-118.16=-7.83 

Residual = 110.33 — 118.16 = —7.83 feet, Mr. Waggoner traveled a distance of 
7.83 feet less than the distance predicted by the power model with x=48r = 48 
mph. 

12.43 (a) Because the scatterplot of In(count) versus time is fairly linear, an 
exponential model would be reasonable. (b) In yA=5.97316-0.218425x 

Iny = 5.97316 — 0.218425 where y is the count of surviving bacteria (in 
hundreds) and x is time in minutes. (c) In yA=5.97316-0.218425(17)=2.26 

In y = 5.97316 — 0.218425 (17) = 2.26 66 y=e.26=9,580 = €77° = 9.58 oy 
958 bacteria. 

12.45 (a) In population\=4.7885+0.0059(years) 

In population = 4.7885 + 0.0059 (years) 


(b) In population’=4.7885+0.0059(320)=6.6765 
In population = 4.7885 + 0.0059 (320) = 6.6765 ., 


population\=e(6.6765)=793,54Population = e765) — 793.54 minion people. 
12.47 (a) The exponential model because the scatterplot of In(percent) versus 
distance is roughly linear. A power model would not work as well because the 
scatterplot of In(percent) versus In(distance) is clearly curved. (b) In 
y\=4,6649-0.1091xIny = 4.6649 — 0.10912, where y=percentY = Percent 
made and x=distancet = distance (feet). (c) In y\=4.6649-0.1091 

In y = 4.6649 — 0.1091 (1)=9 3738(21) = 2.3738. 65 ya=e2,3738=10.738% 
y= e975 = 10.738 %, (d) Because the last few residuals (those farthest to the 
right) were all positive, we might expect that our prediction, which was based on 
x=21@ = 21, might also be too small. 
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12.49 (a) 
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There is a strong, positive curved relationship between heart weight and length of 
left ventricle for mammals. 


(b) 


In (weight) 


1 2 3 
In (length) 


Because the relationship between In(weight) and In(length) is roughly linear, heart 
weight and length seem to follow a power model. An exponential model would not 
be appropriate because the relationship between In(weight) and length is clearly 


curved. (c) In yA$=—0.314+3.1387 In xlny = —0.314 + 3.1387 Ina Where y is the 
weight of the heart and x is the length of the cavity of the left ventricle. (d) In y“= 


~0.314+3.1387 In(6.8)=5.7039 y = —0.314 + 3.1387 In (6.8) = 9.703 so 
y\=e5.703=299.779 = €'"* = 299.77 grams, 


12.51 ¢ 
(Psy ge 
z= © = -167 
12.55 (a) (i) z=3-4.50.9=-1.67 0.9 and z=6-4.50.9=1.67 
z= © =167 
0.9 ; the proportion of z-scores between —1.677 1.67 and 1.67 is 


0.9050. (ii) normalcdf (lower: 3, upper: 6, mean: 4.5, SD: 0.9)=0.9044 


0.9) = 0.9044 There is a 0.9044 probability that Marcela’s shower lasts between 


ee 0.67 = 2 
3 and 6 minutes. (b) (i) Solving —0.67=Q1-4.50.9 0.9 gives 
= 0.67 = 3 
Q1=3.897@1 = 3-897 minutes. Solving 0.67=Q3-4.50.9 0.9 gives 


Q3=5.103%3 = 5-103 minutes. (ii) Tech: invNorm(area: 0.25, mean 4.5, SD: 0.9) 
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gives Q1=3.89391 = 3-893 minutes and invNorm(area: 0.75, mean: 4.5, SD: 0.9) 
gives Q3=5.10793 = 5.107 minutes. Thus, an outlier is any value above 
5,107+1.515.107-3.8932=6.9289-107 + 1.515.107 — 3.8932 = 6.928 | Because 
7>6.9287 > 6.928, a shower of 7 minutes would be considered an outlier for 
Marcela. (c) (i) P(X>7)=P(z>7—4.50.9)=0.0027 


P(X>7)=P (z Ses a) = 0.0027 


sas ; (ii) normalcdf(lower: 7, upper: 1000, 


mean: 4.5, SD: 0.9)=0.0027= 0.0027 Let Y=the number of days 

Y = the number of days that Marcela’s shower is 7 minutes or higher. Y is a 
binomial random variable with n=10n = 10 and p=0.0027P = 9-0027, pry>2)=1 
P(Y¥>2) = 1 _binomcdt(trials: 10, p: 0.0027, x value: 1) =0.0003= 0.0003, 
There is a 0.0003 probability that Marcela’s shower time would be 7 minutes or 
more on at least 2 of the 10 days. (d) Because the distribution of X is Normal, the 
sampling distribution of x @ is also Normal; px =p=4.54z = = 4-5. Because 
10<10,10 < 10, of days that Marcela showers, the standard deviation is 

-= = = 0.285 


- o _ ates 
ox =0.910=0.285 A We want P(x >5)P ( > 5). (iy 


=1.7 


72524 50 9051.750" 5. P(z>1.75)=0.0401? (2 > 1.75) = 0.0401. 


(ii) normalcdf(lower: 5, upper: 1000, mean: 4.5, SD: 0.285) =0.03970.0397. There 
is a 0.0397 probability that the mean length of Marcela’s showers on these 10 days 
exceeds 5 minutes. 


12.57 (a) STATE: p=? = the true proportion of all AP® Statistics teachers 
attending this workshop who would say they have a tattoo. PLAN: One-sample z 
interval for p. Random: Random sample of teachers at the institute. 10%: n=98<10 
nm = 98 < 10% of the population of teachers at this workshop. Large Counts: 23 
and 75 are > 10. DO: (0.151, 0.319). CONCLUDE: We are 95% confident that the 
interval from 0.151 to 0.319 captures p=the true? ~ the true proportion of AP® 
Statistics teachers at this workshop who would say they have a tattoo. (b) No; 
because the value 0.29 is included in the interval of plausible values, we do not 
have convincing evidence that the true proportion of teachers at the workshop who 
would say they have a tattoo is different from 0.29. (c) If we had two more 


successes, p\=0.25P = 0.25 and the interval would be (0.165, 0.335) 


(0.165, 0.335) 1¢ we had two more failures, p\=0.23P = 0.23 and the interval 


would be (0.148, 0.312). In both cases, the value 0.29 is included in the interval, so 
even if both of these teachers had responded, the answer to part (b) would not 


2085 


change. 


Answers to Chapter 12 Review Exercises 

R12.1 (a) The scatterplot reveals that there is a negative, moderately strong, linear 
relationship between height (inches) and number of steps. (b) y4=113.57-0.9211x 
y = 113.57 — 0.92112 , where yay is the predicted number of steps and x is the 
height of the student (in inches). 


(c) (i) b0=113.5700 = 113.57, if a student has a height of 0 inches, the number of 
steps to walk the length of the school hallway is 113.57 steps, on average. Of 
course, a student with a height of 0 will not be able to walk anywhere, so this is 
unreasonable. (ii) b1=-0.921151 = 0.9211. for each increase of 1 inch in height, 
the average number of steps to walk the length of the school hallway decreases by 
0.9211 steps. (iii) s=3.50429s = 3.50429; the actual number of steps needed to 
walk the length of the school hallway typically varies by about 3.50429 from the 
amount predicted with the least-squares regression line using x=height” ~ height 
(iv) SEb1=0.19389 Hh = 90-1938 ; if we repeated the random selection many 
times, the slope of the sample regression line would typically vary by about 0.1938 
from the slope of the population regression line for predicting number of steps 


from height. 


R12.2 Linear: The residual plot shows no leftover curved pattern. Independent: 
Knowing the number of steps needed for one student should not help us predict the 
number of steps needed for another student. Also, the sample size n=36<10% 

n = 36 < 10% of all students at the high school. Normal: The histogram of the 
residuals does not show strong skewness or outliers. Equal SD: The residual plot 
shows roughly equal amounts of scatter for all x values. Random: The data come 
from a random sample. 

R12.3 STATE: Ho: B1=0/40 : 61 = 9 Ha: B1<0Me : 81 < 9, where B1=the slope 
5; = the slope of the population regression line relating y=numberY = number 
of steps to x=height® = height ysing a=0,05a = 0.05, PLAN: ¢ test for the slope. 
The conditions are met. DO:t=-4.75DO : t = —4.75, with df=34df = 34, using 
df=30df = 30, the P-value (0) is less than 0.0005. CONCLUDE: Because the P- 
value of approximately 0<a=0.050 < a = 0.05, we reject HOMO . There is 
convincing evidence that the slope of the true regression line relating height to 
number of steps is negative. In other words, we have convincing evidence that 
taller students require fewer steps to walk the length of the hallway. 
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R12.4 STATE: g1=9 1 = the slope of the population regression line relating 
y=number¥Y = number of steps to x=height® = height, py an: ¢ interval for the 
slope. The conditions are met. DO: With df=34df = 34, using df=30df = 30, 
te=2.042t* = 2.042; (-1.317, -0.525)(— 1-317, — 0.525) CONCLUDE: We are 
95% confident that the interval from -1.317~1-317 to -0.525~-9-525 captures the 
slope of the population regression line relating y=number¥ = Bumber of steps to 
x=height® ~ height The interval provides more information because it gives an 
interval of plausible values for the population slope. 

R12.5 (a) The transformation did achieve linearity because there is no leftover 
curved pattern in the residual plot. (b) y\=—0.000595+0.3(1x2) 


j — —0.000595 + 0.3 (+) a 
x? / where y=intensityY = Intensity and x=distance 


x = distance (c) yA=—0.000595+0.3(1( 2.1)2)=0.0674 


j = —0.000595 + 0.3 ( aa ) = 0.0674 
an) candelas 


R12.6 (a) A linear model is not appropriate because the scatterplot clearly shows 
that the relationship between weight and price is curved. (b) A power model is 
more appropriate in this case because the scatterplot showing In(price) versus 
In(weight) is fairly linear, whereas the scatterplot showing In(price) versus weight 
is curved. (c) For the power model, In yA=9.7062+2.2913 In(2)=11.294 


In y = 9.7062 + 2.2913 In (2) = 11.294 ang ya=e1 1.294=80,338.16 
y =e % — 80, 338.16 dollars. For the exponential model, In 


~ _ 11.0291 _ 
y=e11.0291=61,642.08Y — © °°" = 61, 642.08 dollars. Based on my answer to 


part (b), I think the power model will give a better prediction. 


Answers to Chapter 12 AP® Practice Test 
T1Z.1¢ 
T12.2b 
TiZg.od 
T12.4d 
T12.5d 
T12.6d 
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T12.7 e 

T1I2.6.d 

T12.9d 

T12.10¢c 

T12.11 (a) (i) b1=4.832301 = 4.8323. for each increase of 1 milligram in growth 
hormone, the average weight gain increases by 4.8323 ounces. (ii) bO=4.5459 


(z= 0) the 


predicted weight gain is 4.546 ounces, on average. (iii) s=3.1358 = 3.135; the 


bo = 4.5459. if a chicken is given no growth hormone (x=0) 


actual weight gain will typically vary by about 3.135 ounces from the weight gain 


predicted with the least-squares regression line using x=doset = dose of growth 


hormone. (iv) SEb1=1.0164> 2h: = 1.0164 ; if we repeated the random assignment 


many times, the slope of the sample regression line would typically vary by about 
1.0164 from the slope of the true regression line for predicting weight gain from 
dose of growth hormone. 

(b) STATE: HO: p1=0/0 >i = 0 Ha: p1z04a : Pi # 0. where B1=the slope 

5; = the slope of the true regression line relating y=weight¥ = Weight gain to 
x=doset = dose of growth hormone; a=0.05@ = 0.05. PLAN: t test for the slope. 
The conditions are met. DO: t=4.75t = 4.75 and P-value=0.0004 

P-value = 0.0004, CONCLUDE: Because the P-value of 0.0004<a=0.05 

0.0004 < a = 0.05, we reject HO/0. There is convincing evidence of a linear 
relationship between the dose of growth hormone and weight gain for chickens like 
these. (c) STATE: B1=the slope? 1 = the slope of the true regression line relating 
y=weightY = Weight gain to x=doset = dose of growth hormone. PLAN: t 
interval for the slope. The conditions are met. DO: df=13df = 13, 

(2.6369, 7.0277)(2-6369, 7.0277) CONCLUDE: We are 95% confident that the 
interval from 2.6369 to 7.0277 captures the slope of the true regression line 
relating y=weightY = weight gain to x=doset = dose of growth hormone for 
chickens like these. 


T12.12 (a) There is clear curvature evident in both the scatterplot and the residual 
“ fan\3 _ 
plot. (b) Option 1: yA=2.078+0.0042597(30)3=9 = 2-078 + 0.0042597(30)" = 


117.09117.09 board feet; Option 2: In y\=1.2319+0.113417(30)= 


Iny = 1.2319 + 0.113417 (30) = 4 634414.63441 and yA=e4.63441=102.967 


j = e* "4" = 102.967 hoard feet, (c) The residual plot for Option 1 is much 


more scattered, while the residual plot for Option 2 is curved, meaning that the 
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model relating the amount of usable lumber to cube of the diameter is more 
appropriate. Thus, the prediction of 117.09 board feet seems more reliable. 


Answers to Cumulative AP® Practice Test 4 
AP4.1e 
AP4.2 c 
AP4.3 d 
AP4.4 a 
AP4.5b 
AP4.6e 
AP4.7 d 
AP4.8 a 
AP4.9e 
AP4.10 a 
AP4.11b 
AP4.12 d 
AP4.13 e 
AP4.14d 
AP4.15b 
AP4.16a 
AP4.17 e 
AP4.18 c 
AP4.19 a 
AP4.20b 
AP4.21e 
AP4.22 c 
AP4.23 b 
AP4.24 c 
AP4.25 d 
AP4.26e 
AP4.27 d 
AP4.28 c 
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AP4.29 b 

AP4.30 b 

AP4.31 d 

AP4.32 a 

AP4.33 e 

AP4.34 b 

AP4.35 a 

AP4.36 b 

AP4.37 d 

AP4.38 e 

AP4.39 d 

AP4.40 d 

AP4.41 STATE: HO: p1-2=0, Ha: 11-1120 
Ho : 1 — Ho = 0, Ha : i — He FO where 1=H1 = true mean difference in 
electrical potential for diabetic mice and »2=2 ~ true mean difference in 
electrical potential for normal mice; a=0.05@ = 0.05, PLAN: Two-sample t test 
for 11-2M1 ~ #2, Random: Independent random samples. 10%: n1=24<10 

my = 24 < 100% of all diabetic mice and n2=18<10"2 = 18 < 109% of all normal 
mice. Normal/Large Sample: The graphs of the data reveal no outliers or strong 
skewness so a two-sample t procedure is appropriate. DO:t=2.55DO : t = 2.55, 
Using df=38.46df = 38.46, P-value=0.0149P-value = 0.0149, CONCLUDE: 
Because the P-value of 0.0149<a=0.050.0149 < a = 0.05, we reject HOM. 
There is convincing evidence that the true mean electric potential for diabetic mice 
differs from that for normal mice. 

AP4.42 (a) HO: p1—p2=0, Ha: p1-p2<0Ho : Pi — p2 = 0, Ha : pi — po < 0, 
where p1=P1 ~ the true proportion of women like the ones in the study who were 
physically active as teens who would suffer a cognitive decline and p2=P?2 ~ the 
true proportion of women like these who were not physically active as teens who 
would suffer a cognitive decline. (b) A two-sample z test for pl—p2P1 ~ P2. (c) 
No; because the participants were mostly white women from only four states, the 
findings may not be generalizable to women in other racial and ethnic groups or 
who live in other states. (d) Two variables are confounded when their effects on 
the response variable (measure of cognitive decline) cannot be distinguished from 
one another. For example, women who were physically active as teens might have 
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also done other things differently as well, such as eating a healthier diet. If 
healthier diets lead to less cognitive decline, we would be unable to determine if it 
was their physically active youth or their healthier diet that slowed their level of 
cognitive decline. 


AP4.43 (a) Because the first question called it a “fat tax,” people may have reacted 
negatively because they believe this is a tax on those who are overweight. The 
second question provides extra information that gets people thinking about the 
obesity problem in the U.S. and the increased health care that could be provided as 
a benefit with the tax money, which might make them respond more positively to 
the proposed tax. The question should be worded in a more straightforward 
manner. (b) This method samples only people at fast-food restaurants. They may 
go to these restaurants because they like the sugary drinks and wouldn’t want to 
pay a tax on their favorite beverages. Thus, it is likely that the proportion of those 
who would oppose such a tax will be overestimated with this method. A random 
sample of all New York State residents should be taken to provide a better estimate 
of the level of support for such a tax. (c) Use a stratified random sampling method 
in which each state is a stratum. Select a random sample from each state to obtain 
results for each state and combine the random samples to obtain an overall estimate 
for the nation as a whole. 
AP4.44 Let W=Mr.W = Mr. Worcester arrives first, L=_Mr.L = Mr. Legacy 
arrives first, C=Dr.C = Dr. Currier arrives first, and S=the coffeeS = the coffee 
is strong. The tree diagram below organizes the given information. 
o® 5 

WwW <0, 0 <¢ 

= 


0 5 
LD ¢ 


os 
Me 0.99 6 

(a) P(s)=0.16? (S) = 9-16. there is a 0.16 probability that the coffee will be 

strong on a randomly selected morning. (b) P(C|S)=0.05/0.16= 

P(C\S) = 0.05/0.16 = 9 3195, Given that the coffee is strong, there is a 0.3125 

probability that it was brewed by Dr. Currier. 

AP4.45 (a) The scatterplot reveals a strong, negative curved relationship between 

seed count and seed weight. (b) Model B is better because the scatterplot shows a 

much more linear pattern and its residual plot shows no leftover curved pattern. 
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The scatterplot for Model A still has a curved pattern and the residual plot 
has a leftover U-shaped pattern. (c) In(weight)\=15.491-—1.5222 In(3700)=2.984 
In (weight) = 15.491 — 1.5222 In (3700) = 2.984 ., 


weight\=e2.984=19.77mg, Weight = e”®* — 19.77mg. 

AP4.46 (a) Let X=X = diameter of a randomly selected lid. Shape: Normal 
distribution. Center: px =44z ~ 4 inches. Variability: Because 25 is less than 
10% of all lids produced that hour, ox =0.004%: ~ 0.004 inches. (b) We want to 


find P(x <3.99 or x >4.01)P (# < 3.99 org > 4.01) 
3.99—4 


(i) 2=3.99-40.004=-2.507 ~ 0.007 ~ —2-°9 ang 7=4.01-40,004=2.50 
_ 401-4 _ 
“0.004 *“; the proportion of z-scores less than -2.50~2-99 or greater 


than 2.50 is 0.0062+0.0062=0.01240-0062 + 0.0062 = 0.0124) (jj) 1—- 
normalcdf(lower: 3.99, upper: 4.01, mean: 4, SD: 0.004) =0.0124= 0.0124, 
Assuming that the machine is working properly, there is a 0.0124 probability that 
the mean diameter of a sample of 25 lids is less than 3.99 inches or greater than 


4.01 inches. (c) We want to find P(4<x-<4.01)P (4 < % < 4.01), 


ys gh y= A01n4 _ 9 Bg 
(i) z=4-40.004=0" (0.004 = and z=4.01-40.004=2.50° 0.004"; the 


proportion of z-scores between 0 and 2.50 is 0.9938—0.5000=0.4938 

0.9938 — 0.5000 = 0.4938 | (ii) normalcdf(lower: 4, upper: 4.01, mean: 4, SD: 
0.004) =0.49380.4938. Assuming that the machine is working properly, there is a 
0.4938 probability that the mean diameter of a sample of 25 lids is between 4.00 
and 4.01 inches. (d) Let Y=Y = the number of samples (out of 5) in which the 
sample mean is between 4.00 and 4.01. The random variable Y has a binomial 
distribution with n=5n = 5 and p=0.4938P = 9.4938. Tech: P(X>4)=1 

P(X 2 4) = 1_binomcdf(trials: 5, p: 0.4938, x value: 3) = 0.17980.1798, 
Assuming that the manufacturing process is working correctly, there is a 0.1798 
probability that in 5 consecutive samples, 4 or 5 of the sample means will be above 
the desired mean of 4.00 but below the upper boundary of 4.01. (e) Because the 
probability found in part (b) is less than the probability found in part (d), getting a 
sample mean below 3.99 or above 4.01 is more convincing evidence that the 
machine should be shut down. This event is much less likely to happen by chance 
when the machine is working correctly. 
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Glossary/Glosario 


English 


1.5xIQR1-5 X IQR yule for outliers 

An observation is called an outlier if it falls more than 
1.5xIQRI-5 x IQR above the third quartile or below the 
first quartile. (p. 64) 


10% condition 
When taking an SRS of size n” from a population of size 


N,V; check that n<0.10N.n < 0.10N. (p. 419) 


68—95-99.7 rule 

(also known as the empirical rule) In the Normal 
distribution with mean p and standard deviation 0,7° (a) 
approximately 68% of the observations fall within o@% of the 
mean 1,» (b) approximately 95% of the observations fall 
within 202¢ of p,/ and (c) approximately 99.7% of the 
observations fall within 303 of p./: (p. 117) 


addition rule for mutually exclusive events 
If A and B are mutually exclusive events, P(A or 
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Espanol 


regla 1.5xla.... 
cuartiles para 
Se le dice valoi 
observacion si 
1.5xlal.5 x la 
cuartiles por er 
cuartil o por de 
cuartil. (pag. 6. 


condicion del 

Cuando se tom 
aleatoria sencil 
de una poblacit 
N, se verifica: 
n<0.10N. (pag. 41° 


regla 68-95-9 
A la que tambi 
“regla empirice 
distribucién no 
ut y desviacid 
(a) aproximada 
las observacior 
de o& de la me 
aproximadame 
observaciones | 
2020 de pM \ 
aproximadame 
las observacior 
de 303 de p.# 


regla de suma 
que se excluye 


alternative hypothesis 


HaHa The claim that we are trying to find evidence for ina 
significance test. (p. 554) 


anonymity 
The names of individuals participating in a study are not 
known even to the director of the study. (p. 279) 


association 

A relationship between two variables in which knowing the 
value of one variable helps predict the value of the other. If 
knowing the value of one variable does not help predict the 
value of the other, there is no association between the 
variables. (p. 19) 


back-to-back stemplot (also called back-to-back stem- 
and-leaf plot) 

Plot used to compare the distribution of a quantitative 
variable for two groups. Each observation in both groups is 
separated into a stem, consisting of all but the final digit, 
and a leaf, the final digit. The stems are arranged in a 
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Si Ay Bsone 
excluyen entre 
B)=P(A)+P(B) 
P(A or B) = 


(pag. 317) 


hipotesis 

Ha‘fa alternat 
proposicion de 
prueba de signi 
estadistica esta 
hallar evidenci. 
favor. (pag. 55: 


anonimato 

Cuando se des« 
nombres de las 
participan en u 
inclusive el dir 
los ignora. (pas 


asociacion 

Relacion entre 
la cual saber el 
variable facilit 
del valor de la 
valor de una ve 
la prediccion d 
otra, entonces | 
asociacion entr 


(pag. 19) 


diagrama de t 
(también se le « 
tallos y hojas c 
utiliza para cor 
distribucién de 
Cuantitativa en 


vertical column with the smallest at the top. The values 
from one group are plotted on the left side of the stem and 
the values from the other group are plotted on the right side 
of the stem. Each leaf is written in the row next to its stem, 
with the leaves arranged in increasing order out from the 
stem. (p. 39) 


bar graph 

Graph used to display the distribution of a categorical 
variable or to compare the sizes of different quantities. The 
horizontal axis of a bar graph identifies the categories or 
quantities being compared. The heights of the bars show the 
frequency or relative frequency for each value of the 
categorical variable. The graph is drawn with blank spaces 
between the bars to separate the items being compared. (p. 
10) 


bias 

The design of a statistical study shows bias if it is very 
likely to underestimate or very likely to overestimate the 
value you want to know. (p. 224) 
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Cada observac: 
ambos grupos : 
tallo, que consi 
digitos salvo el 
hoja, que const 
digito. Los tall: 
en una column. 
cifras mas peql 
valores de un g 
diagraman al le 
tallo y los valo 
se diagraman a 
del tallo. Cada 
el renglon que 
tallo, y las hoje 
orden ascendie 
extendiéndose 
partir del tallo. 


grafico de bar 
Se usa para ilu: 
distribucioén de 
categorizada o 
tamanio de dife 
cantidades. El \ 
grafico de barr. 
categorias o las 
se han de comfy 
de las barras m 
frecuencia o la 
relativa de cad 
variable categé 
dibujar con esr 
entre las barras 
las diversas cat 
desea compara 


sesgo 

E] disefo de ur 
estadistico refli 
existe una alta 
subestimar 0 sc 
valor que se bu 


biased estimator 

A statistic used to estimate a parameter is biased if the mean 
of its sampling distribution is not equal to the true value of 
the parameter being estimated. (p. 448) 


bimodal 
A graph of quantitative data with two clear peaks. (p. 33) 


binomial coefficient 
The number of ways of arranging kk successes among nt 
observations is given by the binomial coefficient (nk)=n!k! 


n n! 
(n-k)! ( 2 MON" for k=0,1,2,... nk = 0,1,2,... 


where n!=n(n-1)(n-2):...°3°2-1 
n! =n(n—1)(n—2)-...-3-2-1..4 pi=1 0! = 1. (p. 
407) 


binomial distribution 

In a binomial setting, suppose we let X=the number of 
successes. = the number of successes. The probability 
distribution of XX is a binomial distribution with 
parameters n’ and p,?> where n” is the number of trials of 
the chance process and p? is the probability of a success on 
each trial. (p. 406) 
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calculador ses 
La estadistica ¢ 
computar un peé 
sesgada si la m 
distribucioén de 
equivale al valk 
parametro que 
computando. (] 


bimodal 
Grafico de datc 
con dos picos | 


(pag. 33) 


coeficiente bin 
La cantidad de 
organizar kk a 
observaciones 
el coeficiente E 
(nk)=n!k!(n-k: 


n — n! 
kb} Mn 


_.,n& = 0,1, 2,. 
n!=n(n-1)(n-2 


distribucién b 
En un entorno” 
supongamos ql 
X=la cantidad 
X = la cantid 
La distribucién 
probabilidad de 
distribucion bil 
parametros n/t 
n? es la cantid 
proceso de prol 
la probabilidad 
cualquiera de |: 
406) 


binomial probability formula 

If XX has the binomial distribution with n” trials and 
probability p? of success on each trial, the possible values 
of XX are 0,1,2,....n.9; 1, 2;-++ 57 Tf kk is any one of 
these values, P(X=k)=(nk)pk(1—-p)n-k. 


P(X =k)= a) p*(1—p)*™. 
7 (p. 408) 


binomial random variable 
The count XX of successes in a binomial setting. The 


possible values of XX are 0,1,2,...,n.9) 1; 2)--+5™ (p, 
406) 


binomial setting 

Arises when we perform several independent trials of the 
same chance process and record the number of times that a 
particular outcome occurs. The four conditions for a 
binomial setting: 


e Binary? The possible outcomes of each trial can be 
classified as “success” or “failure.” 


e Independent? Trials must be independent; that is, 
knowing the result of one trial must not tell us anything 
about the result of any other trial. 


e Number? The number of trials n” of the chance process 
must be fixed in advance. 


e Same probability? There is the same probability p? of 
success on each trial. (p. 404) 
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formula de pr 
binomial 

Si XX tiene la 
binomial con n 
probabilidad p. 
cada ensayo, Ic 
valores de XX 
ie oe: 
cualquiera de e 
P(X=k)=(nk)p! 


pecan) =(") ear 


variable aleati 
La cuenta XX 
entorno binom: 
posibles de X4 
te oe 


entorno binon 
Surge cuando s 
ensayos indepe 
mismo proceso 
y se anota la ce 
que se produce 
dado. Las cuati 
que definen un 
binomial son: 


e ¢Binario? Lc 
posibles de ¢ 
pueden clasi 
“acierto” o “ 


e ¢Independie 
han de ser in 
decir, saber ( 
ensayo no de 
acerca del re 
ensayo. 

e ¢Numero? L 
ensayos n/t | 
probabilidad 
con anticipa: 


block 

Group of experimental units that are known before the 
experiment to be similar in some way that is expected to 
affect the response to the treatments. (p. 257) 


boxplot 

A visual representation of the five-number summary. The 
box spans the quartiles and shows the variability of the 
central half of the distribution. The median is marked within 
the box. Lines extend from the box to the smallest and 
largest observations that are not outliers. Outliers are 
marked with a special symbol such as an asterisk (*). (p. 68) 


categorical variable 
A variable that assigns labels that place each individual into 
a particular group, called a category. (p. 3) 


census 
Study that attempts to collect data from every individual in 
the population. (p. 221) 
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e ;Misma prol 
la misma prc 
lograr un aci 
ensayo. (pag 


bloque 

Grupo de unide 
experimentales 
experimento se 
similares de als 
previsible que . 
respuesta a los 


(pag. 257) 


diagrama de c 
Representaci6r 
resumen de cin 
abarca los cuar 
variabilidad de 
de la distribuci 
Caja se marca |. 
lineas se extier 
caja a las obser 
pequefia y mas 
son valores ati] 
valores atipico: 
un simbolo esp 
asterisco (*). (J 


variable categ 
Variable que a: 
etiqueta a cada 
colocarlo dentr 
particular, conc 
categoria. (pag 


censo 
Un estudio en ¢ 
recoger datos a 


central limit theorem (CLT) 

In an SRS of size n? from any population with mean p 
and finite standard deviation 0,7» when n’ is large, the 
sampling distribution of the sample mean x @ is 
approximately Normal. (p. 476) 


Chebyshev’s inequality 
In any distribution, the proportion of observations falling 


within kk standard deviations of the mean is at least 1—1k2. 


1 
k ” (p. 119) 


chi-square distribution 

A distribution that is defined by a density curve that takes 
only non-negative values and is skewed to the right. A 
particular chi-square distribution is specified by giving its 
degrees of freedom. (p. 715) 


chi-square test statistic 

Measure of how far the observed counts are from the 
expected counts. The formula is 

¥, (Observed Count-Expected Count)2Expected Count 


where the sum is over all possible values of the categorical 
variable or all cells in the two-way table. (p. 712) 


individuo en la 
221) 


teorema del li 
Traza una mue 
sencilla de tam 
de una poblacit 
uM y una desvi 
finita de o.0- E 
limite central n 
cuando n’t es ¢ 
distribuci6n de 
media de la mu 
aproximadame 
476) 


desigualdad d 
En cualquier di 
proporcion de « 
que yacen dent 
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chi-square test for goodness of fit 

A test of the null hypothesis that a categorical variable has a 
specified distribution. (p. 718) For more details, see the 
inference summary on page T-16. 


chi-square test for homogeneity 

A test of the null hypothesis that the distribution of a 
categorical variable is the same for two or more 
populations/treatments. (p. 737) For more details, see the 
inference summary on page T-16. 


chi-square test for independence 

A test of the null hypothesis that there is no association 
between two categorical variables in the population of 
interest. (p. 744) For more details, see the inference 
summary on page T-16. 


cluster sample 

Sample obtained by classifying the population into groups 
of individuals that are located near each other, called 
clusters, and then choosing an SRS of the clusters. All 
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prueba de ji ci 
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prueba de ji ci 
independenciz 
Prueba de la hi 
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Muestra que se 
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individuals in the chosen clusters are included in the sample. 


(p. 230) 


coefficient of determination r27” 

A measure of the percent reduction in the sum of squared 
residuals when using the least-squares regression line to 
make predictions, rather than the mean value of y.4 In 
other words, rar? measures the percent of the variability in 
the response variable that is accounted for by the least- 
squares regression line. (p. 190) 


comparison 
Experimental design principle. Use a design that compares 
two or more treatments. (p. 247) 


complement 


. AC..: 
The complement of event A, written as AC,‘* » is the event 
that A does not occur. (p. 316) 


complement rule 
The probability that an event does not occur is 1 minus the 
probability that the event does occur. In symbols, 


pcac)=1-p(ay.P (A®) = 1 = P(A): @, 316) 
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completely randomized design 
Design in which the experimental units are assigned to the 
treatments completely by chance. (p. 255) 


components 
Individual terms that are added together to produce the test 


a, 
statistic y2:X * 


mponent=(Observed Count-Expected Count)2Expected Count 


(p. 752) 


conditional distribution 

Describes how the values of one variable vary among 
individuals who have a specific value of another variable. 
There is a separate conditional distribution for each value of 


the other variable. (p. 18) 


conditional probability 

Probability that one event happens given that another event 
is already known to have happened. Suppose we know that 
event A has happened. Then the probability that event B 
happens given that event A has happened is denoted by 


P(BIA).P (B\A). To find the conditional probability 
P(BIA),- (B\A), use the formula 
. P(ANMB) 
P (B|A) =———— 


B|A)=P(AnB)P(A) P(A) 
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probabilidad « 
La probabilida: 
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(p. 331) condicional P( 


se usa la formu 
P(B|A)=P(AnB)P(Z 


. P(ANE 
P(B\A) = 
P(A) 
(pag. 331) 
conditional relative frequency frecuencia rel 
Gives the percent or proportion of individuals that have a condicional 
specific value for one categorical variable among Ofrece el porce 
individuals who share the same value of another categorical proporcion de j 
variable (the condition). (p. 17) tienen un valor 


una variable ca 
individuos que 
mismo valor de 
categorica (cor 
17) 


confidence interval Gives intervalo de ct 
an interval of plausible values for a parameter. The interval Ofrece un inter 

is calculated from the data and has the form plausibles para 
int estimate+margin of error intervalo se col 
los datos y tien 

or, alternatively, Estimado de punto+t 


Estimado de punto + margen de error 


tistic+(critical value): (standard deviation of statistic) _ . 
o alternativame 


. 497 

(P ) estadistica+(valor 
critico) ‘ (desviacion 
ESLACISTICA) ssi (nix 0) 

(pag. 497) 

confidence level CC nivel de confiz 

Success rate of the method for calculating the confidence La tasa de acie 

interval. In CC'% of all possible samples, the method would con el que se ci 

yield an interval that captures the true parameter value. (p. intervalo de co 

497) C'% de todas le 


posibles, el mé 
un intervalo qu 
verdadero del | 
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confidential 
A basic principle of data ethics that requires that an 
individual’s data be kept private. (p. 279) 


confounding 

When two variables are associated in such a way that their 
effects on a response variable cannot be distinguished from 
each other. (p. 243) 


continuous random variable 
Variable that can take any value in an interval on the 
number line. The probability distribution of a continuous 


random variable is described by a density curve. The 
probability of any event is the area under the density curve 


and above the values of the variable that make up the event. 


(p. 361) 


control 

Experimental design principle that mandates keeping other 
variables that might affect the response the same for all 
experimental units. (p. 252) 


control group 

Experimental group whose primary purpose is to provide a 
baseline for comparing the effects of the other treatments. 
Depending on the purpose of the experiment, a control 
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grupo de cont 
Grupo experim 
primario es est. 
base mediante 


group may be given a placebo or an active treatment. (p. 
248) 


convenience sample 
Sample selected by taking from the population individuals 
that are easy to reach. (p. 223) 


correlation 

1? Measures the direction and strength of the linear 
relationship between two quantitative variables. Correlation 
is usually written as r.”- We can calculate r” using the 
formula r=1n-1)'(xi-x sx)(yi-y sy). 


a3 5, (areal (arra) 
a Sz Sy (pp. 153, 156) 


critical value 

Multiplier that makes a confidence interval wide enough to 
have the stated capture rate. The critical value depends on 
both the confidence level CC and the sampling distribution 
of the statistic. (p. 504) 


cumulative relative frequency graph 

A cumulative relative frequency graph plots a point 
corresponding to the cumulative relative frequency in each 
interval at the smallest value of the next interval, starting 
with a point at a height of 0% at the smallest value of the 
first interval. Consecutive points are then connected with a 
line segment to form the graph. (p. 93) 
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data analysis 
Process of describing data using graphs and numerical 
summaries. (p. 4) 


density curve 

Curve that (a) is always on or above the horizontal axis and 
(b) has area exactly 1 underneath it. A density curve 
describes the overall pattern of a distribution. The area 
under the curve and above any interval of values on the 
horizontal axis is the proportion of all observations that fall 
in that interval. (p. 110) 


discrete random variable 

Variable that takes a fixed set of possible values with gaps 
between. The probability distribution of a discrete random 
variable gives its possible values and their probabilities. The 
probability of any event is the sum of the probabilities for 
the values of the variable that make up the event. (p. 362) 
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y sus probabili 
probabilidad de 
evento es la sul 
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distribution 
Tells what values a variable takes and how often it takes 
these values. (p. 4) 


dotplot 

A graph that displays the distribution of a quantitative 
variable by plotting each data value as a dot above its 
location on a number line. (p. 30) 


double-blind 

An experiment in which neither the subjects nor those who 
interact with them and measure the response variable know 
which treatment a subject received. (p. 249) 


event 

Any collection of outcomes from some chance process. An 
event is a subset of the sample space. Events are usually 
designated by capital letters, like A, B, C, and so on. (p. 
315) 


expected counts 
Expected numbers of individuals in the sample that would 


fall in each cell of the one-way or two-way table if Holo 
were true. (p. 711) 
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vias, Si Hoo 
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experiment 
A study in which researchers deliberately impose treatments 
on individuals to measure their responses. (p. 241) 


experimental unit 

The object to which a treatment is randomly assigned. When 
the experimental units are human beings, they are often 
called subjects. (p. 245) 


explanatory variable 
Variable that may help explain or predict changes in a 
response variable. (p. 153) 


exponential model 


Relationship of the form y=abx.¥ = ab". Tf the relationship 
between two variables follows an exponential model and we 
plot the logarithm (base 10 or base e€) of y¥ against x,”> 
we should observe a straight-line pattern in the transformed 
data. (p. 803) 


extrapolation 

Use of a regression model for prediction far outside the 
interval of values of the explanatory variable x2 used to 
obtain the model. Such predictions are often not accurate. 
(p. 178) 
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factor 
Explanatory variable in an experiment. (p. 246) 


factorial 

For any positive whole number n,’: its factorial n!”! is 
n!=n-(n-1):(n-2)-...°3°2°1 
nl=n-(n—1)-(n—2)-...-3+2°1 5, addition, we 
define 0!=1.0! = 1. (p. 407) 


fail to reject 


Hoo If the observed result is not unlikely to occur when 


the null hypothesis is true, we should fail to reject Hoo 
and say that we do not have convincing evidence for Ha. 


Aq. (p. 558) 


first quartile 
qiQ1 If the observations in a data set are ordered from 
smallest to largest, the first quartile q191 is the median of 


the observations whose position is to the left of the median. 


(p. 64) 


five-number summary 
Smallest observation, first quartile, median, third quartile, 
and largest observation, written in order from smallest to 
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tabla de frecu 
Muestra la cue: 
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frequency table 
Table that displays the count (frequency) of observations in 
each category or interval. (p. 9) 


regla general « 
Si Ay Bsond 
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general addition rule 
If A and B are two events resulting from a chance process, 
then the probability that event A or event B (or both) occur 
is P(A or B)=P(A U B)=P(A)+P(B)—-P(A NB). 


P(AorB) = P(AUB) = P(A) + P(B) — P(ANMB). probabilidad de 

(p. 320) o el evento B (: 
es P(A o 
B)=P(A UB)=1 


—P(ANB) ramen 
320) 


general multiplication rule 
The probability that events A and B both occur can be found 
using the formula P(ANB)=P(A): P(BJA) 

the conditional probability that event B occurs given that 
event A has already occurred. (p. 338) 


regla general « 
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La probabilida 
los eventos A 
determinar util 
P(AnB)=P(A) 
P(ANMB) = P(A): P(BIA) 
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geometric distribution 

In a geometric setting, suppose we let Y=the number of 
trialsY = the number of trials jt takes to get a success. 
The probability distribution of YY is a geometric 
distribution with parameter p,?> the probability of a success 
on any trial. The possible values of Y are 1,2,3,.... 


jer PS ee (p. 423) 


geometric probability formula 
If YY has the geometric distribution with probability pP of 
success on each trial, the possible values of YY are 1,2,3, 


vets 2535006 TE KK is any one of these values, P(Y=k)= 


(1-p)k-1p.P (¥ = k) = (1 p)""P. (p, 423) 


geometric random variable 
The number of trials YY that it takes to get a success in a 
geometric setting. (p. 423) 


geometric setting 

Arises when we perform independent trials of the same 
chance process and record the number of trials it takes to get 
one success. On each trial, the probability pP of success 
must be the same. (p. 422) 
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histogram 
Graph that displays the distribution of a quantitative 
variable by showing each interval of values as a bar. The 
heights of the bars show the frequencies or relative 
frequencies of values in each interval. (p. 40) 


histograma 

Muestra la dist 
variable cuanti 
expresa Cada ir 
valores en forn 
altura de las ba 
frecuencias o |: 
relativas de los 
intervalo. (pag. 


independent events eventos indept 
Two events are independent if the occurrence of one event Dos eventos so 
does not change the probability that the other event will si el hecho de ¢ 
happen. In other words, events A and B are independent if no cambia la p! 
esis : P(AIB) = P(AIBC) = P(A que el otro suc 
[B= P(A|BC)=P(A) (AB) ( | ) “) eventos A y B 
independientes 


= P((A|B)=P(A|BC)=P 
P(B\A) = P(BIAC) = PB) P(A[B) = P(A[E 
B|A)=P(B|AC)=P(B) 


i 
(p. 335) 
P(B|A)=P(B|AC)=P 
P(B\|A) =P (Ble 
(pag. 335) 
independent random variables variables aleai 
If knowing the value of XX does not help us predict the independiente 
value of Y,¥ > then XX and YY are independent random Si conocer los: 
variables. (p. 390) no Nos sive pa 


valor de y,Y; 
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individual 
An object described by a set of data. Individuals may be 
people, animals, or things. (p. 2) 


inference 
Drawing conclusions that go beyond the data at hand. (pp. 
6, 270) 


inference about cause and effect 

Conclusion from the results of an experiment that the 
treatments caused the difference in responses. Requires a 
well-designed experiment in which the treatments are 
randomly assigned to the experimental units. (p. 280) 


inference about a population 

Conclusion about the larger population based on sample 
data. Requires that the individuals taking part in a study be 
randomly selected from the population of interest. (p. 280) 


influential observation 

An observation is influential for a statistical calculation if 
removing it would markedly change the result of the 
calculation. Points that are outliers in the x# direction of a 
scatterplot are often influential. (p. 200) 
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informed consent 

Basic principle of data ethics that states that individuals 
must be informed in advance about the nature of a study and 
any risk of harm it may bring. Participating individuals must 
then consent in writing. (p. 279) 


institutional review board 
Board charged with protecting the safety and well-being of 


the participants in advance of a planned study and with 
monitoring the study itself. (p. 279) 


interquartile range 


IQR=Q3-Q1 JOR = Q3 — Q1- (p. 64) 


intersection 

The event “A and B” is called the intersection of events A 
and B. It consists of all outcomes that are common to both 
events, as is denoted by ANB.A 1 B. (p. 323) 
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tienen un valor 
una variable ca 
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joint relative frequency 
Gives the percent or proportion of individuals that have a 
specific value for one categorical variable and a specific 

value for another categorical variable. (p. 15) 


condicion de c 
Supongamos q 
representa el mi 


Large Counts condition 

Suppose XX is the number of successes and p\p is the 
proportion of successes in a random sample of size n? from 
a population with proportion of successes p.?: The Large 
Counts condition says that the distribution of XX and the 
distribution of p\P will be approximately Normal if np>10 


np > 10 and n(1-p)>10." (1 —p) = 10. (p. 420) 


y p’P es la prc 
aciertos en una 
aleatoria de tar 
de una poblacit 
proporcion de i 
condicion de c 
establece que |. 
XX y la distril 
sera aproximac 
si np>10"P = 
«ow (pag. 420) 


Large Counts condition for a chi-square test condicion de c 
It is safe to use a chi-square distribution to perform en la prueba d 
calculations if all expected counts are at least 5. (p. 717) Se puede utiliz 
la distribucion 
para realizar cc 
las cuentas pre 
menos 5. (pag. 


ley de las cifra 
Si se observan 


law of large numbers 
If we observe more and more repetitions of any chance 
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process, the proportion of times that a specific outcome 
occurs approaches a single value, which we call the 
probability of that outcome. (p. 301) 


least-squares regression line 
The line that makes the sum of the squared residuals as 
small as possible. (p. 183) 


level 
Specific value of an explanatory variable (factor) in an 
experiment. (p. 246) 


margin of error 

The difference between the point estimate and the true 
parameter value will be less than the margin of error in CC 
% of all samples, where CC is the confidence level. (p. 271) 


marginal relative frequency 
Gives the percent or proportion of individuals that have a 
specific value for one categorical variable. (p. 14) 


matched pairs design 
Common form of blocking for comparing just two 
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treatments. In some matched pairs designs, each subject 
receives both treatments in a random order. In others, the 
subjects are matched in pairs as closely as possible, and 
each subject in a pair is randomly assigned to receive one of 
the treatments. (p. 260) 


mean 
The average of all the individual data values in a 
distribution of quantitative data. To find the mean, add all 
the values and divide by the total number of observations. If 
the data come from a sample, use x © to denote the sample 
mean. If the data come from a census, use p to denote the 
population mean. (p. 54) 


mean of a density curve 
Point at which a density curve would balance if made of 
solid material. (p. 112) 


mean (expected value) of a discrete random variable 
Describes the variable’s long-run average value over many, 
many repetitions of the same chance process. To find the 


mean (expected value) of xx » multiply each possible 
value by its probability, then add all the products: 


C=E(X)=x1 p1+x2p2+x3 p3+...=); xi pi 
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median media 

The midpoint of a distribution; the number such that about El punto intern 
half the observations are smaller and about half are larger. distribucion, cc 
To find the median of a distribution: (1) Arrange all que aproximad 
observations in order of size, from smallest to largest. (2) If de las observac 
the number of observations n? is odd, the median is the pequenas y lar 
middle observation in the ordered list. (3) If the number of grandes. Para t 
observations n’t is even, the median is the average of the una distribucié 
two middle observations in the ordered list. (p. 57) organizan toda 
observaciones | 
tamanio, de las 
las mas grande 
cantidad de ob: 
es impar, la me 
observacion en 
lista organizadi 
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(pag. 57) 
median of a density curve media de una 
The point with half the area under the curve to its left and densidad 
the remaining half of the area to its right. (p. 112) E] punto en el « 


area que esta d 
esta a la izquie 
mitad del area | 


(pag. 112) 
mode modo 
Value in a distribution having the greatest frequency. En una distribu 
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multiple comparisons 

Problem of how to do many comparisons at once with an 
overall measure of confidence in all our conclusions. (p. 
728) 


multiplication rule for independent events 
If A and B are independent events, then the probability that 
A and B both occur is P(ANB)=P(A): P(B). 


mutually exclusive (disjoint) events 
Two events A and B that have no outcomes in common and 
so can never occur together. That is, P(A and B)=0. 


negative association 
When above-average values of one variable tend to 
accompany below-average values of the other. (p. 157) 


nonresponse 
Occurs when an individual chosen for the sample can’t be 
contacted or refuses to participate. (p. 233) 
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Normal approximation to a binomial distribution 
Suppose that a count XX of successes has the binomial 
distribution with n” trials and success probability p.?- 
When n?? is large, the distribution of XX is approximately 
Normal with mean np”? and standard deviation np(1-p). 


v np (1 — p). We use this Normal approximation when 
np>10"P = 10 and n(1-p)=10.” (1—p) > 10. (p. 420) 


Normal curve 
Important kind of density curve that is symmetric, single- 
peaked, and bell-shaped. (p. 114) 


Normal distribution 

Distribution described by a Normal density curve. Any 
particular Normal distribution is completely specified by 
two numbers, its mean p/ and standard deviation 0.7. The 
mean of a Normal distribution is at the center of the 
symmetric Normal curve. The standard deviation is the 
distance from the center to the change-of-curvature points 
on either side. (p. 114) 


Normal/Large Sample condition 
A condition for performing inference about a mean, which 
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requires that the data come from a Normally distributed 
n > 30). 


population or that the sample size is large (n>30).( 
When the sample size is small and the shape of the 
population distribution is unknown, a graph of the sample 
data shows no strong skewness or outliers. When 
performing inference about a difference between two 
means, check that this condition is met for both samples. (p. 
532) 


Normal probability grafico de pro 
plot Plot used to assess whether a data set follows a Normal Normal 

distribution. If the points on a Normal probability plot lie Se usa para eve 
close to a straight line, the plot indicates that the data are conjunto de da 
approximately Normally distributed. (p. 133) distribucion no 
puntos en un g! 
probabilidad N 
cerca de una lil 
grafico indica ¢ 
aproximadame 
Normalmente. 


null hypothesis Hoo 

Claim we weigh evidence against in a significance test. 
Often the null hypothesis is a statement of “no difference.” 
(p. 554) 


hipotesis nula 
Contrapeso de 
una prueba de : 
menudo la hipc¢ 
declaracion de 
diferencia.” (p< 


estudio de obs 
Se observan lo: 
miden las varia 


observational study 
Study that observes individuals and measures variables of 
interest but does not attempt to influence the responses. (p. 
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242) 


observed counts 
Actual numbers of individuals in the sample that fall in each 
cell of the one-way or two-way table. (p. 711) 


one-sample tt interval for a mean difference 
Confidence interval used to estimate a population mean. (p. 
679) For more details, see the inference summary on page 
T-16. 


one-sample té test for a mean 

A test of the null hypothesis that a population mean is equal 
to a specified value. (p. 529) For more details, see the 
inference summary on page T-16. 


one-sample zz interval for a proportion 

Confidence interval used to estimate a population 
proportion. (p. 511) For more details, see the inference 
summary on page T-16. 


one-sample z2 test for a proportion 
A test of the null hypothesis that a population proportion is 
equal to a specified value. (p. 573) For more details, see the 
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inference summary on page T-16. 


one-sided alternative hypothesis 

An alternative hypothesis that states that a parameter is 
larger than the null hypothesis value or that states that the 
parameter is smaller than the null value. (p. 555) 


one-way table 
Table used to display the distribution of a single categorical 
variable. (p. 709) 


outlier 

Individual value that falls outside the overall pattern of a 
distribution. Call an observation an outlier if it falls more 
than 1.5 JQR above the third quartile or more than 1.5 IQR 
below the first quartile. (pp. 34, 99) 


outlier in regression 

Observation that lies outside the overall pattern of the other 
observations. Points that are outliers in the y¥ direction but 
not the x# direction of a scatterplot have large residuals. 
Other outliers may not have large residuals. (p. 157) 


2123 


cual la proporc 
poblacion es ig 
especificado. (] 
mas informacic 
resumen de inf 
pagina T-16. 


hipOotesis alter 
unilateral 

Hipotesis alten 
que un parame' 
que el valor de 
O que indica qu 
mas pequeno q 
(pag. 555) 


tabla de una v 
Se usa para mo 
distribucion de 
categorizada. ( 
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PP-value 

The probability of getting evidence for the alternative 
hypothesis Hatta as strong or stronger than the observed 
evidence when the null hypothesis Hoo js true. The 
smaller the PP-value, the stronger the evidence against HO 
A and in favor of Ha‘ta provided by the data. (p. 558) 


paired data 

The result of recording two values of the same quantitative 
variable for each individual or for each pair of similar 
individuals. To analyze paired data, start by computing the 
difference for each pair. Then make a graph of the 
differences. Use the mean difference and the standard 
deviation of the differences as summary statistics. (p. 673) 


paired té interval for a mean difference 

Confidence interval used to estimate a population (true) 
mean difference. (p. 677) For more details, see the inference 
summary on page T-16. 
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paired té test for a mean difference 

A test of the null hypothesis that a population (true) mean 
difference is equal to a specified value, usually 0. (p. 680) 
For more details, see the inference summary on page T-16. 


parameter 
A number that describes some characteristic of the 
population. (p. 442) 


percentile 
The pth percentile of a distribution is the value with p? 


percent of the observations less than it. (p. 91) 


pie chart 

Chart that shows the distribution of a categorical variable as 
a “pie” whose slices are sized by the counts or percents for 
the categories. A pie chart must include all the categories 
that make up a whole. (p. 10) 


placebo 
A treatment that has no active ingredient but is otherwise 
like other treatments. (p. 244) 


placebo effect 
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efecto placebo 


Describes the fact that some subjects respond favorably to 
any treatment, even an inactive one (placebo). (p. 249) 


point estimate 
Specific value of a point estimator from a sample. (p. 494) 


point estimator 
Statistic that provides an estimate of a population parameter. 
(p. 494) 


pooled or combined sample proportion 

The overall proportion of successes in the two samples is 
=count of successes in both samples combinedcount of 
lividuals in both samples combined=X1+X2n1+n2_ 


(p. 632) 


population 
In a statistical study, the entire group of individuals we want 
information about. (p. 221) 


population (true) regression line 


Regression line py=R0+B1x4y ~ Bo + Pix based on the 
entire population of data. (p. 770) 


2126 


Describe el hec 
algunos sujetos 
manera favoral 
tratamiento, in 
inactivo (con p 
249) 


estimado de pi 
El valor especi 
estimador de p 
una muestra. ([ 


estimador de | 
Estadistica que 
estimado de un 
poblacion. (pas 


proporcion co 

muestra 

La proporcion 

en las dos mue 
p= cuenta de aciert« 
muestras combinad¢ 
individuos enambas 
combinadas=X1+X: 


cuenta de aciertos en 


. ambas muestras combinadas 1 +Xq 
p= 


cuenta de individuos en nyt+ng 


ambas muestras combinadas 


(pag. 632) 


poblacion 

En un estudio ¢ 
poblacion es el 
de individuos s 
deseamos conti 
informacion. ({ 


linea de regre: 
poblacion 
La linea de reg 


py=B0+B1xMy 


positive association 

When above-average values of one variable tend to 
accompany above-average values of the other and below- 
average values also tend to occur together. (p. 157) 


power 
The probability that a test will reject Hoo at a chosen 


significance level a@ when a specified alternative value of 
the parameter is true. The power of a test against any 
alternative is 1 minus the probability of a Type II error for 
that alternative; that is, power=1-B.Power = 1 — B. (p, 
598) 


power model 


Relationship of the form y=axp.¥ = @£"- When experience 
or theory suggests that the relationship between two 
variables is described by a power model, you can transform 
the data to achieve linearity in two ways: (1) raise the values 
of the explanatory variable x% to the p? power and plot the 


points (xp,y), ("> Y)> or (2) take the pth root of the values 
of the response variable y¥ and plot the points (x,yp). 


(2, 9): If you don’t know what power to use, taking the 
logarithms of both variables should produce a linear pattern. 
(p. 796) 


2127 


basada en la to 
poblacion de d. 


asociacion pos 
Cuando los val 
del promedio d 
suelen acompa 
por encima del 
otra, y los valo 
del promedio ti 
suceder juntos. 


poder 

La probabilida 
prueba rechace 
nivel de signifi 
cuando un valc 
especificado de 
verdadero. El 5 
prueba con res] 
alternativa es 1 
probabilidad de 
para dicha alte! 
poder=1-B.P™ 
(pag. 598) 


modelo de poc 
Relacion de la 
y= az’. Cua 
experiencia 0 u 
que la relaci6n 
variables la des 
de poder, se pu 
los datos para ¢ 
linealidad de d 
elevar los valoi 
explicativa xz 
y trazar los pur 
(2”,¥)s o(2)t 
los valores de | 
respuesta y¥ y 


predicted value 

y’Y (read “yY hat’) is the predicted value of the response 
variable yY for a given value of the explanatory variable x. 
x. (p. 181) 


probability 

A number between 0 and 1 that describes the proportion of 
times an outcome of a chance process would occur in a very 
long series of repetitions. (p. 301) 


probability distribution 


Gives the possible values of a random variable and their 
probabilities. (p. 362) 


probability model 

Description of some chance process that consists of two 
parts: a sample space S that lists all possible outcomes and a 
probability for each outcome. (p. 314) 


quantitative variable 
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variable cuant 


Variable that takes number values that are quantities— 
counts or measurements. (p. 3) 


quartiles 
The quartiles of a distribution divide the ordered data set 
into four groups having roughly the same number of values. 


(p. 64) 


random assignment 

Experimental design principle. Use chance to assign 
experimental units to treatments. Doing so helps create 
roughly equivalent groups of experimental units by 
balancing the effects of other variables among the treatment 
groups. (p. 251) 


Random condition 

A condition for performing inference, which requires that 
the data come from a random sample from the population of 
interest or from a randomized experiment. When comparing 
two or more populations or treatments, check that the data 
come from independent random samples from the 
populations of interest or from groups in a randomized 
experiment. (p. 512) 
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random sampling 
Using a chance process to determine which members of a 
population are chosen for the sample. (p. 225) 


random variable 
Variable that takes numerical values that describe the 
outcomes of some chance process. (p. 362) 


randomization distribution 
Distribution of a statistic (like p\1—p\2P?1 ~ P2 or x\1-x/2 


<1 — £2) in repeated random assignments of experimental 


units to treatment groups assuming that the specific 
treatment received doesn’t affect individual responses. 
When the conditions are met, usual inference procedures 
based on the sampling distribution of the statistic will be 
approximately correct. (p. 636) 


randomized block design 


Experimental design begun by forming blocks consisting of 
individuals that are similar in some way that is important to 


the response. Random assignment of treatments is then 
carried out separately within each block. (p. 257) 
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range 

A measure of variability equal to the distance between the 
minimum value and the maximum value of a distribution. 
That is, range=maximum-minimum. 

range = maximum — minimum. (p. 60) 


regression line 

Line that describes how a response variable y¥ changes as 
an explanatory variable x2 changes. We often use a 
regression line to predict the value of y¥ for a given value 
of x.£. (p. 176) 


reject 
Hoo If the observed result is too unlikely to occur just by 
chance when the null hypothesis is true, we can reject HO 


Ho and say that there is convincing evidence for HaHa: (p. 
558) 


relative frequency table 
Table that shows the percents (relative frequencies) of 
observations in each category or interval. (p. 9) 


replication 
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replicacion 


Experimental design principle. Use enough experimental 
units in each group so that any differences in the effects of 
the treatments can be distinguished from chance differences 
between the groups. (p. 253) 


residual 
Difference between an actual value of the response variable 
and the value predicted by the regression line: 


sidual=actual y—predicted y=y—y’ 


(p. 179) 


residual plot 

A scatterplot that plots the residuals on the vertical axis and 
the explanatory variable on the horizontal axis. Residual 
plots help us assess whether a regression model is 
appropriate. (p. 185) 


resistant measure 
Statistic that is not affected very much by extreme 
observations. (p. 56) 


response bias 
Occurs when there is a consistent pattern of inaccurate 
responses to a survey question. (p. 234) 
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residual = real y — proyectado y = y — ¥ 


(pag. 179) 


grafico residu 
Grafico de disr 
los residuales s 
vertical y la va 
sobre el eje hoi 
trazados residu 
a evaluar si el 1 
regresion es ap 
185) 


medida resiste 
Estadistica que 
afectada por ot 
extremas. (pag. 


sesgo de la res 
Sucede cuando 
consistente de - 
imprecisas a ul 
una encuesta. ( 


response variable 
Variable that measures an outcome of a study. (p. 153) 


roundoff error 
Difference between the calculated approximation of a 
number and its exact mathematical value. (p. 16) 


sample 
Subset of individuals in the population from which we 
collect data. (p. 221) 


sample regression line (estimated regression line) 


Least-squares regression line y\=b0+b1xY = bo + hx 
computed from the sample data. (p. 770) 


sample space S 


Set of all possible outcomes of a chance process. (p. 314) 


sample survey 


Study that uses an organized plan to choose a sample that 
represents some specific population. We base conclusions 


about the population on data from the sample. (p. 222) 
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Variable que 1r 
de un estudio. | 


error de redo1 
La diferencia e 
aproximacion ¢ 
valor matemati 
16) 


muestra 

Subconjunto de 
poblacion a pal 
recogen datos. 


linea de regre: 
muestra (linea 
estimada) 

La linea de reg 
minimos cuadr 
y=bo +b 2, 
partir de los da 
(pag. 770) 


espacios S de | 
El conjunto de 
resultados posi 
proceso de prol 
314) 


valoracion de 
Estudio que us. 
organizado par 
muestra que re| 
poblacién espe 
las conclusione 
poblacion en d. 
la muestra. (pa 


sampling distribution 
The distribution of values taken by a statistic in all possible 
samples of the same size from the same population. (p. 444) 


sampling distribution of a sample mean 

x ® The distribution of values taken by the sample mean 
x ©& inall possible samples of the same size from the same 
population. (p. 468) 


sampling distribution of a sample proportion 


prP The distribution of values taken by the sample 


proportion p\P in all possible samples of the same size 
from the same population. (p. 459) 


sampling distribution of a slope 
b191 The distribution of values taken by the sample slope 


b141 in all possible samples of the same size from the same 
population. (p. 772) 


sampling distribution of 

p\1-p\2P1 ~ P2 The distribution of values taken by the 
statistic p\1-p’2P1 ~ P2 in all possible samples of size n1 
4 from population 1 and all possible samples of size n2"2 
from population 2. (p. 623) 


2134 


distribucion d 
La distribucion 
tomados por la 
todas las muest 
mismo tamano 
misma poblaci: 


distribucion d 
media de una 
x & Distribuc: 
tomados de la} 
x © en todas li 
muestras del m 
la misma poble 


distribucion d 
una proporci6 
p\P Distribuci 
tomados de la ] 
muestral p/P ¢€ 
posibles muest 
tamano de la 
(pag. 459) 


distribucion d 
una pendiente 
b191 Distribuc 
tomados de la | 
muestral b191 
posibles muest 
tamano de la 
(pag. 772) 


distribucion d 
p1—-p/2P: — P2 
valores tomadc 
estadistica p\1 
todas las muest 
tamafio nlm de 


sampling distribution of 

x 1-x 2%1 — ©2 The distribution of values taken by the 
statistic x 1-x 2%1 — 2 jn all possible samples of size nl 
4 from population 1 and all possible samples of size n2"2 
from population 2. (p. 647) 


sampling variability 

The fact that different random samples of the same size 
from the same population produce different estimates. (pp. 
270, 443) 


scatterplot 

Plot that shows the relationship between two quantitative 
variables measured on the same individuals. The values of 
one variable appear on the horizontal axis, and the values of 
the other variable appear on the vertical axis. Each 
individual in the data appears as a point in the graph. (p. 
154) 


segmented bar graph 

Graph used to compare the distribution of a categorical 
variable in each of several groups. For each group, there is a 
single bar with “segments” that correspond to the different 
values of the categorical variable. The height of each 
segment is determined by the percent of individuals in the 
group with that value. Each bar has a total height of 100%. 


(p. 19) 
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de todas las mt 
del tamafio n2? 
poblacion 2. (p 


distribucion d 
x 1-x 271-2: 
valores tomadc 
estadistica x 1 
todas las muest 
tamafio n1!1 ¢ 
y de todas las r 
del tamafio n2’ 
poblacion 2. (p 


variabilidad d 
El hecho de qu 
aleatorias de ul 
tomadas de la 1 
producen estirr 
(pags. 270, 44: 


grafico de dis} 
Permite aprecii 
entre dos varial 
midiendo los n 
individuos. Lo: 
variable figural 
horizontal, y lo 
otra variable fi: 
vertical. Cada i 
datos figura co 
el grafico. (pag 


grafico de bar 
Se uSa para COI 
distribucién de 
categorizada e1 
varios grupos. 

hay una sola be 
“segmentos” ql 
a los diferentes 


side-by-side bar graph 

Graph used to compare the distribution of a categorical 
variable in each of several groups. For each value of the 
categorical variable, there is a bar corresponding to each 
group. The height of each bar is determined by the count or 
percent of individuals in the group with that value. (p. 19) 


significance level 

Fixed value o@ that we use as a cutoff for deciding whether 
an observed result is too unlikely to happen by chance alone 
when the null hypothesis is true. The significance level 
gives the probability of a Type I error. (p. 558) 


significance test 

Procedure for using observed data to decide between two 
competing claims (the null hypothesis and the alternative 
hypothesis). The claims are often statements about a 
parameter. (p. 552) 
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variable catego 
de cada segme! 
el porcentaje di 
grupo que teng 
Cada barra tier 
del 100%. (pag 


grafico de bar 
Se uSa para COI 
distribucion de 
categorizada e1 
varios grupos. 
de la variable c 
una barra que ¢ 
cada grupo. La 
la determina el 
porcentaje de ii 
grupo que teng 
(pag. 19) 


nivel de signif. 
fijo 

a@ que se usa | 
corte para deci: 
resultado obser 
demasiado imp 
suceder solo al 
hipotesis nula ¢ 
nivel de signifi 
probabilidad de 
(pag. 558) 


prueba de sig 
Procedimiento 
datos observad 
entre dos opcic 
compiten entre 
nula y la hipote 
Las opciones a 
enunciados ace 
parametro. (pa 


simple random sample (SRS) 

Sample chosen in such a way that every group of n7t 
individuals in the population has an equal chance to be 
selected as the sample. (p. 226) 


simulation 
Imitation of chance behavior, based on a model that 
accurately reflects the situation. (p. 304) 


single-blind 

An experiment in which either the subjects or those who 
interact with them and measure the response variable, but 
not both, know which treatment a subject received. (p. 249) 


skewness 

A distribution is skewed to the right if the right side of the 
graph (containing the half of the observations with larger 
values) is much longer than the left side. It is skewed to the 
left if the left side of the graph is much longer than the right 
side. (p. 32) 


slope 

sine that yY is a response variable (plotted on the 
vertical axis) and x® is an explanatory variable (plotted on 
the horizontal axis). A regression line relating y¥ to x@ has 
an equation of the form y4=b0+b1x. y = bo + 612. In this 
equation, b1)1 is the slope, the amount by which y¥ is 
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muestra aleati 
Muestra tomad 
que cada grupc 
individuos en | 
la misma oport 
escogido como 
226) 


simulacion 
Imitacion de cc 
basada en un 1 
la situacion cor 
304) 


ciego sencillo 

Experimento e) 
los sujetos o bi 
interactuan cor 
miden la variat 
pero no ambos. 
el tratamiento 
sujeto. (pag. 24 


asimetria 
Distribuci6on qu 
hacia la derecl 
del grafico (qu 
mitad de las ob 
valores mas gri 
mas larga que ¢ 
Esta sesgada h 
si el lado izqui: 
es mucho mas | 
derecho. (pag. 


pendiente 

Supongamos q 
variable de res] 
en el eje vertic 
variable explic 
el eje horizonte 
regresion que r 


predicted to change when x increases by one unit. (p. 181) x@ tiene una e¢ 
forma de y4=b 
Y = bo + bye. 
ecuacion, b1)1 
la cantidad mex 
predice que y¥ 
cuando x# ten; 
una unidad. (p< 


standard deviation desviacion esti 
sxS« Statistic that measures the typical distance of the sxS« Estadistic 
values in a distribution from the mean. It is calculated by distancia tipica 
finding an “average” of the squared distances and then una distribucié 
taking the square root. In symbols, media. Se com 
“promedio” de 
cuadrado a las 
computa la raiz 
simbolos se rej 
sxF1ln-1)(xi-x )2 


(pag. 61) 
standard deviation of a discrete random variable desviacion est: 
Square root of the variance of a discrete random variable variable aleat 


La raiz cuadrac 
de una variable 
discreta 0X2.«. 
estandar mide | 
= oy, (xj — px) Pi de los valores ¢ 
distribucion a [| 
media. En simt 
OX=>. (xi-pX)2pi 


(pag. 368) 


2 
oX2.7X* The standard deviation measures the typical 
distance of the values in a distribution from their mean. In 
symbols, 


desviacion est: 


standard deviation of the residuals (s)($ ) 


If we use a least-squares line to predict the values of a residuales (s)( 
response variable y¥ from an explanatory variable x,”> the Si se hace uso | 
standard deviation of the residuals (s$) is given by cuadrados min: 
residuals2n—2=)' (yi-y/i)2n-2 predecir los va 

variable de res] 
This value measures the size of a typical residual. That is, s de una variable 
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$ measures the typical distance between the actual y¥ 
values and the predicted y¥ values. (p. 189) 


standard error 

When the standard deviation of a statistic is estimated from 
data, the result is the standard error of the statistic. The 
standard error estimates how far the value of the statistic 
typically varies from the value it is trying to estimate. (p. 
513) 


standard Normal distribution 
Normal distribution with mean O and standard deviation 1. 
(p. 120) 


standard Normal table (Table A) 

Table of areas under the standard Normal curve. The table 
entry for each value z2 is the area under the curve to the left 
of z.2- (p. 120) 


standardized score (z-score) 
If x@ is an observation from a distribution that has known 


mean and standard deviation, the standardized value of x# 
value—mean 


yy — 


is z=value—meanstandard deviation. —_—standard deviation” A 
standardized value is often called a z2-score. (p. 95) 
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la desviacién e 
residuales (s§) 
ss)’ residuals2n—2= 
__ Laan _ oa 


n—2 n—-2 


Este valor mid 
una residual tif 
mide la distanc 
valores de y re 
de y¥ esperadc 


error estandai 
Cuando se corm 
desviacion esta 
estadistica a pa 
resultado es el 
la estadistica. F 
estima cuanto \ 
el valor de la e: 
valor que se bu 
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distribucion n 
La distribucion 
media de 0 y d 
estandar 1. (pa: 


tabla normal « 
A) 

Tabla de areas 
curva normal e 
entrada a la tab 
zz es el area de 
a la izquierda c 


puntuacion es 
Si x@ es una ol 
partir de una di 
tiene una medi. 
estandar conoc 
estandarizado 


—mediadesviac 
= valor—media 


desviaciih estAndar * 


estandarizado < 
dice puntuaci6! 


Standardized test statistic 
Calculation that measures how far a sample statistic 
diverges from what we would expect if the null hypothesis 


Hoo were true, in standardized units. That is, 


estadistica de 
estandarizada 
Mide la diverg: 
estadistica de r 


esperariamos Ss. 


Ho/Z0 fuera ve 
en unidades est 
(p. 570) decir, 

estadistica de prueb. 
estandarizada=estad 
—parametrodesviaci' 
esta distic yreaatni teretn tana 


ndardized test statistic=statistic-parameterstandard deviation 


. = statistic parameter 
standardized test statistic 


standard deviation of statistic 


Statistic 


statistic 


Number that describes some characteristic of a sample. (p. 
442) 


statistically significant 

(1) When the observed results of a study are too unusual to 
be explained by chance alone, the results are called 
statistically significant. (p. 272) 


(2) If the PP-value is smaller than alpha, we say that the 
results of a statistical study are significant at level a.@- In 


that case, we reject the null hypothesis HOo and conclude 
that there is convincing evidence in favor of the alternative 


hypothesis HaHa: (p. 559) 
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dato estadistic 
Numero que de 
caracteristica d 
(pag. 442) 


estadisticame! 
(1) Cuando los 
observados de 
demasiado atip 
decir que son c 
unicamente del 
les llama estad: 
significativos. | 


(2) Si el valor | 
alfa, se dice qu 
de un estudio e 
significativos 4 
tal caso, se recl 
nula Ho/Zo ys 
hay evidencia « 
favor de la hipc 


statistics 
The science and art of collecting, analyzing, and drawing 
conclusions from data. (p. 2) 


stemplot (also called stem-and-leaf plot) 

Simple graphical display for fairly small quantitative data 
sets that gives a quick picture of the shape of a distribution 
while including the actual numerical values in the graph. 
Each observation is separated into a stem, consisting of all 
but the final digit, and a leaf, the final digit. The stems are 
arranged in a vertical column with the smallest at the top. 
Each leaf is written in the row to the right of its stem, with 


the leaves arranged in increasing order out from the stem. 
(p. 37) 


stratified random sample 

Sample obtained by classifying the population into groups 
of similar individuals, called strata, then choosing a 
separate SRS in each stratum and combining these SRSs to 
form the sample. (p. 229) 
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alternativa. (pa 


estadistica 

La ciencia y ar 
analizar y lege 
a partir de un c 
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grafico de tall 
tambien se le « 
tallos y hojas 

Representacior 
de conjuntos di 
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pequenos que ¢ 
rapida de la for 
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observacion se 
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el ultimo digitc 
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se disponen en 
vertical en la ci 
pequeno esta ai 
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derecha del tall 
dispuestas en o 
comenzando a 
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muestra aleat 
estratificada 

Muestra que se 
clasificando la 
grupos de indix 
llamados estrai 
escoge una mu 
sencilla separa: 
y se combinan 
aleatorias senc: 


subjects 
Experimental units that are human beings. (p. 245) 


symmetric 
A graph in which the right and left sides are approximately 
mirror images of each other. (p. 32) 


t€ distribution 
Draw an SRS of size n” from a large population that has a 
Normal distribution with mean yp and standard deviation o. 
o. The statistic 
tp 
=e 


XK —usxn yn 


has the té distribution with degrees of freedomdf=n-1. 
df = n — 1. This statistic will have approximately a tn—-1 


tn—1 distribution if the sample size is large enough. té 
distributions are symmetric, single-peaked, bell-shaped 
density curves. (p. 588) 
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t=) 


conformar la 1r 
229) 


sujetos 
Unidades expe 
son seres hume 


simétrico 

Si los lados dei 
de un grafico s 
aproximados u 
dice que son si 
32) 


distribucion 
tt Se grafica wi 
aleatoria sencil 
a partir de una 
que tiene una d 
normal con me 
desviacion esta 
estadistica 
TZ — 
== 


 —ysxn yn 


tiene la distribt 
grados de liber 
df=n-1. Ff. 
tendra una dist: 
aproximada tn- 
tamano de la 
suficientement 
distribuciones - 
densidad de un 
simétricas y tie 
campana. (pag. 


tt interval for the slope pf 

Confidence interval used to estimate the slope of a 
population (true) regression line. (p. 778) For more details, 
see the inference summary on page T-16. 


tt test for the slope 

A test of the null hypothesis that there is no linear 
association between two quantitative variables. (p. 783) For 
more details, see the inference summary on page T-16. 


third quartile Q3Q3 

In a data set in which the observations are ordered from 
smallest to largest, the median of the observations whose 
position is to the right of the median. (p. 64) 


transforming 
Applying a function such as the logarithm or square root to 
a quantitative variable. (p. 795) 


treatment 

Specific condition applied to the individuals in an 
experiment. If an experiment has several explanatory 
variables, a treatment is a combination of specific values of 
these variables. (p. 245) 
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intervalo tt pi 
pe 

Intervalo de co 
usa para estime 
la linea de regr 
poblacion (real 
mas informacic 
resumen de inf 
pagina T-16. 


prueba t? par 
Prueba de la hi 
cual no hay asc 
entre dos varial 
cuantitativas. (j 
mas informacic 
resumen de inf 
pagina T-16. 


tercer cuartil 

Si las observac 
conjunto de da 
de el mas pequ 
grande, el terce 
es la media de 

cuya posicion ¢ 
de la media. (p 


transformacio 
La aplicacion c 
como el logarit 
cuadrada a una 
cuantitativa se 
transformacion 
795) 


tratamiento 

Una condicion 
les aplica a los 
experimento. S 
tiene varias val 


tree diagram 

A diagram that shows the sample space of a chance process 
involving multiple stages. The probability of each outcome 
is shown on the corresponding branch of the tree. All 
probabilities after the first stage are conditional 
probabilities. (p. 339) 


two-sample té interval for a difference between two 

means 

Confidence interval used to estimate a difference in the 
means of two populations/treatments. (p. 651) For more 
details, see the inference summary on page T-16. 


two-sample té test for the difference between two means 
A test of the null hypothesis that the means of two 
populations/treatments are equal. (p. 658) For more details, 
see the inference summary on page T-16. 


two-sample z2 interval for a difference between two 
proportions 

Confidence interval used to estimate a difference in the 
proportions of successes in two populations/treatments. (p. 
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diagrama de a 
Diagrama que | 
espacio de mue 
proceso aleatot 
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intervalo té de 
para obtener | 
entre dos med 
Intervalo de co 
usa para estime 
entre los medic 
poblaciones/tre 
651) Para mas 
el resumen de j 
pagina T-16. 


prueba t# de « 
para obtener | 
entre dos med 
Prueba de la hi 
cual las medias 
poblaciones/tre 
iguales. (pag. € 
informacion, v: 
inferencia en le 


intervalo z2 d 
para obtener | 
entre dos proy 
Intervalo de co 


626) For more details, see the inference summary on page 
T-16. 


two-sample z2 test for the difference between two 
proportions 

A test of the null hypothesis that the proportions of 
successes in two populations/treatments are equal. (p. 634) 
For more details, see the inference summary on page T-16. 


two-sided alternative hypothesis 

The alternative hypothesis is two-sided if it states that the 
parameter is different from the null value (it could be either 
smaller or larger). (p. 555) 


two-way table 
Table of counts that organizes data about two categorical 
variables. (p. 14) 


Type I error 
An error that occurs if a test rejects Hoo when Hoo is 


true. That is, the test finds convincing evidence that Halla 
is true when it really isn’t. (p. 560) 
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hipotesis alter 
La hipotesis alt 
bilateral si indi 
parametro es d 
nulo (podria se 
mas grande). (| 


tabla de doble 
Una tabla de de 
cuentas organi: 
acerca de dos v 
categorizadas. 


error Tipo I 

Error que sucet 
prueba rechaza 
Ho sea verc 
la prueba ofrec 
contundente de 
verdadera cuan 
lo es. (pag. 56C 


Type II error 

An error that occurs if a test fails to reject Hoo when Ha 
Aa js true. That is, the test does not find convincing 
evidence that Ha/4o is true when it really is. (p. 560) 


unbiased estimator 

A Statistic used for estimating a parameter is unbiased if the 
mean of its sampling distribution is equal to the true value 
of the parameter being estimated. (p. 448) 


undercoverage 
Occurs when some members of the population are less 
likely to be chosen or cannot be chosen in a sample. (p. 233) 


uniform distribution 
A distribution where the relative frequency of each possible 
value is the same. (p. 33) 


unimodal 
A graph of quantitative data with a single peak. (p. 33) 


union 
The union of events A and B, denoted by A U BA UB, 
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error Tipo IT 
Error que sucet 


prueba no rech 


incluso cuando 
verdadera. Es 
no ofrece evide 
de que Hatta 
cuando si lo es 


estimador sin 
La estadistica ¢ 
computar un pe¢ 
estimador sin s 
de distribucion 
equivale al valk 
parametro que 
computando. (] 


subcobertura 
Sucede cuando 
miembros de le 
tienen menor p 
elegidos o no p 
elegidos para u 
233) 


distribucion u 
Distribucién er 
frecuencia rela 
posible es lam 


unimodal 
Grafico de datc 
con un solo pic 


union 
La union de lo: 


consists of all outcomes in A or B or both. (p. 323) 


variability of a statistic 
Describes the variation in a statistic’s sampling distribution. 
Statistics from larger samples have less variability. (p. 451) 


variable 
Any characteristic of an individual. A variable can take 
different values for different individuals. (p. 3) 


variance of a discrete random variable oX27 
Weighted average of the squared deviations of the values of 
the variable from their mean. In symbols, 


= a ye 
229 (xi-wX)2pi7* = Do (ei - wx)’Ps 


(p. 368) 


: s2 
variance sx2°« 
“Average” squared deviation of the observations in a data 
set from their mean. In symbols, 


2__ 1 = 
=—— > (a z)* 


_. 82 
P=1n-1>' (xi-x )2 


(p. 62) 
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variabilidad d 
estadistica 

Describe la var 
muestreo de ur 
una estadistica 
la amplitud de 
su muestreo. L. 
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variable 

Toda caracteris 
individuo. Una 
tener diferente: 
diferentes indi‘ 


variacion de u 


aleatoria disci 
Promedio sope 
desviaciones cl 
valores de la vi 
su media. En si 
expresa 


oX2=Y (xi-pX)2pi 


(pag. 368) 


variacion sx2. 
La desviacion ¢ 
“promedio” de 
observaciones | 
datos segun su 
media. 


En simbolos se expt 


_— .F (xi -x ) 2 En sAmbolos s¢ 


(pag. 62) 
Venn diagrams diagramas 
A diagram that consists of one or more circles surrounded Venn Diagram. 
by a rectangle. Each circle represents an event. The region uno o mas circ! 
inside the rectangle represents the sample space of the dentro de un re 
chance process. (p. 322) circulo represe 


interior del rec 
representa el es 
del proceso ale 


voluntary response sample muestra de re: 
A sample that consists of people who choose to be in the voluntaria 

sample by responding to a general invitation. Voluntary Muestra compt 
response samples are sometimes called self-selected personas que, < 
samples. (p. 224) invitacion gene 
formar parte de 
muestras de res 
voluntaria se ci 
como muestras 


autoseleccion. 
wording of questions terminologia ¢ 
An important influence on the answers given in a survey. Influyen en gre 
Confusing or leading questions can introduce strong bias, las respuestas ¢ 
and changes in wording can greatly change a survey’s sondeo. Toda fF 
outcome. Even the order in which questions are asked capciosa 0 que 
matters. (p. 234) respuesta pued 


proceso un ses} 
cambios en lat 
pueden modific 
marcada los re: 
sondeo. Inclus« 
se pueden ver < 
orden en que st 
preguntas. (pag 
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yY intercept 

Suppose that y¥ is a response variable (plotted on the 
vertical axis of a graph) and x® is an explanatory variable 
(plotted on the horizontal axis). A regression line relating y 
Y to x@ has an equation of the form y“=b0+b1x. 


Y = bo + 62. In this equation, the number b0%0 is the yy 
intercept, the predicted value of y¥ when x=0.£ = 0. (p. 
181) 
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interceptacion 
yY Supongamc 
variable de res] 
en el eje vertic. 
variable explic 
en el eje horizc 
regresion que r 
x tiene una e 
y=b0+b1x...... 
ecuacion, el nu 
interceptacion 
proyectado de 
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Index 


Addition rule, 317 
general addition rule, 318-321 


usage, 470 


Additive, impact (matched pairs design), 260 

Alternative distribution, 600 

Alternative hypothesis, 554, 710, 727 
evidence, strength (measurement), 559-560 


favoring, 558 


Alzheimer’s disease, treatment, 597-598 


American Community Survey (ACS) 
household data, 2 


nonresponse rate, 233 


Apartment cost, conditions (checking), 649-650 


Apgar scores 
discrete random variables, 363 


mean, finding/interpretation, 367 


probability distribution, histogram, 364f 

variation, standard deviation (finding/interpretation), 369 
Apgar, Virginia, 363 
Approximate sampling distribution, 444 
Areas 

calculation, 796 


determination, 122-127 


normal distribution values, 125-126 
values, finding, 127-131 


Association, 19 
causation, implication (absence), 21—22 
conditional relative frequencies (Titanic disaster), 20—21 


negative linear association, 177 


strength/consistency, 277 
term, usage, 168 


Astronomical units (AUs), measurement, 807 
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Attendance audit, SRS selection (Table D usage), 228 
Auto care, CLT (usage), 477-478 
Average income/fertility rate, relationship, 185f 


B 


Background music 
analysis, 727 
types, 730, 733 
Backpack, sampling distribution, 462-463 
Back-to-back stemplot, 39 
resting pulse rates, 39f 
Bar graphs, 10 
axes, drawing/labeling/scaling, 10 
communication method, 5f, 10f 
creation, 11 
process, 10 
histograms, contrast, 45 
interpretation, 11 
Titanic travel classes, 21f 
Baseball teams 
monetary expenditures, 155-156 


payroll, relationship, 166, 167f, 168f, 200-201 


wins, mean number, 167 


Bathwater, transformations (effect), 387 


Batteries 
boxplot, 586f 


conclusions, making, 559 


conditions, checking, 586 


histogram, 586f 
lifetimes 

probability plot, 586f 

test power, interpretation, 598 
mean lifetime, 585 
point estimators, usage, 495-496 
power, 599-600 
P-value, 588f 
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stemplot, 586f 
Batteries, standards, 440 
Bayes, Thomas, 340 
Bayes’s theorem, 340 
Beads 
conditions, checking, 513-514 


population proportion, 510 
Beta, 770 

significance test, 783—784 
Bias, 224 

avoidance, 271 


biased sampling methods, 224—225 


estimator usage, 453f 


response bias, 234 
Biased estimators, 448—450 
Binomial coefficient, 407—408 
calculation, 408 


Binomial distributions, 406 


computation, 418 
description, 412-418 
histogram, 420f 


normal approximation, 420—422 

sampling, relationship, 419-420 
Binomial mean/variance formulas, origin, 417 
Binomial probabilities 

calculation, 406-412 

distribution, graphing, 413 

finding, process, 411 

formula, 408 
Binomial random variables, 361, 402, 403-406 


mean/standard deviation, 415—41 


probability distribution 
appearance, 412-413 
histogram, 413f 


probability histogram, 414f 


standard deviation, 416 
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Binomial settings, 403-406 


conditions, 404 
BINS (Binary Independent Number Same), 404 
Biological populations, characteristics, 116 
Blinding, 249-250 
Block, 257 
Blocking, usage, 259 
Blood pressure 


means, difference (significance test), 659-660 


treatment groups, simulated random reassignments (dotplot), 662f 
Blood type inheritance, binomial probability (calculation), 409 
Blue M&M’S®, avoidance (probability rules), 317-318 
Boaty McBoatface, name (vote), 224—225 
Bones (health), P-value interpretation, 557 
Book of Odds, The, 302 
Books (reading) 

critical value/confidence interval, calculation, 516 


Ll, one-sample t interval, 535-536 
Bottled water, tap water (contrast), 360 


binomial distribution, description, 414, 416 


Boundary value, 120 
standardization, 123, 125 
usage, 558 


Boxplots 
creation process, 68, 69-70, 73 
creation, visual illustration, 68f 
five-number summary, determination, 69 
interpretation, 68, 69—70 
median, marking, 69 
outliers, identification, 69 
pumpkin selection, 69—70 
usage, 71—73 
whiskers, drawing, 69 

Brand names 
importance, 627 


preference, conditions (checking), 625 
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Breakfast 
breakfast cereal calories, dotplots/summary statistics, 132f 


consumption, standardized test statistic/P-value (calculation), 632—633 


C 


Caffeine 
comparison, completely randomized design (outline), 255f 
dependence (reality), mean difference (significance test), 680-681 
experiment, 247—249 
analysis, 273 


pulse rate 
change, 272 
relationship, 251 


Calcium 
means, difference (significance test), 659-660 
P-value, interpretation, 557 
treatment groups, simulated random reassignments (dotplot), 662f 
Cancer, causes, 277 
Candy color, distribution, 708—709 
Candy grab 
prediction, 178-179 
scatterplot, display, 154f 


slope/y intercept, interpretation, 182 


s/r2 interpretation, 191—192 
values, 152 


Candy machine, sampling distribution, 459-460 


CareerStart, effectiveness, 276 
Cars, value (loss), 176, 177f 
Categorical data 


analysis, 9 
display, 10-12 
Categorical variables, 3, 152 


data analysis, 13-16 
relationships, 17-22, 740—741 
Causation 


correlation, connection, 163 
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establishment, challenges, 277-27 


implication 
absence, 21—22 


association, relationship, 201 


Cause and effect 
inference, 682 
questions, 277 
understanding, 244 
Cause, plausibility, 278 
Census, 221 
Census At School, 3—4 
Random Data Selector, 336 


biased/unbiased estimators, 448-449 
comparison, 461, 469 
measurement, 54-57, 366—367 
skewness/outliers, effect, 58 
Central limit theorem (CLT), 474-478, 646 
usage, 477-478 
Challenger disaster, multiplication rule, 345 
Chance, 298 
process, 335, 403 


Chance behavior, predictability, 301 


Chebyshev’s inequality, 119 
Children (hunger) 
hypotheses (stating)/conditions (checking), 631 
SRS, 631 
Chi-square distributions, 715 
density curves, 715f 
P-values, relationship, 714—717 


Chi-square test statistic, 711—714 


calculation, expected count division (reason), 713 
expected counts, relationship, 728-732 
values, dotplot, 714f 
Chi-square tests, 708, 709 
execution, 717-721 
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expected counts (calculation), two-way table (data basis), 730 
performing, 735-736 
P-values, finding, 716 
selection, 747—748 
usage, 746—751 
Chocolate 
correlation, 165-166 


milk, concussions, relationship, 256 
Cholesterol 
levels, impact, 127 


P1 — P2 Significance test, performing, 635-636 


Church attendance, life expectancy (relationship), 277 
Claim (evaluation), sampling distribution (usage), 445 


Cluster sampling, 230 


strata, difference, 231 
Clusters, 230 
display, absence, 70 


Coefficient of determination r2, 188-192 


interpretation, 191—192 


s value, relationship, 191 
Coffee, drinkers (identification), 302 
Coin toss 

fairness, 299-301 


proportion, 301f 


randomness, investigation, 302—303 


scenario, 335 
College 
attendance, p sampling distribution calculations, 464—465 


cost, constant (multiplication/division effect), 385 
Communication method, distribution 

bar graph, usage, 5f 

histogram, 45 
Comparison 

experiment design, 247-249, 253 


number, execution, 728 
Complement, 316, 323 
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tule, 316 
Completely randomized designs, 255-256 
outline, 255f 


Computer regression output 
interpretation, 192—194 


t values, 779 


Conclusions, making, 558—560 
Concussions 
chocolate milk, relationship, 256 


stemplots, creation/interpretation, 38 
Conditional probability, 342, 406, 598 

calculation, 332-333 

defining, 330-334 

independence, relationship, 330, 334-337 


tree diagrams, relationship, 338-343 
usage, 332 

Conditional relative frequency, 17 
association, Titanic disaster, 20—21 
Titanic disaster, 17—18 

Confidence intervals, 495, 496—499, 552 
applet, 499-500, 502f, 503-504 
calculation, 516, 650 
construction, 511-518, 534-538 

conditions, 513, 625 


(ee) 


four-step process, 517-519 


information, increase, 579 
interpretation, 498, 499, 517—51 


90% confidence interval, 653 


one-sided tests, connection, 597 
Pi — P2, 625-630 
two-sided tests, relationship, 595-597 
usage, 595-596 

Confidence level, 497-498 
adjustment, 503 


critical value, 538 
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decrease, 504 


interpretation, 499-502 
Confidentiality, 279 
Confounding, 243 


remedy, 248 


Constant 
addition/subtraction, 382 
effect, 98-99, 382-384 
multiplication/division, 382 
effect, 100-101, 384-386 


Consumers Union, experiment, 682 


Continuous random variables, 361, 371-374 
probabilities, finding, 372 
probability distribution, 371, 372f 

Control, 252—253, 259 
groups, 248-249 


Convenience sampling, 223-224 


Coronary heart disease (CHD) 
development, anger category (bar graph), 741f 
status, 742 
Correlation, 153 
calculation, 166-168 
causation, connection, 163 
cautions, 163-166 


coefficient, 160 


estimation, 161 

explanatory variables/response variables, contrast, 168 
facts, 168-169 

impact, 164-165 

interpretation, 162 

knowledge, 197-201 


linear association, 160—162 


negative correlation, 168 


regression applet, usage, 163-164 


resistance, absence, 197—201 
usage, 161f, 164, 197-201 
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Correlation r, 160 


calculation, 166 
Craft stick problem, 447—448 
Critical value, 511 
calculation, 515, 516, 529-531 
dependence, 504—505 
finding, 515f 
Crying/IQ (relationship), beta (significance test), 783—784 
Crying peaks, 784 


Cumulative frequency, 93 

Cumulative relative frequency graphs, 92—95 

Current Population Survey (CPS), U.S. Census Bureau, 232, 269 
income questions, 441 


interviews, 440 


Customer satisfaction, sample size (determination), 520 


D 


Data 
analysis, 4—5 
categorical variables, 13-1 


collection, 746 


display, usefulness, 46 

dotplot, five-number summary (inclusion), 68f 
ethics, 278-279 

graph, 132 

histogram, 132 

limitations, 575-57 


organization, 2—5 

plotting, 109 

set 
measurement units, change, 97-98 
range, 60 

skewness, 131 


summary statistics, 132 


tests, performing, 728 
transformation, 97-103, 795-79 
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unrepresentative data, production, 223-224 


values 
inaccuracy, 68 
plotting, 155 
Degrees of freedom (df), 529, 588f, 746 
Density curves, 90, 109-110 
description, 111-113 


dotplot modeling, 110 


mean, balance point, 112f 


median, 112 
modeling, 110f 
randomness, 111 
shapes, 111f 


symmetric density curve, midpoint balance, 112 


usage, 529f 
Dependent variables, 153-154 
Diameter at breast height (DBH), 652-653 


Diamonds 
pricing, 186 
SATs, connection, 154 
Die rolls, 33 
hypotheses/expected counts/chi-square statistic, 712—713 
return, P-value (finding), 716—717 
Discrete random variables, 361, 362—364 
analysis, 364-365, 370 
mean (expected value), 366-367 


median, finding, 367 
probability distribution, 362 
standard deviation, 368—371 


variance, 368-371 


Dissolved oxygen (DO) 
Ht, significance test (performing), 592-593 
one-sample t test, Minitab output, 594f 


Distance, equation, 797 
Distracted driving, 274 
experiment, 274—275 
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Distribution, 4 
appearance, interval selection (impact), 43 
comparison, 36—37 
boxplots, usage, 71—73 
description, 34—35, 102 
median, 92 
normal distributions, 90, 109, 113-116 


percentile, 92, 118 
shape, 100 
normal density curve display, 113f 
skewed to the left, 32 
skewed to the right, 32 


variability, 98 

Dotplots, 5, 30—32 
comparative dotplots (PVC pipe length), 59f 
creation, process, 31—32 
interpretation, 31—32 
Old Faithful eruptions, display, 33f, 41f 
score dotplot, 91f 


soccer goals, 31f 


Double-blind experiment, 249 
Drink size, x 1-x 2%1 ~ ©2, sampling distribution (description), 648 


Drivers, distraction, 274 
Drop height, 776 


E 


Earnings, variation, 392—393 


ebooks/print books, readership (tree diagrams/probability), 341 
Effect size, 602 
Elections 


confidence interval, 499 
poll, confidence level (interpretation), 501 


Empirical rule, 117 


English (speaking), homogeneity (chi-square test), 738—739 
Entertainment media, teen use, 220 


Environmental club membership, bar graph, 20 
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Equal-areas point, 112 


Equal proportions, test, 718—719 
Equal-width intervals, selection, 41 
Error 
data, conversion, 100-101 
distribution, 99, 101 


margin. See Margin of error. 
standard error, 513 


Estimator 
biased estimators, 448—450 
bias/variability, 453f 
point estimator, 495 
selection, 452-453 
unbiased estimators, 448-449, 469 
Events, 314-315 
complement/intersection/union, 323f 


independent events, 335 

mutually exclusive events, 320f 
addition rule, 317 

occurrence, 316 

probability, 316 

relationship, 322f 


representation, 322f 
two-way tables, 320f 


Expected cell counts, impact, 750—751 


Expected counts 
calculation, 711 


two-way table, data basis, 730 


division, reason, 713 


observed counts, comparison, 711—714 


Expected value, 366-367 


Experimental design, principles, 253, 254 


Experimental unit, 245, 248 
Experiments, 241 
blocking, usage, 259 
design, 247-253 
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explanatory variables, usage, 246-247 
inference, 272—275, 627-629 
language, 245-247 
observational studies 

contrast, 242—245 

relationship, 244 


usage, 269 

vocabulary, 246 

yield, worthlessness, 248 
Explanatory variables, 153-154, 242 

change, 773 


effect, observational studies, 243 


response variables, contrast, 168 
usage, 246-247 


values 


association, 277 


increase, 799 


variability, increase, 775 
Exponential models, 803-806 
Extrapolation, 178 
Extrasensory perception (ESP), testing, 403 


F 


Facebook, financial incentives, 244 


Facebook/Instagram 
conditional probability, calculation, 333 


general addition rule, 321 
Venn diagrams/probability, 324 
Factor, 246 
Factoring, 631 
Federalist Papers, authors (identification), 222-223 
Feet/height 
least-squares regression line, calculation, 195 


relationship, scatterplot, 196f 
Feet, usage, 192-193 
Female president (vote), hypotheses/expected counts/chi-square test statistic, 
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731-732 
Fertility rate/average income (relationship), scatterplot/ 
residual plot (usage), 185f 
First quartile, 64 
Fisher, Ronald A., 560 
Fishing (transformation) 
logarithms (usage), power models, 800-801 
powers, usage, 797—798 
roots, usage, 799 
Fit, summary, 192f, 194 


Five-number summary, 68 


determination, 69 
dotplot, combination, 68f 
Five-second rule, 246—247 
Fixed significance level, usage, 559 
Football players, weighing (sampling), 271, 275 
Ford F-150s 
analysis, 187 
data, 181 


regression line, addition (scatterplots), 183f 


least-squares regression line, 184, 192f 
price/miles 

residual plot, 186f 

scatterplot, 177f, 186f 


price prediction, regression line (usage), 177f 


scatterplot, regression line (addition), 188f 
value 
loss, 176, 177f 


slope, confidence interval, 779-780 


Form (measurement), correlation (impact), 164 


Free throws 
confidence interval, 552-553 


proportion, 553f 
P-value, 556 
Frequency histogram (sales tax rate), 43f 


Frequency table, 9, 92—93 
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Fuel economy 
distribution, description, 35 
ratings, dotplot, 58f 


G 


Galton, Francis, 196—197 
Gapminder, 795 
Gaps, display (absence), 70 


Gasoline prices, variation, 440 

Gauss, Carl Friedrich, 116 

Gender/handedness, independence (investigation), 336 
General addition rule, 318-321 

General multiplication rule, 337—338 


General Social Survey (GSS), 656 
Geographic region, indication, 795f 


Geometric distributions, 423 


Geometric probability 
calculation, 423-425 
formula, 423 

Geometric random variable(s), 361, 402, 422-426 
mean (expected value), 426 


probability distribution, histogram, 425f 
Geometric setting, 422 
Gesell Adaptive Score, 198 
least-squares regression lines, 199f 
data, changes, 199-200 
linear model, residual plot, 199f 
scatterplot, 199f 
Gladwell, Malcolm, 718 
Goals scored 
dotplot, 58f 


mean, calculation, 55 


median, calculation, 57 
outliers, identification, 67 
Golden ticket parking lottery, 306—308 
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Goldfish, p\1-pA2P 1 — Po sampling distribution (description), 624 
Goodness of fit, chi-square test, 708, 709, 718 
expected counts, calculation, 711 


follow-up analysis, 720—721 
performing, 719-720 
conditions, 717 
P-value, 715f 
Goodness of fit, test, 733 
Grade Point Average (GPA) 
margin of error, factors, 505 


normal/large sample condition, 533-534 
Graphs, 12-13 
usage, 30 


Guesses 
constant, multiplying/dividing (effect), 100—101 


dotplots/summary statistics, 99f, 101f 


H 
Hand span, scatterplot, 154f, 180, 182, 19 
Harris Interactive survey, 625, 626 


-_— 


Hawaii Pineapple Company 
confidence intervals, 595-596 
hypotheses, stating, 555 
two-sided tests, 595-596 


Head circumference data, stemplots, 39 


Heart attacks, p; — p> significance test (performing), 635-636 
Height 
distribution, example, 119 


prediction, feet (usage), 192-193 
Height/feet 
least-squares regression line, calculation, 195 


scatterplot, 196 
Height/strides, correlation, 169 
Helicopter experiment 

conditions, checking, 775-776 
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parameters, estimation, 777—778 
Helsinki Heart Study, 635-637 
treatment groups, Pp; — P> values, 637f 


Hiring discrimination, 6 
Histograms, 40-44, 771f 
axes, scaling, 41 
bar graphs, contrast, 45 
bars, drawing, 41 
creation process, 41-44 
equal-width intervals, selection, 41 
interpretation, 42 
intervals, selection, 43 
Old Faithful eruptions, 41f 
table, creation, 41 
usage, 45-46 
word lengths, comparison, 45f 
HIV testing, probability (finding), 345-346 
Hockey birthdays, equal proportions (test), 718—719 


Homogeneity 
chi-square test, 728, 738-740 


follow-up analysis, 739-740 


performing, conditions, 733 
tests, 708 
conditions/P-values, 733-737 


hypotheses, stating, 727—728 


Household size, distributions (comparison), 36 
Hypotheses, stating, 554, 630, 655, 710 
Hypothesis test, 552 


I 
Identical twins (pairs), IQ scores 
dotplot, 674f 
parallel dotplots, 673f 
Identical twins (intelligence), mean difference (confidence interval), 677-678 
iMac, introduction/survey, 12-13 
Independence 
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absence, problem, 346-347 
chi-square test, 741—746 


performing, 726 
performing, conditions, 743 
conditional probability, relationship, 330, 334—337 


investigation, 336 

tests 
conditions/calculations, 743—744 
expected counts, 742 
hypotheses, stating, 741—742 


Independent events, 335 
multiplication rule, 344-347 


Independent random samples, 647 
data, 625 


Independent random variables, 390 


difference, standard deviation, 3923 

sum, standard deviation, 391 
Independent SRSs, selection, 624f, 648f 
Independent variables, 153-154 
Individuals, 2 


random assignment, 276 
Inference, 6—7, 270-271 

scope, 275-277 

usage, 272-275 
Influential points, usage, 200-201 


Informed consent, 279 
Instagram/Facebook 
conditional probability, calculation, 333 


general addition rule, 321 


Venn diagrams/probability, 324 


Institutional review boards, 279 


Intensity, equation, 797 
Interest, outcomes, 403 
Interquartile range (IQR), 63-65 
constancy, 98 
determination, 64-65 
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Intersection, 323 


Interval 
confidence intervals, 495, 496-499 


estimate, 496 


selection, 43 
usage, 497 
Interviewer gender (impact), conditions/P-value/conclusion, 734—735 
Inverse t (usage), 531 
Iowa Test of Basic Skills (ITBS) vocabulary scores, 113-117 
calculation, 121-122 
distribution, 114-116, 119-120 


description, 68—95—99.7 rule (usage), 118 


percentile, estimation, 127 


histogram, 113f 
level, proportion (normal distribution estimation), 121f 
90th percentile (display), normal distribution (usage), 127f 


normal curve, usage, 120f 
obtaining, 118 

quantitative data sets, 115 
seventh-grade scores, 123, 125-126 


normal distribution approximation, 125f 


proportion, normal distribution estimation, 124f 
test score data, 120 
IQ scores 
crying (relationship), beta (significance test), 783-784 
dotplots/normal probability, 134f 
normal distribution, 132—133 


J 


Jeep tours, probability distribution (display), 365 
JMP statistical software, 192f 
Joint relative frequency, 15 


calculation, 15-16 


K 
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Kepler, Johannes, 808 


L 


meeting, 570, 632 


satisfaction, 462 


usage, 573 
Law of large numbers, 301-302 


Least-squares regression, 176 


computation, 778 
lines, 770f 
dotplot, 771f 


resistance, absence, 197—201 


Least-squares regression line, 183-184, 191 
calculation, 184, 186, 192f 
feet/height relationship, 195 


calculation process, summary statistics (usage), 194 
point, addition, 198 


properties, investigation, 198 
Left skew, function, 112 
Levels, 246 
Life expectancy 


church attendance, relationship, 277 
scatterplot, 795f 


log(income per person), contrast, 796f 


Linear association, 164 
measurement, 160—162 


Linear model, appropriateness (determination), 185-188 


Linear regression, 768 
inference, 769 


Linear relationship 
description 


correlation/regression lines, usage, 197 


correlation, usage, 164 
direction, 160 


strength/direction measurement, correlation (usage), 161f 
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Linear transformations, 796 


Linearity (achievement), transformation (usage), 795, 809 

LINER (Linear Independent Normal Equal SD Random), 774—775 
LinRegTTest, p > 0, 785 

Location 


distribution description, 91 
measurement 
percentiles, usage, 91-92 
z-scores, usage, 95-97 
Logarithms, usage, 800-802 
Lower back pain treatment, p; — pz confidence interval, 628 
Luck, 422 


Lucky Day game, geometric probabilities (calculation), 423-424 


M 


Magnets, impact, 250 


Malaria, prevention, 246, 248 
Mammograms, tree diagrams/conditional probability, 342 
Margin of error, 271, 498, 538 

capability, 518 

decrease, 504 


examination, Confidence intervals applet (usage), 503-504 
factors, 502—505 


limitations, 505 


sample size, 519 

pl estimation, 539 
size, 651 
usage, 539 


Marginal relative frequency, 14 
calculation, 15-16 
Matched pairs design, 259-261 
Math (impact), paired data (analysis), 674-675 
Mean, 54—56 
balance point, 56 
calculation, 55, 469 


confidence interval, construction (conditions), 533 
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discrete random variable, 366—367 
display, 114 
finding/interpretation, 367 


median 


comparison/contrast, 58-59, 112 


relationship, dotplots (usage), 58f 
mystery mean, 494—495 


one-sample t test, 592 


performing, 594 


population mean, estimation, 525 
regression, 194—197 


standard deviation, 119 


values, regression, 196-197 


Mean difference 
confidence interval, 677-678 


construction, conditions, 676 


one-sample t interval, 677 

one-sample t test, 679, 680 

paired t interval, 677 

paired t test, 679, 680 

significance test, performing (conditions), 679 


Mean price, usage, 190f 


Means 
addition rules, usage, 470 


comparison, 645, 673 
Means, differences 
confidence interval, construction, 654 
conditions, 649 
dotplots, usage, 273f 


sampling distribution, 645-648 
significance test, performing, 656, 660-661 
two-sample ¢ interval, 650, 651 
two-sample t test, 658 

Median 
calculation, 57 
finding, 367 
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marking, 69 

mean, comparison/contrast, 58-59, 112 
mean (relationship), dotplots (usage), 58f 
measurement, 57 


Medium/large drink, x 1-x 2%1 ~ ©2, sampling distribution (description), 648 


Mileage (improvement), matched pairs design, 260 

Minitab statistical software, 192f 

M&M’S®, color distribution, 708-717 
observed/expected counts, bar graph, 712 


Monkeys (counting), sample size (determination), 539 
Movie attendance, mean/standard deviation, 470 
pl 

confidence interval, 99% level, 596 

confidence interval, construction, 534-538 

estimation, conditions, 532—534 


one-sample t interval, 535-536 


one-sample t test, 591-595 
significance test, 585-591 
significance test, performing, 592-593 


Hy — Ha confidence intervals, 649-655 


Hy — Ho Significance tests, 655-658 


calculations, 657 


hypotheses (stating)/conditions (checking), 655-656 
standardized test statistic/ 
P-value, 657 
[1 — Hz two-sample t-test, 658-664 


Hai¢e confidence intervals, 676-679 


Haire Significance tests, 679-682 


Multiple analyses, usage (limitations), 604 
Multiplication rule, 344—347 
general multiplication rule, 337-338 


Music 
background music, types, 730 
conditions (entrée distributions), relative frequency (bar graphs), 727f 
impact, paired data (analysis), 674-675 

Mutually exclusive, 317 
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Mutually exclusive events, 319 
addition rule, 317 
Mystery mean, 494—495 


N 

NASCAR cards, cereal boxes (relationship), 305-306 

National Health and Nutrition Examination Survey (NHANES), 645-646 
National Highway Traffic Safety Administration (NHTSA), seat belt report, 220 
Nausea drug interaction, test power (factors), 602 


Negative correlation, 168 
Negative linear association, 177 


Nonlinear relationships, strength (measurement), 164 


Non-normal distributions, 131, 135 
Non-normal population, sample mean (sampling distribution), 475 


Nonresponse, 233 
Nonsmokers, two-sided test, 577—578 
Normal approximation, usage, 464—465 


Normal curve, 113 
mean/standard deviation, display, 114f 
sketching, 115, 119 
usage, 120f 
Normal density curve, 125 
display, 113f 
usage, 116-117 
Normal distributions, 90, 109, 113-118, 421 


approximation, 125f 


areas, finding, 119-127 


process, 122 


values, determination, 128 
drawing, 122, 123, 129, 373 
estimation, 124f 
facts, 114 


importance, 116 


mean, standard deviation, 123 


mean/variance, 396 
modeling, 645-646 
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sampling distribution models, 571f 
shape, 114 
sketching, 115 


specialness, 116-117 


standard normal distribution, 120 


symmetry, 118 
usage, 127f 


values, areas (determination), 121-122, 129 


Normal/Large Sample condition, 532 


allowance, 591 
checking, 533-534 

Normal population, sampling, 471-474 
distribution, examination, 471-472 


sample mean, sampling distribution, 472 
Normal probability distributions, 373, 487f 
Normal probability plots, 133-136 

creation, 136 


interpretation, 135 


linear/curved form, 134 

nonlinear forms, 134 

usage, 135, 136 
Normal quantile plot, 133 


Normal random variables 
combining, 394—397 
histograms, 395f 

Normality, assessment, 131—136 


process, normal probability plot (usage), 135 
deviation, 603 


general form, 630 
homogeneity usage, 727 


restaurant experiment, 729 


Null value, alternative parameter value (difference), 600 
Numbers, usage, 54 
Numerical summaries 

calculation, 109 
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computation, 66 


technology, usage, 65-66 


O 


Obama, Barack, age (percentile), 94 
Observational studies, 241—242 
experiments 
contrast, 242—245 
relationship, 2434 


Observed counts, expected counts (comparison), 711—714 
Observed values, standardization, 97 
Occurrence, indication, 68 
Oil changes, CLT (usage), 477-478 
Old Faithful eruptions 
durations 

dotplots, 33f, 41f 

histogram, 41f 

scatterplot, 769f 


interval, scatterplot, 769f 
sampling, 768—769 
SRS, scatterplots/least-squares regression line, 770f 
subpopulation, 773-774 
“1 in 6 wins” game, 298, 305 
One-sample t interval, 677 
One-sample t test, 591-595 
Minitab output, 594f 


N 


One-sample z test for a proportion, 572—57 


One-sided alternative hypothesis, 554—555 


One-sided tests, confidence intervals (connection), 597 
Outcomes, 372 

collection, 315 

likelihood, 315 

outcome of interest, 403 

probability, 316 
Outliers, 34 

effect, 58 
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identification, 66—69 

influence, 198 

usage, 200-201 
Outliers (Gladwell), 718 


P 


P1 —P2 confidence intervals, 625-630 


endpoints, 629 
formula, usage, 626 
lower back pain treatment, 628 
P1 — P2 significance tests, 630-634 
calculations, 631—633 
P1 — P2 two-sample z test, 634-637 


Pain (repulsion), magnets (usage), 250 
Paired data 
analysis, 673-676 


means, comparison, 619, 673 


two samples, contrast, 683-685 
Paired t interval, 677 
Parameters, 442—443 

estimation, 776—778 


Part-time jobs, standardized test statistic/P-value (calculation), 571-572 
Pass the Pigs (game), 403 
Payroll 

baseball teams, relationship, 166, 167f, 168f, 200-201 


wins, correlation, 162 
Peaks, display (absence), 70 
Penny 

date, proportion, 440-441 

distribution, 443f 
Percentiles, 90 


finding/interpreting, 92 
usage, 91—92 
z-scores, connection, 121 


Period, equation, 797 
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Physicians’ Health Study, experimental design (principles), 254—255 
Pictograph, 12—13 
Pie chart, 10 

example, 10f, 11f 


Pierced ears (user identification) 
probability example, 331 
two-way tables/probability, usage, 319 
Placebo, 244 
effect, 249-250 
Planet, model (selection), 807—808 
Plants, mean number, 230 


Plausible, term (meaning), 497 
Point estimate, 495, 498 
Point estimator, 495 
usage, 495-496 
Political polls, response rates, 233 


Pooled two-sample t procedures, avoidance, 663 
Pooled two-sample t statistic, 663 

Pooling, function, 663-664 

Popcorn, blocking (usage), 259 


Population, 221 
distribution, 446 
shape, 469 
normal population, sampling, 471-474 
regression line, 770 
samples, relationship, 221—222 
shapes, sample sizes (sampling distributions), 476f 
slope, marking, 771f 


variance, biased estimator, 450 


Population mean, 585 


confidence interval construction, 537—538 
estimation, 525 


one-sample t interval, 529 


one-sample z interval, 526 


one-sided tests, confidence intervals (connection), 597 


Population parameter 
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capture, confidence interval (usage), 501 


confidence intervals, simulation, 526, 528f, 532 
estimation, 451 
Population proportion 


confidence interval, construction, 518 


estimation, 510 
one-sample interval, 511 
tests, 568 
Potatoes, production 
proportion, significance test, 574-575 
Type I/II errors, 561 
Power, 598, 602 
calculation, 602 


increase, 601 
model, 796, 800-801 
finding, 802 
form, 800 
Type IJ error, relationship, 599 


Type I/II errors, connection, 601f 
Power of a test. See Tests 
Practical importance, statistical significance (relationship), 603-604 
Prediction, 177-179 

avoidance, 178 

candy grab, 178-17 


errors, addition, 183 


Price, reference, 177 
Probability, 298, 299, 318-321 


conditional probability, 330 
defining, 299-300 
determination, 344 

finding, 315, 345-346, 372 
idea, 299-304 
interpretation, 302 

models, 314-315, 361 

plot, 771f 

rules, 313, 316-318 
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tree diagrams, relationship, 341 


usage, 319 

Venn diagrams, relationship, 322-325 
Probability distribution, 361, 362 

constant, adding/subtracting (effect), 383 


construction, 391 
continuous random variable usage, 372f 
display, 365 
histogram, 364f, 383, 425f 
Proportions 
comparison, 619, 621, 748—750 


conditions, significance test (performing), 569 


equal proportions, test, 718—719 


Large Counts condition, 568 
one-sample z test, 573, 576-577 
sample proportion, 622 
combination, 632 
significance test, 574—575 


Proportions, difference 


confidence interval, construction, 629 


conditions, 625 


sampling distribution, 621-624 
significance test, performing, 633-634 


conditions, 630 


two-sample z interval, 626 
two-sample z test, 634 

PROVE-IT experiment, 619 

pP sampling distribution, 459-464 
binomial count, relationship, 463 
calculations, 464—465 


conditions, violation, 514-515 


lon) 
N 
No 
| 
N 
(ee) 


confidence interval 
construction, 511-516, 517-518 


interpretation, 517-51 


description, 624 


estimation, conditions, 511-514 
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mean, 622 


formulas, origin, 623 
normal approximation, usage, 464—465 
one-sample z test, 572—577 


significance test, performing, 568-572 


simulation, 622f 

standard deviation, formulas (origin), 623 
Pulse rate. See Standing/sitting pulse rates 

change, 252 

effect, absence, 273f 


Pumpkin selection, boxplots (usage), 69—70 
P-values, 631-633, 657 
calculations, 570—572, 587-590 


chi-square distributions, relationship, 714—717 


computation, t distributions (usage), 590-591 
display, 571f, 588f 

estimation, 637 

finding, 716-717 


limitations, 589 


interpretation, 556—560 


PVC pipe length, comparative dotplots, 59f 


Q 


Quantitative data 
category grouping, 750 
description, numbers (usage), 54 
display, graphs (usage), 30 
distribution, 102, 112 
description, 113 
sets, 115 
Quantitative table, frequency table, 92-93 


Quantitative variables, 3, 152 


distribution 
description process, 34 


skewness, 131 


requirements, 168 
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values, 525 
Quartiles, 64 
identification process, 66 


Questions, wording, 234 
Quiz scores, 33 


R 


r2 values, 189-192 
interpretation, 191—192 
role, 188-192 

Random assignment, 251—253, 275 
process, 251 


purpose, 272 
Random condition, 512, 532, 573 
impact, 591 
Random Data Selector, 336 
Random digits, usage, 306 


Random numbers 
distribution, 111 


sequence, generation, 111 


Random reassignments (simulation), p; — pz values (dotplot), 637f 
Random sampling, 225-226, 275 

methods, 229-232 

work, inference methods, 636-637 
Random variables, 361, 362, 368 

combining, 381, 388-390 

transforming, contrast, 393-394 
defining, 361-362 


difference, mean (expected value), 389 


normal probability distribution, 387f 
normal random variables, combining, 394—397 
sum/difference 
mean (expected value), 388-390 
standard deviation, 390-394 
transforming, 381, 382-388 
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Randomized block designs, 256—261 
outline, 257f 

Randomized experiments, random sampling work (inference methods), 636-637, 
662 

Randomness 


investigation, 302—303 
understanding, 302-304 
Range, 59-60 


estimation, 449 


Regression 
analysis, 798-799, 801 
applet, usage, 163-1 


aS 


[o>) 


inference, conditions, 773-776 
knowledge, 197-201 
least-squares regression, 176 
model, 773f 


output, interpretation, 192—1 


1ee) 


population regression line, 770 


sample regression line, 770 
standard error, 777 
Regression line, 176 
addition, 188f 
interpretation, 181—182 
scatterplot, 177f 


slope, interpretation, 182 
usage, 177f, 197-201 
Relationships, display, 154-156 
Relative frequency, 93 
table, 9, 10 
usage, 45f 
Replication, 253 
Residual plots 


creation, 187 


interpretation process, 186 


patterns, examination (reason), 187 
usage, 185—188 
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fertility rate/average income, relationship, 185f 
Residual s, 188—192 
Residuals, 179-181, 780, 782 
addition, 183f 
equal variability, 775 


squared residuals, sum, 190f 
standard deviation, 188—1 


WO 


sum, minimization, 183 

Resistant, 56 

Response bias, 234, 242 

Response variables, 153-154, 243 
explanatory variables, contrast, 168 
standard deviations, 196—197 


value 
prediction, 177 
pth root, 799 


Responses, strength, 277 
Resting pulse rates, back-to-back stemplot, 39f 
Root mean squared error, 777 


Rough symmetry, 111 
Row number, 782 


S 


s value 


N 


interpretation, 191—1 


r2, relationship, 191 
role, 188-192 


standard deviation, 189 


Sales tax 
histograms, usage, 42 
rate, frequency histogram, 43f 
Sample, 221 
data, distribution, 446 
effectiveness, 225-228 
problems, 223-225 


regression line, 770 
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probability plot/histogram, 771f 
simple random sample (SRS), 225-226 


standard deviation (calculation), division (reason), 450 


stratified random sample, 229 
Sample mean (x ©) 
mean/standard deviation formulas, origin, 470 
sampling distribution, 468-470, 468f 
examination, 471—472 
mean/standard deviation, 469 
shape, 477 
Sample means, 468 


Sample proportion, sampling distribution, 446, 461-462 
Sample proportions, 458, 622 
sampling distribution, 459, 459f 


Sample size, 600 
adjustment, 503-504 


determination, 520 


margin of error, usage, 539 
increase, 504 
sampling variability, relationship, 271 
selection, 519-529, 538-540 
Sample space, 314-315 


events, relationship, 322f 
example, 339f 
outcomes, likelihood, 316 
Sample survey 
idea, 222-223 
problems, 232-235 
Sample variance, 450 
Sampling, 221 
biased sampling methods, 224—225 


binomial distributions, relationship, 419-420 
convenience sampling, 223-224 

example, 229 

heights, 444, 449 

inference, 270-271 
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methods, 231-232 


INO 


monitors, 221—222 
random sampling, 225-226 

statistical sampling, binomial distributions, 418-420 
variability, 270-271, 443 


sample size, relationship, 271 


variability, margin of error (impact), 518 
voluntary response sampling, 224 
approximate sampling distribution, 444 
approximations, 461f 

creation, 444 

description, 447-453 

mean, 461, 646 


samples, usage, 446f 


standard deviation, 462 
factors, 772 
usage, 445 
Satterthwaite, F.E., 651 
Scary movies/fear, chi-square test (selection), 747—748 


Scatterplots 
axes, label/scale, 155 


correlation, 153 
creation, 155-156 


process, 155 


data values, plotting, 155 
description, 156—160 


linear relationship, 768 
usage, 154—156 
Scholastic Aptitude Tests (SATs), normal/large sample condition, 533-534 


School assembly, sampling, 231—232 


Scores 
adjusted scores/doubled scores, graphs/numerical summaries, 100 
dotplots, 91f, 96, 98f 
report, 95 


summary Statistics, 96, 98f 
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Seat belt use, NHTSA observational study, 220 
Segmented bar graph, 29 


snowmobile distribution, 19f 


comparison, 461, 469 
description, 32—34, 364-365 


quiz scores/die rolls, 33 


determination, normal probability plot (usage), 136 


Shipments, timing (claim), 403 
Shoe ownership, JQR determination, 64—65 
Side-by-side bar graph, 19 

snowmobile distribution, 19f 
Significance level, 559-562, 600 
Significance test, 552, 554, 602 

conclusion, making, 558 


four-step process, 572 
pi usage, 585-591 
P1 —P2 significance tests, 630-634 


conditions, 630, 656, 679 
power, increase, 601 


P-value, finding, 570 

Silver, Nate, 233 

Simple random sample (SRS), 225-226 
distribution, 448f 
independent SRSs, selection, 624f 


least-squares regression lines, 770f 


mean height, sampling distribution, 474f 


sample means, simulated sampling distribution, 646f 
sample proportion, sampling distribution, 451f 
scatterplots, 770f 


selection, 621 


Table D, usage, 227-228 


selection process, 226—227 


technology, usage, 226—227 


simulation, rise, 450f 
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software, usage, 646 


taking, simulation, 526 

Simulated sampling distribution, 646f 
dotplot, 553f 

Simulation, 298, 299, 304—308 


performing, 305-307 


process, 305 
Single-blind experiment, 249 
68—95-99.7 rule, 116-119 
defining, 117 
usage, 118-119, 118f 
avoidance, 120 
Skewness, 111, 425 
effect, 58 
Slope, 181 


confidence interval, construction, 778—7 


INO 


interpretation, 182 


sampling distribution, 772 
significance test, performing, 782—786 
standard error, 777 

t interval, 778 

t test, 783 


Smartphone experiment, 258 


randomized block design, outline, 257f 
Smoking 

ADHD, confounding, 243 

impact, 277 


study, confidence interval, 579f 


Snowmobiles (impact), independence (chi-square test), 744—745 

Soccer goals, dotplot, 31f 

South Africa household size, United Kingdom household size (distribution 
comparison), 36 

Spieth, Jordan, 124 
drives, distance (proportion), 125 


values, areas (determination), 126-127 


Spinners, probability, 315 
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Spoken languages (distribution), dotplot (usage), 5f 


Squared residuals, sum, 190f 
percent reduction, calculation, 190 

Standard deviation, 368—371, 587 
calculation process, 61—62 


complexity, 62 
display, 114 
finding, 369 
formula, 623 
interpretation, 61—62, 36 


S=- 


IO 


number, interpretation, 96 
parameter, 776—777 
usage, 61-63, 469 

Standard error, 513, 587 

Standard normal distribution, 120, 121f 
usage, 121-122, 128 

Standardized score (z-score), 95 
scatterplot, 168f 


unstandardized normal distribution, usage, 122 
Standardized test statistic, 631-633, 657 

calculations, 570—572, 587-590 

equations, 587, 595 


Standing/sitting pulse rates 
comparison, 684 
parallel dotplots, 684f 
Statistical sampling, binomial distributions, 418—420 


Statistical significance, practical importance (relationship), 603-604 


Statistically significant, 272 

Statistics, 442-443 
definition, 2 
normal distributions, importance, 116 
probability distribution, 384f 
standard deviation, 511 
test, 96 

Stemplots, 37—40 
back-to-back stemplot, 39 
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creation process, 38 
head circumference data, 39f 
interpretation, 38 
Strata, 229, 230 
clusters, difference, 231 


Stratified random sample, 229 


estimates, precision, 230 
Stream health, p (significance test), 592-593 
Strength 

description, 160 


resistant measure, correlation (relationship), 165 


Students, guesses (dotplots/summary statistics), 99f, 101f 
Studies, usage, 269 
Subjects, 245 

usage, 253 


Subtraction, order (importance), 389 


Success 
condition, 406 
proportions, 624f 

Sugar, amount, 396—397 
normal distribution, 397f 


Summary statistics, 97 
usage, 194 
values, influence, 68 


Sunflowers, sampling, 229 


Surveys, 221 
sample survey, idea, 222—223 


Survival times, normality, 135 


Symmetric density curve 
mean, 112f 
median, 112f 
midpoint balance, 112 


Symmetric distribution, 32 


T 


t* critical values 
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calculation, 529-531 
finding, Table B (usage), 530 
t distributions, 588 
density curves, 529f, 588f 


modeling, 588 


symmetry, usage, 589 
usage, 590-591, 650-651 
t interval, 778 
t test, 783 
t values, 779 
Table A, usage, 121 
Table B, usage, 530-531 
Table C, usage, 734 
Table D, usage, 227—229 
Tablet quality (distribution comparison), boxplots (usage), 71—72 
Tattoos, bias project, 619-620, 627 
Tax rate (histogram), 45 


Teens 
debit cards 
binomial distribution, normal approximation, 421 


binomial distributions/sampling, 419-420 


social media, general multiplication rule, 338 
Temperatures 

parameters/statistics, 442—443 

tracking, 101-102 


allowance, 592 
Test of hypotheses, 552 
Test of significance, 552 


Test scores 
probability distribution, 384f 
scaling, constant adding/subtracting (effect), 384 
variation, 260 

Test statistic, 570, 57 


Tests, power, 597—60 


factors, 599-603, 602 


to Il 
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interpretation, 598 
Tests, usage, 603-604 
Third quartile, 64 
Three-year-olds, height 

determination, 129-130 


normal distribution, mean, 131f 
Time (effect), cause (impact), 278 
Titanic 

travel classes, bar graph, 21f 

two-way tables/conditional probabilities, 332 
Titanic disaster 

conditional relative frequencies, 17—18 

conditional relative frequencies, association, 20—21 

marginal/joint relative frequencies (calculation), 15-16 
Transformations, 97—98, 795 

connecting, 102—103 

effects, analysis, 101-102, 387 


logarithms, usage 


exponential models, 803-806 


power models, 800—802 


powers/roots, usage, 796—799 
selection, 806-810 
Treatment, 245 
Tree diagrams, 342f 


conditional probability, relationship, 338-343 
going backward, 340 

possibilities, 340f 

probability, relationship, 341 


sample space example, 339f 


Trees (comparison), [1 — 2 confidence intervals, 652-653 
Trucks 
miles driven, 779 


price/miles, scatterplot (usage), 177f 
price prediction, regression line (usage), 177f 
value 

loss, 176, 177f 
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slope, confidence interval, 779—780 
20/20 (ABC News program), blind taste test, 361 
Two-sample t interval, 650 


Two-sample t procedures, simulation studies, 651 
Two-sample z interval, 626 

Two-sample z procedures, 726 

Two-sample z test, 634-637 

Two-sided alternative hypothesis, 554-555 


Two-sided significance test, Minitab output, 579f 
Two-sided tests, 577-580, 589-590 
confidence intervals, relationship, 595-597 
usage, 595-596 
Two-variable data, 189 
Two-way tables, 14, 318-321, 620 
analysis, 22 


chi-square tests (performing), 735-736 
inference, 726 
Minitab output, 740f 
usage, 319, 332 
Type I errors, 560—562 
power, connection, 601f 
probability, 562 
Type II errors, 560—562 
power, relationship, 599, 601f 


Typing speed, 258 


U 

Unbiased estimators, 448-449, 469 
Unconditional probabilities, 406 
Undercoverage, 233 


Uniform density curve, 110f 
Union, 323, 331-333 
United Kingdom household size, South Africa household size (distribution 


comparison), 36 
Unknown o, problem, 525-531 
Unrepresentative data, production, 223-224 


2193 


Unstandardized normal distribution, usage, 122, 128 
U.S. Census Bureau 
Current Population Survey (CPS), 232, 269, 440-441 
student job study, conditions (checking), 569 


U.S. presidents, age (cumulative relative frequency graph), 93f 
interpretation, 94 


V 


Values 
determination, 127-131 


area, usage, 129-130 
usage, 178 


comparison, 461, 469 


estimator usage, 453f 
resistant measurement, 63 
size, 451-452 
Variability, measurement, 368-371 


interquartile range (IQR), 63-65 
range, 59-60 


standard deviation, usage, 61—63 
Variables, 2 

control, 252 

means/standard deviation, 166 
Variance, 368-371 

addition rules, usage, 470 


constant multiplication, impact, 386 
formulas, origin, 417 


Vending machine problem, 130 


Venn diagrams 
events, representation, 322f 
probability, relationship, 322-325 
sample space, 322f 
Video screen tension, p (confidence interval construction), 536—537 
Vitamin D, impact (study), 242-244 
Volume, 797 
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Voluntary response 
sampling, 224 


term, misuse, 233 


Voters, sampling monitors, 221—222 


WwW 


Wade Tract Preserve, 652 
Weight/length (contrast), scatterplot (usage), 798f 
Welch, B. L., 651 
Whiskers, drawing, 69 
Women, height 
comparison, 474f 


normal population, sampling, 472-473 
normal probability distributions, 373 


Word lengths, histograms, 45f 
Work week length 
hypotheses (stating )/conditions (checking), 656 
standardized test statistic/ 
P-value, calculation, 657—658 


X 


x £, See Sample mean 
x 1-x 2%1 ~ “2, sampling distribution, 647 
description, 648 


mean/standard deviation formulas, origin, 647 


Y 


y intercept, 181 


interpretation, 182 
Yellowstone National Park, 744-745 


Z, 


ZoomStat, usage, 160 


Z-SCores 
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calculation, 97 
connecting, 102-103 
determination, 96, 124 


interpretation, 96 


multiplication, 167 


percentiles, connection, 121 


usage, 95-97 
z-scores distribution 
mean, 102-103 

shape, 102 


standard deviation, 103 
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About the AP® Exam and AP® 
Exam Tips 


The AP® Statistics exam currently consists of two distinct sections: 
Multiple Choice and Free Response. Each section lasts 90 minutes and 
counts 50% of the total exam score. Periodically, The College Board!M 
modifies the composition of the exam, so it is wise to check the Ap® 
Statistics website (apcentral.collegeboard.org/courses/ap-statistics) to 
verify the breakdown before you begin your preparation for the exam. 
Formulas and tables like the ones on the next few pages are provided 
on both sections of the exam. You may use your calculator throughout the 


exam. See the AP® Exam Tips that follow for advice about how to 
maximize your score. 


Section I: Multiple Choice Your score is based only on the number of 
questions you answer correctly. So don’t leave any questions 
unanswered! 


Weighted Section I Score = Number of correct answers x 1.25 


Section II: Free Response Each free response question is scored 
holistically on a 0 to 4 scale. The score categories represent different 
levels of quality in a student’s response across two dimensions: 
Statistical knowledge and communication. 


4 Complete Shows complete statistical understanding of the 
problem; Provides a clear, organized, complete 
explanation 


3 Shows substantial statistical understanding of the 
Substantial problem; Provides a fairly clear and organized, 
almost complete explanation 


2 Shows some statistical understanding of the problem; 
Developing Provides somewhat clear and organized, incomplete 
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explanation 


1 Minimal Shows limited statistical understanding of the 
problem; Provides minimal or unclear explanation 


0 Shows little or no statistical understanding of the 
problem; Provides no meaningful explanation 


Weighted Section II Score = (Sum of scores on Questions 1-5) x 1.875 
+ Question 6 score X 3.125 


Composite Score = Weighted Section I Score + Weighted Section IT 
Score 


Composite scores (on a 100-point scale) are converted to AP® Scores 
(on a 1 to 5 scale) using cutoffs determined each year based on 
statistical analysis of overall student performance on the exam. In 2012, 
for example, it took 33 points to earn a 2, 44 points to earn a 3, 57 
points to earn a 4, and 70 points to earn a 5. 


Chapter 1 


e If you learn to distinguish categorical from quantitative variables now, it 
will pay big rewards later. You will be expected to analyze categorical 
and quantitative variables correctly on the AP® Statistics exam. 


e When comparing groups of different sizes, be sure to use relative 
frequencies (percents or proportions) instead of frequencies (counts) 
when analyzing categorical data. Comparing only the frequencies can be 
misleading, as in this setting (page 18). There are many more people 
who never use snowmobiles among the non-environmental club 
members in the sample (445) than among the environmental club 
members (212). However, the percentage of environmental club 
members who never use snowmobiles is much higher (69.5% to 36.4%). 
Finally, make sure to avoid statements like “More club members never 
use snowmobiles” when you mean “A greater percentage of club 
members never use snowmobiles.” 


e Always be sure to include context when you are asked to describe a 
distribution. This means using the variable name, not just the units the 
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variable is measured in. 


e When comparing distributions of quantitative data, it’s not enough just 
to list values for the center and variability of each distribution. You have 
to explicitly compare these values, using words like “greater than,” “less 
than,” or “about the same as.” 


e If you’re asked to make a graph on a free response question, be sure to 
label and scale your axes. Unless your calculator shows labels and 
scaling, don’t just transfer a calculator screen shot to your paper. 


e The formula sheet provided with the AP® Statistics exam gives the 
sample standard deviation in the equivalent form sx=1n-1) (xi-x )2. 


$= Vai > (x; — Zz)’. 


e You may be asked to determine whether a quantitative data set has any 
outliers. Be prepared to state and use the rule for identifying outliers. 


e Use statistical terms carefully and correctly on the AP® Statistics exam. 
Don’t say “mean” if you really mean “median.” Range is a single 
number; so are Q1,@1 ’ Q3,93:; and IQR. Avoid poor use of language, 
like “the outlier skews the mean” or “the median is in the middle of the 
IQR.” Skewed is a shape and the [QR is a single number, not a region. If 
you misuse a term, expect to lose some credit. 


Chapter 2 


e Students often do not get full credit on the AP® Statistics exam 
because they only use option (ii) with “calculator-speak” to show 
their work on Normal calculation questions—for example, 
normalcdf(—1000,6,6.84,1.55). This is not considered clear 
communication. To get full credit, follow the two-step process (page 
122), making sure to carefully label each of the inputs in the 
calculator command if you use technology in Step 2: 
normalcdf(lower: -1000~10900 upper: 6, mean: 6.84, SD:1.55). 

e As noted previously, to make sure that you get full credit on the Ap® 
Statistics exam, do not use “calculator-speak” alone—for example, 
invNorm(0.90,6.84,1.55). This is not considered clear 
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communication. To get full credit, follow the two-step process (page 
129), making sure to carefully label each of the inputs in the 
calculator command if you use technology in Step 2: invNorm(area: 
0.90, mean: 6.84, SD:1.55). 

e Never say that a distribution of quantitative data is Normal. Real- 
world data always show at least slight departures from a Normal 
distribution. The most you can say is that the distribution is 
“approximately Normal.” 

¢ Normal probability plots are not included on the AP® Statistics topic 
outline. However, these graphs are very useful for assessing 
Normality. You may use them on the AP® exam if you wish—just be 
sure that you know what you’re looking for. 


Chapter 3 


e When you are asked to describe the association shown in a 
scatterplot, you are expected to discuss the direction, form, and 
strength of the association, along with any unusual features, in the 
context of the problem. This means that you need to use both variable 
names in your description. 

e If you are asked to make a scatterplot, be sure to label and scale both 
axes. Don’t just copy an unlabeled calculator graph directly onto your 
Paper. 

e When asked to interpret the slope or y intercept, it is very important to 
include the word predicted (or equivalent) in your response. 
Otherwise, it might appear that you believe the regression equation 
provides actual values of y. 

e When displaying the equation of a least-squares regression line, the 
calculator will report the slope and intercept with much more 
precision than we need. There is no firm rule for how many decimal 
places to show for answers on the AP® Statistics exam. Our advice: 
decide how much to round based on the context of the problem you 
are working on. 


Chapter 4 
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e If you’re asked to describe how the design of a sample survey leads to 
bias, you’re expected to do two things: (1) describe how the members 
of the sample might respond differently from the rest of the 
population, and (2) explain how this difference would lead to an 
underestimate or overestimate. Suppose you were asked to explain 
how using your statistics class as a sample to estimate the proportion 
of all high school students who own a graphing calculator could result 
in bias. You might respond, “This is a convenience sample. It would 
probably include a much higher proportion of students with a 
graphing calculator than in the population at large because a graphing 
calculator is required for the statistics class. So this method would 
probably lead to an overestimate of the actual population proportion.” 

e If you are asked to identify a possible confounding variable in a given 
setting, you are expected to explain how the variable you choose (1) 
is associated with the explanatory variable and (2) is associated with 
the response variable. 

e If you are asked to describe a completely randomized design, stay 
away from flipping coins. For example, suppose we ask each student 
in the caffeine experiment (page 252) to toss a coin. If it’s heads, then 
the student will drink the cola with caffeine. If it’s tails, then the 
student will drink the caffeine-free cola. As long as all 20 students 
toss a coin, this is still a completely randomized design. Of course, 
the two groups are unlikely to contain exactly 10 students because it 
is unlikely that 20 coin tosses will result in a perfect 50-50 split 
between heads and tails. 


The problem arises if we try to force the two groups to have equal 
sizes. Suppose we continue to have students toss coins until one of 
the groups has 10 students and then place the remaining students in 
the other group. In this case, the last two students in line are very 
likely to end up in the same group. However, in a completely 
randomized design, the last two subjects should only have a 50% 
chance of ending up in the same group. 

e Don’t mix the language of experiments and the language of sample 
surveys or other observational studies. You will lose credit for saying 
things like “use a randomized block design to select the sample for 
this survey” or “this experiment suffers from nonresponse because 
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some subjects dropped out during the study.” 


Chapter 5 


e On the AP® Statistics exam, you may be asked to describe how to 
perform a simulation using rows of random digits. If so, provide a 
clear enough description of your process for the reader to get the 
same results from only your written explanation. Remember that 
every label needs to be the same length. In the golden ticket lottery 
example (page 306), the labels should be 01 to 95 (all two digits), not 
1 to 95. When sampling without replacement, be sure to mention that 
repeated numbers should be ignored. 

e Many probability problems involve simple computations that you can 
do on your calculator. It may be tempting to just write down your 
final answer without showing the supporting work. Don’t do it! A 
“naked answer,” even if it’s correct, will usually be penalized on a 
free response question. 

e You can write statements like P(AIB)? (A|B) if events A and B are 
clearly defined in a problem. Otherwise, it’s probably easier to use 
contextual labels, like P(E)? (I/F) in the preceding example (page 


333). Or you can just use words: P(Instagram|Facebook) 
P(Instagram|Facebook) 


Chapter 6 


e If the mean of a random variable has a non-integer value but you 
report it as an integer, your answer will not get full credit. 

e If you are asked to calculate the mean or standard deviation of a 
discrete random variable on a free response question, you must show 
numerical values substituted into the appropriate formula, as in the 
previous two examples (pages 367 and 369). Feel free to use ellipses 
(...) if there are many terms in the summation, as we did. You may 
then use the method described in Technology Corner 12 to perform 
the calculation with 1-Var Stats. Writing only 1-Var Stats L1, L2 and 
then giving the correct values of the mean and standard deviation will 
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not earn credit for showing work. 

e Students often do not get full credit on the AP® Statistics exam 
because they only use option (ii) with “calculator-speak” to show 
their work on Normal calculation questions—for example, 
normalcdf(68,70,64,2.7). This is not considered clear communication. 
To get full credit, follow the two-step process (page 373), making 
sure to carefully label each of the inputs in the calculator command if 
you use technology in Step 2: normalcdf(lower:68, upper:70, 
mean:64, SD: 2.7). 

e Don’t rely on “calculator speak” when showing your work on free 
response questions. Writing binompdf (5,0.25,3)=0.08789 
(5, 0.25, 3) = 0.08789 will not earn you full credit for a binomial 

probability calculation. At the very least, you must indicate what each 

of those calculator inputs represents. For example, 

“binompdf(trials:5,p:0.25,x value:3) =0.08789.”= 0.08789. ” 


Chapter 7 


e Many students lose credit on the AP® Statistics exam when defining 
parameters because their description refers to the sample instead of 
the population or because the description isn’t clear about which 
group of individuals the parameter is describing. When defining a 
parameter, we suggest including the word all or the word true in your 
description to make it clear that you aren’t referring to a sample 
Statistic. 

e Terminology matters. Never just say “the distribution.” Always say 
“the distribution of [blank],” being careful to distinguish the 
distribution of the population, the distribution of sample data, and the 
sampling distribution of a statistic. Likewise, don’t use ambiguous 
terms like “sample distribution,” which could refer to the distribution 
of sample data or to the sampling distribution of a statistic. You will 
lose credit on free response questions for misusing statistical terms. 

e Make sure to understand the difference between accuracy and 
precision when writing responses on the AP® Statistics exam. Many 
students use “accurate” when they really mean “precise.” For 
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example, a response that says “increasing the sample size will make 
an estimate more accurate” is incorrect. It should say that increasing 
the sample size will make an estimate more precise. If you can’t 
remember which term to use, don’t use either of them. Instead, 
explain what you mean without using statistical vocabulary. 

e Notation matters. The symbols p,x ,n,p,u,0,up/,op,ux , 

P» Py TP) Hy Hy» %p>Hz> and ox 77 all have specific and different 
meanings. Either use notation correctly—or don’t use it at all. You 
can expect to lose credit if you use incorrect notation. 

e Many students lose credit on probability calculations involving x = 
because they forget to divide the population standard deviation by n. 
Vv". Remember that averages are less variable than individual 
observations! 


Chapter 8 


e When interpreting a confidence interval, make sure that you are 
describing the parameter and not the statistic. It’s wrong to say that 
we are 95% confident the interval from 0.613 to 0.687 captures the 
proportion of U.S. adults who admitted they would experience 
financial difficulty. The “proportion who admitted they would 
experience financial difficulty” is the sample proportion, which is 
known to be 0.65. The interval gives plausible values for the 
proportion who would admit to experiencing some financial difficulty 
if asked. 

e Ona given problem, you may be asked to interpret the confidence 
interval, the confidence level, or both. Be sure you understand the 
difference: the confidence interval gives a set of plausible values for 
the parameter and the confidence level describes the overall capture 
rate of the method. 

e Ifa free response question asks you to construct and interpret a 
confidence interval, you are expected to do the entire four-step 
process. That includes clearly defining the parameter, identifying the 
procedure, and checking conditions. 

e You may use your calculator to compute a confidence interval on the 
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AP® Statistics exam. But there’s a risk involved. If you just give the 
calculator answer with no work, you’ll get either full credit for the 
“Do” step (if the interval is correct) or no credit (if it’s wrong). If you 
opt for the calculator-only method, be sure to complete the other three 
steps, including identifying the procedure (e.g., one-sample z interval 
for p) and give the interval in the Do step (e.g., 0.19997 to 0.26073). 

e If a question on the AP® Statistics exam asks you to construct and 
interpret a confidence interval, all the conditions should be met. 
However, you are still required to state the conditions and show 
evidence that they are met—including a graph if the sample size is 
small and the data are provided. 

e It is not enough just to make a graph of the data on your calculator 
when assessing Normality. You must sketch the graph on your paper 
to receive credit. You don’t have to draw multiple graphs—any 
appropriate graph will do. 


Chapter 9 


e Hypotheses always refer to a population, not to a sample. Be sure to 
state HOM0 and Ha/o in terms of population parameters. It is never 
correct to write a hypothesis about a sample statistic, such as 
HO:p=0.8040 : P = 9.80 or Ha:x 431.40 :  F 31. 

e We recommend that you follow the two-sentence structure from the 
example (page 559) when writing the conclusion to a significance 
test. The first sentence should give a decision about the null 
hypothesis—reject HO/0 or fail to reject HO0—based on an 
explicit comparison of the PP-value to a stated significance level. 
The second sentence should provide a statement about whether or not 
there is convincing evidence for Ha/e in the context of the problem. 

e When a significance test leads to a fail to reject HO/0 decision, as in 
the preceding example (page 574), be sure to interpret the results as 
“We don’t have convincing evidence for Ha.” 4a.” Saying anything 
that sounds like you believe Ho/Zo is (or might be) true will lead to a 
loss of credit. For instance, it would be wrong to conclude, “There is 
convincing evidence that the true proportion of blemished potatoes is 
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0.08.” And don’t write responses as text messages, like “FTR the 
Ho.” 40.” 

e You can use your calculator to carry out the mechanics of a 
significance test on the AP® Statistics exam. But there’s a risk 
involved. If you give just the calculator answer with no work, and one 
or more of your values are incorrect, you will probably get no credit 
for the “Do” step. If you opt for the calculator-only method, be sure 
to name the procedure (one-sample z test for a proportion) and to 
report the test statistic (z)=1.15)(2) = 1.15) and PP-value (0.1243). 

e When making a conclusion in a significance test, be sure that you are 
describing the parameter and not the statistic. In the preceding 
example (page 578), it’s wrong to say that we have convincing 
evidence that the proportion of students at Yanhong’s school who 
said they have never smoked differs from the CDC’s claim of 0.68. 
The “proportion who said they have never smoked” is the sample 
proportion, which is known to be 0.60. The test gives convincing 
evidence that the proportion of all students at Yanhong’s school who 
would say they have never smoked a cigarette differs from 0.68. 

e It is not enough just to make a graph of the data on your calculator 
when assessing Normality. You must sketch the graph on your paper 
and make an appropriate comment about it to receive credit. You 
don’t have to draw more than one graph—a single appropriate graph 
will do. 

e Remember: If you give just calculator results with no work, and one 
or more values are wrong, you probably won’t get any credit for the 
“Do” step. If you opt for the calculator-only method, name the 
procedure (one-sample t test for p/) and report the test statistic 
(t)=-0.94)(¢) ~ —0.94) degrees of freedom df=14,4f = 14, and PP- 
value (0.1809). 


Chapter 10 


e The formula for the two-sample z interval for p1—p2?1 ~ P2 often 
leads to calculation errors by students. As a result, your teacher may 
recommend using the calculator’s 2-PropZInt feature to compute the 
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confidence interval on the AP® Statistics exam. Be sure to name the 
procedure (two-sample z interval for p1—p2?1 ~ P2) in the “Plan” step 
and give the interval (-0.311,0.116)(—9-311, 0.116) in the “Do” step. 
e The formula for the two-sample z statistic for a test about p1—p2 
Pi ~ P2 often leads to calculation errors by students. As a result, your 
teacher may recommend using the calculator’s 2-PropZTest feature to 
perform calculations on the AP® Statistics exam. Be sure to name the 
procedure (two-sample z interval for p1—p2?1 ~ P2) in the “Plan” step 


z= 1.17 and P-value 


and report the standardized test statistic (z=1.171 
(0.2427) in the “Do” step. 

e The formula for the two-sample t interval for 11-21 ~ #2 often 
leads to calculation errors by students. Also, the interval produced by 
technology is narrower than the one calculated using the conservative 
method. As a result, your teacher may recommend using the 
calculator’s 2-SampTInt feature to compute the confidence interval. 
Be sure to name the procedure (two-sample ¢ interval for 1-2 
#1 ~ #2) in the “Plan” step and give the interval (3.9362, 17.724) and 
df (55.728) in the “Do” step. 

e The formula for the two-sample t statistic for a test about p1—-p2 
#1 ~ #2 often leads to calculation errors by students. Also, the P- 
value from technology is smaller and more accurate than the one 
obtained using the conservative method. As a result, your teacher may 
recommend using the calculator’s 2-SampTTest feature to perform 
calculations. Be sure to name the procedure (two-sample t test for 
1—-p21 ~ #2) in the “Plan” step and to report the standardized test 
statistic (t=1.60(¢ = 1-60 p_value (0.0644), and df (15.59) in the “Do” 
step. 


Chapter 11 


e The formula for the chi-square test statistic is included on the formula 
sheet that is provided on the AP® Statistics exam. However, it 
doesn’t include the word count: 


X2=>; ( Observed—Expected)2Expected 
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| . (Observed—Expected)? 
lia Expected 


We included the word count to emphasize that you must use the 
observed and expected counts—not the observed and expected 
proportions—when calculating the chi-square test statistic. 


When checking the Large Counts condition, be sure to examine the 
expected counts, not the observed counts. And make sure to write and 
label the expected counts on your paper or you won’t receive credit. 
You can use your calculator to carry out the mechanics of a 
significance test on the AP® Statistics exam. But there’s a risk 
involved. If you just give the calculator answer with no work, and one 
or more of your values is incorrect, you will likely get no credit for 
the “Do” step. We recommend writing out the first few terms of the 
chi-square calculation followed by “.. .” . This approach might help 
you earn partial credit if you enter a number incorrectly. Be sure to 
name the procedure (chi-square test for goodness of fit) and to report 
the test statistic (y2=11.2), 0° = 11.2) 
(df= 3) and P-value (0.011). 

As with chi-square tests for goodness of fit, the expected counts 
should not be rounded to the nearest whole number. While an 
observed count of entrées ordered must be a whole number, an 
expected count need not be a whole number. The expected count 
gives the average number of entrées ordered if HO is true and the 
random assignment process is repeated many times. 

In the “Do” step, you aren’t required to show every term in the chi- 
square test statistic. Writing the first few terms of the sum followed 
by “. ..” is considered as “showing work.” We suggest that you do 
this and then let your calculator tackle the computations. 

You can use your calculator to carry out the mechanics of a 


’ degrees of freedom (df=3) 


significance test on the AP® Statistics exam. But there’s a risk 
involved. If you just give the calculator answer without showing 
work, and one or more of your entries is incorrect, you will likely get 
no credit for the “Do” step. We recommend writing out the first few 
terms of the chi-square calculation followed by “. . .”. This approach 
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may help you earn partial credit if you enter a number incorrectly. Be 
sure to name the procedure (y2=test for homogeneity) 


2 _ . . “ - . 
(x? = test for homogeneity) and to report the test statistic 


(y2=18.279),(X° = 18.279) , degrees of freedom (df=4), (df = 4); and 
P-value (0.0011). 

e Many students lose credit on the AP® Statistics exam because they 
don’t write down and label the expected counts in their response. It 
isn’t enough to claim that all the expected counts are at least 5. You 
must provide clear evidence. 

e When the P-value is very small, the calculator will report it using 
scientific notation. Remember that P-values are probabilities and 
must be between 0 and 1. If your calculator reports the P-value with a 
number that appears to be greater than 1, look to the right, and you 
will see that the P-value is being expressed in scientific notation. If 
you claim that the P-value is 4.82, you will certainly lose credit. 


Chapter 12 


e We use the same notation as the AP® Statistics exam formula sheet 
for the equation of the sample regression line y\=b0+b1x. 
y = bo + bi x. However, your graphing calculator probably uses the 
notation y\=atbx¥ = a+ 6x. Just remember: the slope is always the 
coefficient of x, no matter what symbol is used. 

e The AP® Statistics exam formula sheet gives the formula for the 
standard error of the slope as 


Sb) = 


sb1=Y(yi-yi)2n-2(xi-x )2 Vy (i= 


The numerator is just a fancy way of writing the standard deviation of 
the residuals s. Can you show that the denominator of this formula is 
the same as ours? 


e When you see a list of data values on an exam question, wait a 
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moment before typing the data into your calculator. Read the question 
through first. Often, information is provided that makes it 
unnecessary for you to enter the data at all. This can save you 
valuable time on the AP® Statistics exam. 
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Formulas for the AP® Statistics 
Exam 


Students are provided with the following formulas on both the multiple 
choice and free-response sections of the AP® Statistics exam. 


I. Descriptive Statistics 


: 
aE 


x =y xin” n 
= 6a aloo Ti—-wz 
sx=In-1¢(xi-x 2° Vi" > (#i - 2) 


| (my —1)s?+(ng —1)s3 


8p = 4! 
sp=(n1-1)s12+(n2-1)s22(n1-1)+(n2-1) * Vi (m1—1)+(n2-1) 
y\=b0+b1x¥ = 50 + bie 


b Yo (ei-2)—-(y-y) 
SS 
b1=)\(xi-x )-(yi-y ))\(xi-x )2 d (zi-2) 


b0=y -bix 6% =9- hz 


a ri—- ae | 
r=1n-1)(xi-x sx)(yi-y | _ d ( Sz ( 8 
yi-y sy) y 


Sy 


bl=rsysx”! a 


Sb, = 


sb1=Y(yi-yij2n-2)(xi-x 2 ydDtei-2” 
II. Probability 
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_ P(AMB) 
P(AIB)=P(AnB)P(BY B) = Pw 


E(X)=pX=yxipi® (*) = px =) Tipi 


Var(X)oX2=¥ (xipX)2piV a (X) o% =L(wiex) Pi 


If XX has a binomial distribution with parameters nn and p,?: then: 


eee (;;) pi(1—p)"* 


P(X=k)=(nk)pk(1-p)n-k 
uxX=nphx = np 

oX=np(1-p)°* = np (1 ~ p) 
upA=pir ~ P 


/ p(1—p) 


op\=p(1-p)n’? ~ V® 


If x £ is the mean of a random sample of size nv from an infinite 
population with mean p and standard deviation o,7> then: 


ux =pHs = # 
ox =on Mis 


III. Inferential Statistics 


Standardized test statistic=statistic-parameterstandard deviation of statistic 
statistic—parameter 


Standardized test statistic =— ——_—_—_______—___ 
standard deviation of statistic 
Confidence interval: statistic+(critical value) : (std. deviation of statistic) 
Confidence interval : statistic + (critical value) - (std. deviation of statistic) 
Single-Sample 


Statistic Standard Deviation of Statistic 
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Sample Mean 


Sample Proportion 


Statistic 


Difference of sample 
means 


Difference of sample 
proportions 


oO 


onv” 


/pU—p 
p(i-pynV on 


Two-Sample 


Standard Deviation of Statistic 


o12n1+022n2 "1 "2 


Special case whenol=o2o01n1+1n2 
Special case when 


a; = 02 


p1(1-p1)n1+p2(1-p2)n2 
jeri) Pa-24) 
ny ng 
Special case whenp1=p2p(1—-p)1n1+1n2 
Special case when 


Pi = P2 


} Thy ng 


Chi-square test statistic=>'(observed—expected)2expected 


Chi-square test statistic = ee 


(observed— expected)? 


expected 
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Probability | 


Table entry for z is the area under the standard Normal curve to the left of 


Z. 


0003 
0005 
.0007 
.0010 
0013 


0019 


0026 
0035 
.0047 


.0062 


0082 
.0107 
0139 
0179 
0228 
0287 
0359 
0446 
0548 
.0668 
.0808 
.0968 


0003 
0005 
.0007 
0009 
0013 


.0018 
0025 
.0034 
0045 


.0060 


.0080 
0104 
0136 
0174 
0222 
0281 
0351 
0436 
0537 
.0655 
0793 
0951 


0003 
0005 
0006 
0009 
.0013 


.0018 
0024 
0033 
.0044 


0059 


.0078 
0102 
0132 
.0170 
0217 
0274 
0344 
0427 
0526 
0643 
0778 
.0934 


0003 
.0004 
.0006 
0009 
0012 


0017 
0023 
0032 
.0043 


0057 


0075 
.0099 
0129 
.0166 
0212 
0268 
0336 
0418 
0516 
.0630 
.0764 
0918 
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0003 
.0004 
0006 
.0008 
0012 


.0016 
0023 
0031 
0041 


0055 


.0073 
.0096 
0125 
0162 
0207 
0262 
0329 
.0409 
0505 
.0618 
0749 
0901 


0003 
.0004 
0006 
.0008 
0011 


.0016 
0022 
.0030 
.0040 


0054 


0071 
.0094 
0122 
0158 
0202 
0256 
0322 
0401 
0495 
.0606 
0735 
0885 


0003 
.0004 
0006 
.0008 
0011 


0015 
0021 
0029 
.0039 


0052 


.0069 
0091 
.0119 
0154 
0197 
0250 
0314 
0392 
0485 
0594 
0721 
.0869 


0003 
.0004 
0005 
0008 
0011 


0015 
0021 
.0028 
.0038 


0051 


.0068 
.0089 
.0116 
0150 
0192 
0244 
.0307 
0384 
0475 
0582 
.0708 
0853 


=12 
Set 
-1.0 
—0.9 
-0.8 
—0.7 
—0.6 
-0.5 
-0.4 
U3 
-0.2 
-0.1 
—0.0 
0.0 
0.1 
0.2 
0.3 
0.4 
0.5 
0.6 
0.7 
0.8 
0.9 
1.0 
11 
1,2 
1.3 
1.4 
1.5 
1.6 
17 
1.8 
19 
2.0 
21 


L151 
1357 
1587 
1841 
.2119 
2420 
2743 
3085 
3446 
aio Pal 
4207 
4602 
.0000 
0000 
0398 
Bisse) 
.6179 
6554 
6915 
J2o7 
.7580 
.7881 
8159 
8413 
8643 
8849 
9032 
192 
9332 
9452 
9554 
9641 
9713 
0772 
9821 


se 
1335 
1562 
.1814 
.2090 
.2389 
.2709 
3050 
3409 
3783 
4168 
4562 
4960 
0040 
0438 
0832 
6217 
6591 
.6950 
J2G1 
.7611 
.7910 
.8186 
8438 
8665 
.8869 
.9049 
.9207 
9345 
9463 
9564 
.9649 
9719 
.9778 
9826 


i Be 
.1314 
Doe 
.1788 
.2061 
2358 
.2676 
3015 
Bele yi 
3745 
4129 
4522 
4920 
.0080 
0478 
0871 
6255 
.6628 
.6985 
7324 
.7642 
oo0 
8212 
8461 
.8686 
.8888 
.9066 
9222 
9357 
.9474 
9573 
9656 
9726 
.9783 
.9830 


.1093 
21292 
1515 
1762 
.2033 
2327 
.2643 
2981 
<oog0 
OT 
4090 
4483 
4880 
0120 
ols 
0910 
6293 
.6664 
.7019 
mpieys 
.7673 
.7967 
8238 
8485 
8708 
.8907 
9082 
Bo 4e) 0 
9370 
.9484 
9582 
.9664 
12 
9788 
9834 
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1075 
1271 
1492 
1736 
.2005 
.2296 
204 
.2946 
3300 
.3669 
4052 
4443 
4840 
0160 
ooo 7 
0948 
6331 
.6700 
7054 
.7389 
.7704 
Wiss 1s) 
8264 
8508 
8729 
8925 
.9099 
9251 
9382 
9495 
ook 
.9671 
9738 
9793 
.9838 


1056 
251 
.1469 
1711 
1977 
.2266 
2978 
eal? 
3264 
3632 
4013 
4404 
4801 
oL99 
22090 
25987 
.6368 
.6736 
.7088 
7422 
.7734 
8023 
8289 
8531 
8749 
8944 
9115 
9265 
9394 
9505 
9599 
.9678 
9744 
.9798 
9842 


.1038 
1230 
.1446 
.1685 
.1949 
2236 
2046 
.2877 
3228 
3094 
3974 
4364 
4761 
0239 
0636 
.6026 
.6406 
6772 
7123 
7454 
.7764 
8051 
8315 
8554 
8770 
8962 
131 
10279 
.9406 
9515 
.9608 
.9686 
9750 
.9803 
.9846 


.1020 
1210 
1423 
.1660 
922 
.2206 
2914 
.2843 
«192 
ooon, 
o030 
4325 
721 
psy hs) 
0675 
.6064 
6443 
.6808 
J 1a7 
.7486 
.7794 
.8078 
.8340 
8577 
8790 
.8980 
9147 
20292: 
9418 
9525 
.9616 
.9693 
9756 
.9808 
9850 


ae 
2.3 
2.4 
2.0 
2:6 
ar 
2.8 
2.9 
3.0 
ook 
3:2 
o:0 
3.4 


.9861 
.9893 
.9918 
9938 
9953 
9965 
9974 
9981 
.9987 
.9990 
9993 
9995 
o997 


.9864 
.9896 
.9920 
.9940 
9955 
.9966 
9975 
9982 
O07 
ooe 
9993 
9995 
9997 


.9868  .9871 
9898  .9901 
pss PP ers te P45) 
9941 .9943 
9956 = .9957 
.9967 .9968 
9976  .9977 
9982  .9983 
9987  .9988 
ool 9991 
9994 .9994 
woos 2906 
997 9997 


t* 


9875 
.9904 
oe], 
9945 
9959 
.9969 
9977 
9984 
.9988 
ooe 
9994 
.9996 
9997 


p 


Probability 


.9878 
.9906 
9929 
9946 
.9960 
.9970 
.9978 
9984 
.9989 
9992 
9994 
.9996 
9997 


.9881 
.9909 
woot 
.9948 
.9961 
O71. 
.9979 
9985 
.9989 
9992 
9994 
.9996 
9997 


.9884 
OTE 
9932 
9949 
.9962 
9972 
9979 
9985 
.9989 
902 
9995 
.9996 
O07 


Table entry for p and C is the point t* with probability p lying to its right 
and probability C lying between —t* and t* 


df 


ND UU BP WN RP 


20 
1.000 
0.816 
0.765 
0.741 
0.727 
0.718 
0.711 


.20 
1.376 
1.061 
0.978 
0.941 
0.920 
0.906 
0.896 


5 
1.963 
1.386 
1.250 
1.190 
156 
1.134 
1,119 


10 
3.078 
1.886 
1.638 
1.533 
1.476 
1.440 
1.415 


Tail probability p 


05 
6.314 
2.920 
Zoo 
232 
2.015 
1.943 
1.895 
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025 
7 
4.303 
3.182 
2/76 
Parag 
2.447 
2.365 


02 
15.89 
4.849 
3.482 
2.999 
2.707 
2.612 
2.517 


01 
oLB2 
6.965 
4.541 
3.747 
3.365 
3.143 
2.998 


005 
63.66 
3.925 
5.841 
4.604 
4.032 
3.707 
3.499 


0.706 
0.703 
0.700 
0.697 
0.695 
0.694 
0.692 
0.691 
0.690 
0.689 
0.688 
0.688 
0.687 
0.686 
0.686 
0.685 
0.685 
0.684 
0.684 
0.684 
0.683 
0.683 
0.683 
0.681 
0.679 
0.679 
0.678 
0.677 
0.675 
0.674 
50% 


0.889 
0.883 
0.879 
0.876 
0.873 
0.870 
0.868 
0.866 
0.865 
0.863 
0.862 
0.861 
0.860 
0.859 
0.858 
0.858 
0.857 
0.856 
0.856 
0.855 
0.855 
0.854 
0.854 
0.851 
0.849 
0.848 
0.846 
0.845 
0.842 
0.841 
60% 


1.108 
1.100 
1.093 
1.088 
1.083 
1.079 
1.076 
1.074 
1.071 
1.069 
1.067 
1.066 
1.064 
1.063 
1.061 
1.060 
1059 
1.058 
1.058 
1.057 
1.056 
1.055 
1.055 
1.050 
1.047 
1.045 
1.043 
1.042 
1.037 
1.036 
70% 


1307 
1.383 
1.372 
1.363 
1.356 
1.350 
1.345 
1.341 
1.337 
1.333 
1.330 
1.328 
1.325 
1.323 
32k 
1.319 
1.318 
1.316 
1.315 
1.314 
1.313 
1.3511. 
1.310 
1.303 
13299 
1.296 
1292 
1.290 
1.282 
1.282 
80% 


1.860 
1.833 
1.812 
1.796 
1.782 
Pye all 
1.761 
V7ae 
1.746 
1.740 
1.734 
1.729 
1.725 
1.721 
L717 
1.714 
L7AL 
1.708 
1.706 
1.703 
1.701 
1.699 
1.697 
1.684 
1.676 
1.671 
1.664 
1.660 
1.646 
1.645 
90% 
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2.306 
2.262 
2.228 
2.201 
279 
2.160 
2.145 
2.130 
2.120 
2.110 
2.101 
2.093 
2.086 
2.080 
2.074 
2.069 
2.064 
2.060 
2.056 
2.052 
2.048 
2.045 
2.042 
2.021 
2.009 
2.000 
1.990 
1.984 
7362 
1.960 
95% 


2.449 
2.398 
2.359 
2.328 
2.303 
2.262 
2.264 
2.249 
2230 
2,224 
2.214 
2.205 
2.197 
2.189 
2.183 
2177 
Baal We 
2.167 
2.162 
2.158 
2.154 
2.150 
2.147 
2123 
2.109 
2.099 
2.088 
2.081 
2.056 
2.054 
96% 


2.896 
2.821 
2.764 
2.718 
2.681 
2.650 
2.624 
2.602 
2.583 
2.567 
2.502 
2,039 
2.528 
2.518 
2.508 
2.500 
2.492 
2.485 
2.479 
2.473 
2.467 
2.462 
2.457 
2.423 
2.403 
2.390 
2.374 
2.364 
2.330 
2326 
98% 


Confidence level C 


3.300 
3.250 
3.169 
3.106 
3.055 
3.012 
2977 
2.947 
2.921 
2.898 
2.878 
2.861 
2.845 
2.831 
2.819 
2.807 
2/97), 
2737 
24/9 
277% 
2.763 
2.756 
2.750 
2.704 
2.678 
2.660 
2.639 
2.626 
2.581 
2.076 
99% 


Probability 


Pp 


x2 


Table entry for p is the point y* with probability p lying to its right. 


Tail probability p 
df 29 20 15 10 05 025 02 01 .005 
1 132 164 2.07 2.71 384 502 541 663 7.88 
2 277 3.22 3/9 461 399 7.38 7.82 9.21 10:60 
3 411 464 532 625 781 935 9.84 11.34 12.84 
4 5.39 5.99 6.74 7.78 949 11.14 11.67 13.28 14.86 
» 663 #729 6.12 9.24 1:07 12:83 13°39 1509 16.75 
6 784 856 9.45 10.64 12.59 14.45 15.03 16.81 18.55 
7 9.04 9.80 10.75 12.02 1407 16.01 1662 18.48 20.28 
8 10.22 11.03 12.03 13.36 15.51 17.53 18.17 20.09 21.95 
9 11.389 12.24 13.29 1468 16.92 19.02 19.68 21.67 23.59 
10 12.55 13.44 1453 15.99 18.31 20.48 21.16 23.21 25.19 
11 13.70 14.63 15.77 17.28 19.68 21.92 22.62 24.72 26.76 
12 14.85 15.81 16.99 18.55 21.03 23.34 24.05 26.22 28.30 
13. 15.98 16.98 18.20 19.81 22.36 24.74 25.47 27.69 29.82 
14 17.12 18.15 19.41 21.06 23.68 26.12 26.87 29.14 31.32 
15) 18.25 19.31 20.60 22.31 25.00 27.49 28.26 30.58 32.80 
16 «19.37 20.47 21.79 23.54 26.30 28.85 29.63 32.00 34.27 
17) 20.49 21.61 22.98 24.77 27.59 30.19 31.00 33.41 35.72 
18 21.60 22.76 2416 25.99 28.87 31.53 32.35 34.81 37.16 
19) 22.72 23.90 25.33 27.20 30.14 32.85 33.69 36.19 38.58 
20 23.83 25.04 26.50 28.41 31.41 34.17 35.02 37.57 40.00 
21 2493 26.17 27.66 29.62 32.67 35.48 36.34 38.93 41.40 
22 26.04 27.30 28.82 30.81 33.92 36.78 37.66 40.29 42.80 
23. 27.14 28.43 29.98 32.01 35.17 38.08 38.97 41.64 44.18 
24 28.24 29.55 31.13 33.20 36.42 39.36 40.27 42.98 45.56 
25 29.34 30.68 32.28 34.38 37.65 40.65 41.57 44.31 46.93 
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26 
af 
28 
29 
30 
40 
50 
60 
80 
100 


Line 
101 
102 
103 
104 
105 
106 
107 
108 
109 
110 
111 
12 
113 
114 
115 
116 
117 
118 
119 
120 
121 
122 


30.43 
31.53 
32.62 
33.71 
34.80 
45.62 
96.33 
66.98 
88.13 
109.1 


19223 
73676 
45467 
92711 
95592 
68417 
82739 
60940 
36009 
38448 
81486 
599636 
62568 
45149 
61041 
14459 
38167 
73190 
95857 
35476 
71487 
13873 


31.79 
32.91 
34.03 
35.14 
36.25 
47.27 
poe 
68.97 
90.41 
17 


95034 
47150 
71709 
38889 
94007 
35013 
97890 
72024 
19365 
48789 
69487 
88804 
70206 
32992 
77684 
26056 
98532 
32533 
07118 
99972 
09984 
81598 


33.43 
34.57 
Souf 
36.85 
37.99 
49.24 
60.35 
71.34 
=e aila! 
114.7 


05756 
99400 
77558 
93074 
69971 
15529 
20807 
17868 
15412 
18338 
60513 
04634 
40325 
75730 
94322 
31424 
62183 
04470 
87664 
39421 
29077 
95052 


35.56 
36.74 
37.92 
39.09 
40.26 
51.81 
63.17 
74.40 
96.58 
118.5 


38.89 
40.11 
41.34 
42.56 
43.77 
99.76 
67.50 
79.08 
101.9 
124.3 


28713 
01927 
00095 
60227 
91481 
72765 
47511 
24943 
39638 
24697 
09297 
FANT 
03699 
66280 
24709 
80371 
70632 
29669 
92099 
65850 
14863 
90908 
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41.92 
43.19 
44.46 
45.72 
46.98 
99.34 
71.42 
83.30 
106.6 
129.6 


96409 
27754 
32863 
40011 
60779 
85089 
81676 
61790 
85453 
39364 
00412 
19352 
71080 
03819 
73698 
65103 
23417 
84407 
98806 
04266 
61683 
USS Pe 


42.86 
44.14 
45.42 
46.69 
47.96 
60.44 
72.61 
84.58 
108.1 
red ya 


12538 
42648 
29485 
85848 
93791 
97067 
599300 
90656 
46816 
42006 
71238 
73089 
22553 
56202 
14526 
62253 
26185 
90785 
66979 
35435 
47052 
75186 


45.64 
46.96 
48.28 
49.59 
50.89 
63.69 
76.15 
88.38 
cis 
135.8 


42544 
82425 
82226 
48767 
17297 
90211 
94383 
87964 
83485 
76688 
27649 
84898 
11486 
02938 
31893 
50490 
41448 
65956 
98624 
43742 
62224 
87136 


48.29 
49.64 
50.99 
92.34 
93.67 
66.77 
79.49 
atop 
116.3 
140.2 


82853 
36290 
90056 
o2Zaso 
99335 
47487 
14893 
18883 
41979 
08708 
39950 
45785 
11776 
70915 
2502 
61181 
79932 
86382 
84826 
11937 
51025 
95761 


123 
124 
125 
126 
127 
128 
129 
130 
131 
132 
133 
134 
135 
136 
137 
138 
139 
140 
141 
142 
143 
144 
145 
146 
147 
148 
149 
150 


54580 
71035 
96746 
96927 
43909 
15689 
36759 
69051 
05007 
68732 
45740 
27816 
66925 
08421 
93645 
66831 
959988 
12975 
96767 
72829 
88565 
62964 
19687 
37609 
94973 
00694 
71546 
07511 


81507 
09001 
12149 
19951 
99477 
14227 
58984 
64817 
16632 
99259 
41807 
78416 
59658 
44753 
66812 
68908 
99404 
13258 
35964 
50232 
42628 
88145 
12633 
99057 
86278 
05977 
05233 
88915 


27102 
43367 
37823 
36809 
25330 
06565 
68288 
87174 
81194 
84292 
65561 
18329 
39100 
7I3d 7 
61421 
40772 
70708 
13048 
23822 
97892 
17797 
83083 
57857 
66967 
88737 
19664 
93946 
41267 


56027 
49497 
71868 
74192 
64359 
14374 
22913 
09517 
14873 
08796 
33302 
Ale oy 
78458 
28744 
47836 
21558 
41098 
45144 
96012 
63408 
49376 
69453 
95806 
83401 
74351 
65441 
68743 
16853 
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59892 
7F2IA9 
18442 
77567 
40085 
13352 
18638 
84534 
04197 
43165 
07051 
35213 
11206 
75992 
12609 
47781 
43563 
7/2321 
94591 
77919 
61762 
46109 
09931 
60705 
47500 
20903 
72460 
84569 


33063 
96758 
35119 
88741 
16925 
49367 
94303 
06489 
85576 
93739 
93623 
37741 
19876 
08563 
15373 
33586 
96934 
81940 
65194 
44575 
16953 
59505 
02150 
02384 
84552 
62371 
27601 
79367 


41842 
27611 
62103 
48409 
85117 
81982 
00795 
87201 
45195 
31685 
18132 
04312 
87151 
79140 
98481 
79177 
48394 
00360 
50842 
24870 
88604 
69680 
43163 
90597 
19909 
22725 
45403 
32337 


81868 
91596 
39244 
41903 
36071 
87209 
08727 
97245 
96565 
97150 
09547 
68508 
31260 
92454 
14592 
06928 
91719 
02428 
Boa s2 
04178 
12724 
00900 
58636 
93600 
67181 
93340 
88692 
03316 


Inference Summary 


Confidence intervals 
(CIs) Significance tests 


State the parameter § State the hypotheses you want to 
you want to estimate _ test and the significance level, 
and the confidence and define any parameters you 
level. use. 


Identify the Identify the appropriate inference 
appropriate inference method and check conditions. 
method and check 

conditions. 


If the conditions are _If the conditions are met, perform 
met, perform calculations. 


calculations. 
e Give the sample statistic(s). 


e Calculate the standardized test 
Statistic. 
e Find the PP-value. 


CONCLUDE: Interpret your interval Make a conclusion about the 
in the context of the hypotheses in the context of the 
problem. problem. 


Standa 


CI: statistic+(critical value) : (standard deviation of statistic) Statist 


statis 
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One-sample z2 interval for pP 


Interval (1-PropZInt) 


(8.2) 
7s 
ots \/ iv dRai 
pA+z*pA(1-p)n n 


1 
One-sample z2 test for pP? 
(1-PropZTest) 
Test “ 
(9.2) a P~PO0 
po (1—P9) 
z=p\—p0p0(1-p0)n a 
Proportions Two-sample z2 interval for pl-p2P?1 ~ P2 
(2-PropZInt) 
Interval 
(10.1) (p\1-p’2)+z*p\1(1-p/1)n1+p/2(1-p’2)n2 
~ 2, 4 /P1(l-P1) , B2(1-P2) 
( - 8) S24 
a | "9 
Two-sample z2 test for pl-p2P1 ~ P2 
2 (2-PropZTest) 
z=(p'1—p2)—Op’ (1-p’)n1 +p’ (1-p%) n 2 
Test 
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(10.1) - (py —py)—0 


where p/=total successestotal sample 
size=X1+X2n1+n2 


total successes _ X14+X 2 


where p =—————_- > 
e total sample size n ,+n9 


One-sample t? interval for pM 


Interval (TInterval) 
(8.3) 
8 
tt df=n-1 
x +t*sxn df=n-1 vn 
1 
One-sample t? test for py (T-Test) 
Test z=— i) 
(9.3) t ane df=n-1 
tx —pOsxn df=n-1 vn 
Two-sample tt interval for p1-p21 ~ #2 
(2-SampTInt) 
(x 1-x 2)+t*s12n1+s22n2 
Interval 
(10.2) 
df from technology or smaller of nl-1,n2-1 
n) —l,no-—1 
2 


Means Two-sample té test for p1-p2M1 ~ M2 


(2-SampT Test) 
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Test 
(10.2) 


t=(x 1-x 2)-0s12n1+s22n2 my ng 


df from technology or smaller of nl-1,n2-1 
nm) —l,no-1 


Paired t? interval for pdiffMdiff 


TI ] 

Interval (TInterval) 

me x diff+t*sdiffndiff df=ndiff-1 

7, = Sdiff 

Eaitt tt df = naite — 1 
V "diff 


Paired 


data a 
Paired tt test for pdiffMditt 


(T-Test) 


F 7 
a0) t=x diff-pOsdiffndiff df=ndiff-1 


Z Asee— 
= diff “0 apace 9 
Sdiff 
diff 


t 


Chi-square test for goodness of fit (y2GOF-Test) 
(x? GOF-Test) 


Test x2=) (observed—expected)2expecteddf=number of 

1 (11.1) categories—1 
‘ (observed—expected)? 
a Pe 


expected 


df = number of categories — 1 
Distribution 
ofa 


categorical 
variable Chi-square test for homogeneity 


(y2-Test) (x g -Test) 
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x2=) (observed—expected)2expecteddf=(# of rows 
—1)(# of columns-1) 
‘ (observed—expected)? 
7 = ere 


2 or Test 
more (11.2) 


expected 
df = (# of rows — 1)(# of columns — 1) 


Chi-square test for independence 


(x?-Test) 


2-Test 
Relationship " 
between 2 Test x2=> (observed-expected)2expecteddf=(# of rows 
categorical (11.2) —1)(# of columns-1) 
variables 2 
" (observed—expected) 
- yeas 


expected 
df = (# of rows — 1)(# of columns — 1) 


tt interval for the slope 


fatewval (LinRegTInt) 
12.1 : 
piat*(sEb1! +e (SEs; ) with af=n-2 
df=n-2 
Relationship 
between 2 
quantitative 
variables tt test for the slope 
] 
core) (LinRegTTest) 
Test t=b1-hypothesized slopeSEb1with df=n-2 
(12.1) b; —hypothesized slope 
0 SEp 
by 
with df = n — 2 
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Formulas for the AP® Statistics 
Exam 


Students are provided with the following formulas on both the multiple 
choice and free-response sections of the AP® Statistics exam. 


I. Descriptive Statistics 


: 
aE 


x =y xin” n 
= 6a aloo Ti—-wz 
sx=In-1¢(xi-x 2° Vi" > (#i - 2) 


| (my —1)s?+(ng —1)s3 


8p = 4! 
sp=(n1-1)s12+(n2-1)s22(n1-1)+(n2-1) * Vi (m1—1)+(n2-1) 
y\=b0+b1x¥ = 50 + bie 


b Yo (ei-2)—-(y-y) 
SS 
b1=)\(xi-x )-(yi-y ))\(xi-x )2 d (zi-2) 


b0=y -bix 6% =9- hz 


a ri—- ae | 
r=1n-1)(xi-x sx)(yi-y | _ d ( Sz ( 8 
yi-y sy) y 


Sy 


bl=rsysx”! a 


Sb, = 


sb1=Y(yi-yij2n-2)(xi-x 2 ydDtei-2” 
II. Probability 
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_ P(AMB) 
P(AIB)=P(AnB)P(BY B) = Pw 


E(X)=pX=yxipi® (*) = px =) Tipi 


Var(X)oX2=¥ (xipX)2piV a (X) o% =L(wiex) Pi 


If XX has a binomial distribution with parameters nn and p,?: then: 


eee (;;) pi(1—p)"* 


P(X=k)=(nk)pk(1-p)n-k 
uxX=nphx = np 

oX=np(1-p)°* = np (1 ~ p) 
upA=pir ~ P 


/ p(1—p) 


op\=p(1-p)n’? ~ V® 


If x £ is the mean of a random sample of size nv from an infinite 
population with mean p and standard deviation o,7> then: 


ux =pHs = # 
ox =on Mis 


III. Inferential Statistics 


Standardized test statistic=statistic-parameterstandard deviation of statistic 
statistic—parameter 


Standardized test statistic =— ——_—_—_______—___ 
standard deviation of statistic 
Confidence interval: statistic+(critical value) : (std. deviation of statistic) 
Confidence interval : statistic + (critical value) - (std. deviation of statistic) 
Single-Sample 


Statistic Standard Deviation of Statistic 
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Sample Mean 


Sample Proportion 


Statistic 


Difference of sample 
means 


Difference of sample 
proportions 


oO 


onv” 


/pU—p 
p(i-pynV on 


Two-Sample 


Standard Deviation of Statistic 


o12n1+022n2 "1 "2 


Special case whenol=o2o01n1+1n2 
Special case when 


a; = 02 


p1(1-p1)n1+p2(1-p2)n2 
jeri) Pa-24) 
ny ng 
Special case whenp1=p2p(1—-p)1n1+1n2 
Special case when 


Pi = P2 


} Thy ng 


Chi-square test statistic=>'(observed—expected)2expected 


Chi-square test statistic = ee 


(observed— expected)? 


expected 
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TI-Nspire and other technology instructions are on the Student Site at 


highschool.bfwpub.com/tps6e. 


1.Analyzing two-way tables 

2.Making histograms 

3.Computing numerical summaries 

4.Making boxplots 

5.Finding areas from values in a Normal distribution 


6.Finding values from areas in a Normal distribution 

7.Making Normal probability plots 

8.Making scatterplots 

9.Calculating least-squares regression lines 
10.Making residual plots 
11.Choosing an SRS 
12.Analyzing discrete random variables 
13.Calculating binomial coefficients 
14.Calculating binomial probabilities 
15.Graphing binomial probability distributions 
16.Calculating geometric probabilities 
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for the AP® Exam 


Written to help you start soaring to success in your statistics course and on the AP® 
Statistics exam, The Practice of Statistics for the AP® Exam, Sixth Edition, has the 
support you want and need built into the book. Here are a few of the features designed to 
give you access to the science and art of statistics: 

* Clear explanations adhere to the language and terminology used on the AP® Statistics 
exam. 

* Paired Examples and Exercises supported by video tutoria/s, and featuring 
experienced AP® Statistics teachers, offer guidance when you want help, so that you 
don't have to circle to make a landing. 

* New to this edition, Chapter Review Exercise videos offer more chances to master 
the big ideas in each chapter by taking another look at statistical concepts with in-depth 
guidance. 

* AP® Exam Tips offer “insider information” to help you develop the habits necessary for 
success. 

* Chapter AP® and Cumulative AP” Tests offer high-quality support so that you can 
practice, practice, and practice for the AP® Statistics exam—all built right into the book, 
all of them written in the authors’ unique engaging style. 

* Graphing Calculator instruction with new video tutorials for the TI-83/84 and TI- 
Nspire leads you step by step to using your “device” more effectively. Additional print 

instructions are available online. 

* New Chapter Projects and integrated Activities give you a chance to think like a 
statistician. 


Stay ahead of the flock with additional, easy-to-access help: 

* Our new digital platform with e-Book and a comprehensive online homework 
program offers the ease of access you want and the flexibility you expect, with video 
tutorials and other support available at a simple click. 

* Strive for 5: Preparing for the AP® Statistics Examination by Jason Molesky and 
Michael Legacy combines a study guide and an AP® prep guide—it’s the perfect 
companion to help you get the most out of the text. 


For information about e-Book and package options and to access open-source student 
resources, visit: www.highschool.bfwpub.com/tps6e 
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